One API key. Access GPT-4o, Claude Opus, Gemini Pro, Llama 3.1, and 30+ models. OpenAI-compatible. 99.9% uptime SLA. Sub-second latency.
from openai import OpenAI

client = OpenAI(
    api_key="sk-inf-...",  # your InferGate key
    base_url="https://api.infergate.xyz/v1",  # drop-in replacement
)

response = client.chat.completions.create(
    model="claude-3-5-sonnet-20241022",  # or gpt-4o, gemini-2.0-flash, ...
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=1024,
)

print(response.choices[0].message.content)
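
Because every model sits behind the same OpenAI-compatible endpoint, switching models should only require changing the `model` string. The sketch below illustrates that by sending one prompt to several models through a single client. The helper name `ask_each_model` is hypothetical (it is not part of the InferGate or OpenAI SDKs), and it assumes each listed model accepts the same chat-completions parameters used in the snippet above.

```python
from typing import Any, Iterable


def ask_each_model(
    client: Any,
    models: Iterable[str],
    prompt: str,
    max_tokens: int = 1024,
) -> dict[str, str]:
    """Send the same prompt to each model and collect the replies by model name.

    `client` is expected to expose `chat.completions.create(...)`, as the
    OpenAI Python client does. This helper is illustrative, not an SDK function.
    """
    replies: dict[str, str] = {}
    for model in models:
        # Only the model identifier changes between requests.
        response = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
            max_tokens=max_tokens,
        )
        replies[model] = response.choices[0].message.content
    return replies
```

With the `client` from the snippet above, a call such as `ask_each_model(client, ["gpt-4o", "gemini-2.0-flash"], "Hello!")` would return a dictionary mapping each model name to its reply text. Requests are sent one after another, so total time grows with the number of models.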