Multi-Provider Routing
Route requests to 30+ AI providers through a single OpenAI-compatible endpoint. Supports chat, responses, audio, and image APIs.
Loading OmniRoute...
Documentation - v3.7.4
AI gateway for multi-provider LLMs. One endpoint for OpenAI, Anthropic, Gemini, DeepSeek, GitHub Copilot, Claude Code, Cursor, and 100+ more providers.
Run npx omniroute or clone from GitHub and run npm start.
Go to Endpoint -> Registered Keys. Generate one key per environment.
Add provider accounts via OAuth login, API key, or free-tier auto-connect.
Point your IDE or API client to https://<host>/v1. Use provider prefix, for example gh/gpt-5.1-codex.
Route requests to 30+ AI providers through a single OpenAI-compatible endpoint. Supports chat, responses, audio, and image APIs.
Create model combos with fallback chains and balancing strategies: round-robin, priority, random, least-used, and cost-optimized.
Automatically create optimized combos based on your connected providers, usage patterns, and model capabilities.
Integrated web search with 5 providers (Serper, Brave, Exa, Tavily, Perplexity) including analytics and cost tracking.
Real-time token counting, cost calculation per provider/model, and detailed usage breakdown by API key and account.
Visual analytics with charts for requests, tokens, errors, latency, costs, and model popularity over time.
Live health checks, provider status, circuit breaker states, and automatic rate limit detection with exponential backoff.
Persistent conversational memory with extraction, injection, retrieval, and summarization across sessions.
Extensible skill system with built-in and custom skills, sandbox execution, request interception, and context injection.
Agent Communication Protocol (ACP) registry for managing agent-to-agent workflows and tool orchestration.
Manage IDE configurations, export/import backups, discover codex profiles, and configure settings from the dashboard.
API key authentication, IP filtering, prompt injection guard, domain policies, session management, and audit logging.
127 providers across three connection types.
if/Qoder AIqw/Qwen Codegemini-cli/Gemini CLIkr/Kiro AIaq/Amazon Qcc/Claude Code/Antigravitycx/OpenAI Codexgh/GitHub Copilotgitlab-duo/GitLab Duocu/Cursor IDEkmc/Kimi Codingkc/Kilo Codecl/Clineagentrouter/AgentRouteropenrouter/OpenRouterqianfan/Baidu Qianfanglm/GLM Codingglmcn/GLM Coding (China)glmt/GLM Thinkingbcp/Alibaba Coding Plankimi/Kimikmca/Kimi Coding (API Key)minimax/Minimax Codingminimax-cn/Minimax (China)crof/CrofAIalicode/Alibabaalicode-intl/Alibaba Intlopenai/OpenAIazure/Azure OpenAIazure-ai/Azure AI Foundrybedrock/Amazon Bedrockwatsonx/IBM watsonx.ai Gatewayoci/OCI Generative AIsap/SAP Generative AI Hubmdl/Modalreka/Rekanlpc/NLP Cloudrunway/Runwayanthropic/Anthropicgemini/Gemini (Google AI Studio)ds/DeepSeekgroq/Groqbb/Blackbox AIxai/xAI (Grok)mistral/Mistralpplx/Perplexitytogether/Together AIfireworks/Fireworks AIcerebras/Cerebrascohere/Coherenvidia/NVIDIA NIMnebius/Nebius AIsiliconflow/SiliconFlowhyp/Hyperbolicnb/NanoBananaollamacloud/Ollama Cloudhf/HuggingFacesynthetic/Synthetickg/Kilo Gatewayvertex/Vertex AIvp/Vertex AI Partnerszai/Z.AIopencode-zen/OpenCode Zenopencode-go/OpenCode Goali/Alibaba Cloud (DashScope)lc/LongCat AIpol/Pollinations AIpu/Puter AIcf/Cloudflare Workers AIscw/Scaleway AIdeepinfra/DeepInfravag/Vercel AI Gatewaylambda/Lambda AIsamba/SambaNovanscale/nScaleovh/OVHcloud AIbaseten/Basetenpublicai/PublicAImoonshot/Moonshot AImeta/Meta Llama APIv0/v0 (Vercel)morph/Morphfeatherless/Featherless AIfriendli/FriendliAIllamagate/LlamaGateheroku/Heroku AIgaladriel/Galadrieldatabricks/Databricksdatarobot/DataRobotclarifai/Clarifaisnowflake/Snowflake Cortexwandb/Weights & Biases Inferencevolcengine/Volcengineai21/AI21 Labsgigachat/GigaChat (Sber)venice/Venice.aicodestral/Codestralupstage/Upstagemaritalk/Maritalkmimo/Xiaomi MiMoinet/Inference.netnanogpt/NanoGPTpredibase/Predibasebytez/Bytezaiml/AI/ML APInovita/Novita AIpi/PiAPIggo/GoAPIlz/LaoZhang AIglhf/GLHF Chatcablyai/CablyAIthebai/TheB.AIfenayai/FenayAIempower/Empowernous/Nous Researchpetals/Petalspoe/Poegitlab/GitLab Duo PATchutes/Chutes.aivoyage/Voyage AIjina/Jina AIfal/Fal.aistability/Stability AIbfl/Black Forest Labsrecraft/Recrafttopaz/TopazPoint clients to one base URL and route by model prefix (for example: gh/, cc/, kr/, openai/).
Create combo models in Dashboard and keep client config stable while providers rotate internally.
Track tokens and cost by provider, account, and API key in Usage and Analytics tabs.
https://<host>/v1/chat/completionsgh/..., cc/...)gh/./responses./chat/completions.cu/ prefix for Cursor models.cc/ (Claude) or antigravity/ (Antigravity) prefix.OmniRoute exposes two operational protocols in addition to OpenAI-compatible APIs: MCP for tool execution and A2A for agent-to-agent workflows.
Use MCP over stdio to let clients discover and call OmniRoute tools with audit visibility.
omniroute --mcpUse A2A JSON-RPC to submit tasks synchronously or via SSE streaming.
GET /.well-known/agent.json
POST /a2a (JSON-RPC: message/send | message/stream)Register and manage agents via the ACP registry for inter-agent communication and tool sharing.
Dashboard -> Agents
Dashboard -> CLI ToolsOmniRoute exposes 29 tools via Model Context Protocol for agent orchestration.
Health checks, combo management, quota monitoring, cost reporting, and model catalog access.
omniroute_get_healthomniroute_list_combosomniroute_get_combo_metricsomniroute_switch_comboomniroute_check_quotaomniroute_route_requestomniroute_cost_reportomniroute_list_models_catalogomniroute_web_searchRoute simulation, budget guards, strategy switching, resilience profiles, and provider metrics.
omniroute_simulate_routeomniroute_set_budget_guardomniroute_set_routing_strategyomniroute_set_resilience_profileomniroute_test_comboomniroute_get_provider_metricsomniroute_best_combo_for_taskomniroute_explain_routeomniroute_get_session_snapshotomniroute_db_health_checkomniroute_sync_pricingView cache statistics and flush semantic or signature caches.
omniroute_cache_statsomniroute_cache_flushSearch, add, and clear persistent conversational memory entries.
omniroute_memory_searchomniroute_memory_addomniroute_memory_clearList, enable, execute, and monitor custom skill executions.
omniroute_skills_listomniroute_skills_enableomniroute_skills_executeomniroute_skills_executions| Method | Path | Notes |
|---|---|---|
POST | /v1/chat/completions | OpenAI-compatible chat endpoint (default). |
POST | /v1/responses | Responses API endpoint (Codex, o-series). |
POST | /v1/completions | Legacy completions endpoint for text generation. |
GET | /v1/models | Model catalog for all connected providers. |
POST | /v1/embeddings | Text embedding generation (OpenAI, Cohere, Voyage). |
POST | /v1/moderations | Content moderation and safety classification. |
POST | /v1/rerank | Document reranking for retrieval-augmented generation (Cohere, Jina). |
POST | /v1/search | Web search with 5 providers (Serper, Brave, Exa, Tavily, Perplexity). |
GET | /v1/search/analytics | Analytics and metrics for search requests. |
POST | /v1/audio/transcriptions | Audio transcription (Deepgram, AssemblyAI). |
POST | /v1/audio/speech | Text-to-speech generation (ElevenLabs, OpenAI TTS). |
POST | /v1/images/generations | Image generation (NanoBanana). |
POST | /v1/videos/generations | Video generation (ComfyUI, SD WebUI workflows). |
POST | /v1/music/generations | Music generation via ComfyUI workflows. |
POST | /v1/messages | Anthropic-native messages endpoint. |
POST | /v1/messages/count_tokens | Count tokens for a given message payload. |
POST | /v1/files | File upload for multimodal inputs. |
POST | /v1/batches | Batch processing for bulk API requests. |
GET | /v1/ws | WebSocket endpoint for real-time streaming. |
POST | /chat/completions | Rewrite helper for clients without /v1. |
POST | /responses | Rewrite helper for Responses without /v1. |
GET | /models | Rewrite helper for model discovery without /v1. |
Use the provider prefix before the model name to route to a specific provider. Example: gh/gpt-5.1-codex routes to GitHub Copilot.
| Prefix | Provider | Type |
|---|---|---|
if/ | Qoder AI | Free Tier |
qw/ | Qwen Code | Free Tier |
gemini-cli/ | Gemini CLI | Free Tier |
kr/ | Kiro AI | Free Tier |
aq/ | Amazon Q | Free Tier |
cc/ | Claude Code | OAuth |
/ | Antigravity | OAuth |
cx/ | OpenAI Codex | OAuth |
gh/ | GitHub Copilot | OAuth |
gitlab-duo/ | GitLab Duo | OAuth |
cu/ | Cursor IDE | OAuth |
kmc/ | Kimi Coding | OAuth |
kc/ | Kilo Code | OAuth |
cl/ | Cline | OAuth |
agentrouter/ | AgentRouter | API Key |
openrouter/ | OpenRouter | API Key |
qianfan/ | Baidu Qianfan | API Key |
glm/ | GLM Coding | API Key |
glmcn/ | GLM Coding (China) | API Key |
glmt/ | GLM Thinking | API Key |
bcp/ | Alibaba Coding Plan | API Key |
kimi/ | Kimi | API Key |
kmca/ | Kimi Coding (API Key) | API Key |
minimax/ | Minimax Coding | API Key |
minimax-cn/ | Minimax (China) | API Key |
crof/ | CrofAI | API Key |
alicode/ | Alibaba | API Key |
alicode-intl/ | Alibaba Intl | API Key |
openai/ | OpenAI | API Key |
azure/ | Azure OpenAI | API Key |
azure-ai/ | Azure AI Foundry | API Key |
bedrock/ | Amazon Bedrock | API Key |
watsonx/ | IBM watsonx.ai Gateway | API Key |
oci/ | OCI Generative AI | API Key |
sap/ | SAP Generative AI Hub | API Key |
mdl/ | Modal | API Key |
reka/ | Reka | API Key |
nlpc/ | NLP Cloud | API Key |
runway/ | Runway | API Key |
anthropic/ | Anthropic | API Key |
gemini/ | Gemini (Google AI Studio) | API Key |
ds/ | DeepSeek | API Key |
groq/ | Groq | API Key |
bb/ | Blackbox AI | API Key |
xai/ | xAI (Grok) | API Key |
mistral/ | Mistral | API Key |
pplx/ | Perplexity | API Key |
together/ | Together AI | API Key |
fireworks/ | Fireworks AI | API Key |
cerebras/ | Cerebras | API Key |
cohere/ | Cohere | API Key |
nvidia/ | NVIDIA NIM | API Key |
nebius/ | Nebius AI | API Key |
siliconflow/ | SiliconFlow | API Key |
hyp/ | Hyperbolic | API Key |
nb/ | NanoBanana | API Key |
ollamacloud/ | Ollama Cloud | API Key |
hf/ | HuggingFace | API Key |
synthetic/ | Synthetic | API Key |
kg/ | Kilo Gateway | API Key |
vertex/ | Vertex AI | API Key |
vp/ | Vertex AI Partners | API Key |
zai/ | Z.AI | API Key |
opencode-zen/ | OpenCode Zen | API Key |
opencode-go/ | OpenCode Go | API Key |
ali/ | Alibaba Cloud (DashScope) | API Key |
lc/ | LongCat AI | API Key |
pol/ | Pollinations AI | API Key |
pu/ | Puter AI | API Key |
cf/ | Cloudflare Workers AI | API Key |
scw/ | Scaleway AI | API Key |
deepinfra/ | DeepInfra | API Key |
vag/ | Vercel AI Gateway | API Key |
lambda/ | Lambda AI | API Key |
samba/ | SambaNova | API Key |
nscale/ | nScale | API Key |
ovh/ | OVHcloud AI | API Key |
baseten/ | Baseten | API Key |
publicai/ | PublicAI | API Key |
moonshot/ | Moonshot AI | API Key |
meta/ | Meta Llama API | API Key |
v0/ | v0 (Vercel) | API Key |
morph/ | Morph | API Key |
featherless/ | Featherless AI | API Key |
friendli/ | FriendliAI | API Key |
llamagate/ | LlamaGate | API Key |
heroku/ | Heroku AI | API Key |
galadriel/ | Galadriel | API Key |
databricks/ | Databricks | API Key |
datarobot/ | DataRobot | API Key |
clarifai/ | Clarifai | API Key |
snowflake/ | Snowflake Cortex | API Key |
wandb/ | Weights & Biases Inference | API Key |
volcengine/ | Volcengine | API Key |
ai21/ | AI21 Labs | API Key |
gigachat/ | GigaChat (Sber) | API Key |
venice/ | Venice.ai | API Key |
codestral/ | Codestral | API Key |
upstage/ | Upstage | API Key |
maritalk/ | Maritalk | API Key |
mimo/ | Xiaomi MiMo | API Key |
inet/ | Inference.net | API Key |
nanogpt/ | NanoGPT | API Key |
predibase/ | Predibase | API Key |
bytez/ | Bytez | API Key |
aiml/ | AI/ML API | API Key |
novita/ | Novita AI | API Key |
pi/ | PiAPI | API Key |
ggo/ | GoAPI | API Key |
lz/ | LaoZhang AI | API Key |
glhf/ | GLHF Chat | API Key |
cablyai/ | CablyAI | API Key |
thebai/ | TheB.AI | API Key |
fenayai/ | FenayAI | API Key |
empower/ | Empower | API Key |
nous/ | Nous Research | API Key |
petals/ | Petals | API Key |
poe/ | Poe | API Key |
gitlab/ | GitLab Duo PAT | API Key |
chutes/ | Chutes.ai | API Key |
voyage/ | Voyage AI | API Key |
jina/ | Jina AI | API Key |
fal/ | Fal.ai | API Key |
stability/ | Stability AI | API Key |
bfl/ | Black Forest Labs | API Key |
recraft/ | Recraft | API Key |
topaz/ | Topaz | API Key |
Automation endpoints for proxy registry, scope assignments, and legacy proxy migration.
| Method | Path | Notes |
|---|---|---|
GET | /api/providers | List all registered provider connections. |
POST | /api/providers | Create a new provider connection. |
PUT | /api/providers/:id | Update an existing provider connection. |
DELETE | /api/providers/:id | Delete a provider connection. |
POST | /api/providers/:id/test | Test connectivity and authentication for a provider. |
GET | /api/providers/:id/models | List available models for a specific provider. |
GET | /api/settings | Retrieve current application settings. |
PUT | /api/settings | Update application settings. |
GET | /api/settings/payload-rules | Get payload transformation rules. |
PUT | /api/settings/payload-rules | Update payload transformation rules. |
GET | /api/v1/management/proxies | List saved proxy registry items (supports pagination). |
POST | /api/v1/management/proxies | Create a reusable proxy item in the registry. |
GET | /api/v1/management/proxies/health | Get 24h/rolling health metrics per saved proxy from proxy logs. |
PUT | /api/v1/management/proxies/bulk-assign | Assign or clear one proxy across many scope IDs in one request. |
GET | /api/v1/management/proxies/assignments | List proxy assignments by scope, scope_id, or proxy_id. |
PUT | /api/v1/management/proxies/assignments | Assign or clear proxy for global/provider/account/combo scope. |
POST | /api/settings/proxies/migrate | Import legacy proxyConfig maps into registry assignments. |