OmniRoute
A free, open-source AI gateway that unifies 160+ providers behind one endpoint with smart 4-tier auto-fallback, prompt compression saving 15-75% tokens, and multi-platform support.
About OmniRoute
OmniRoute is a free, open-source AI gateway that routes requests across 160+ AI providers through a single OpenAI-compatible endpoint. Its smart 4-tier auto-fallback (Subscription → API Key → Cheap → Free) is designed to keep requests flowing with zero downtime when a provider fails. It features a built-in 5-mode prompt compression pipeline that automatically saves 15-75% of tokens on every request, a 3-level proxy system for bypassing geographic restrictions, and full MCP Server and A2A Protocol support. The project is written in 100% TypeScript, backed by 4,690+ automated tests, and runs on Web, Desktop (Electron), Android (Termux), and as a PWA.
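The tier order described above can be illustrated with a minimal sketch. This is not OmniRoute's implementation: the `Provider` interface, the `routeWithFallback` function, and the tier identifiers are hypothetical names chosen for illustration, and the only detail taken from the listing is the order Subscription → API Key → Cheap → Free.

```typescript
// Hypothetical types for illustration only; not OmniRoute's actual API.
type Tier = "subscription" | "api-key" | "cheap" | "free";

interface Provider {
  name: string;
  tier: Tier;
  // Resolves with the reply text, or rejects when the provider fails.
  send: (prompt: string) => Promise<string>;
}

// Tier order as described in the listing.
const TIER_ORDER: Tier[] = ["subscription", "api-key", "cheap", "free"];

// Try providers tier by tier and return the first successful reply.
async function routeWithFallback(
  providers: Provider[],
  prompt: string,
): Promise<{ provider: string; reply: string }> {
  // Copy before sorting so the caller's array is left untouched.
  const ordered = [...providers].sort(
    (a, b) => TIER_ORDER.indexOf(a.tier) - TIER_ORDER.indexOf(b.tier),
  );
  const errors: string[] = [];
  for (const p of ordered) {
    try {
      return { provider: p.name, reply: await p.send(prompt) };
    } catch (err) {
      // Record the failure and fall through to the next provider.
      const message = err instanceof Error ? err.message : String(err);
      errors.push(`${p.name}: ${message}`);
    }
  }
  throw new Error(`All providers failed: ${errors.join("; ")}`);
}
```

In this sketch a request only reaches a cheaper tier after every provider in the tiers ahead of it has failed, and the caller sees an error only if the whole chain is exhausted.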
- Smart 4-Tier Auto-Fallback — Install via npm install -g omniroute, then configure a combo chain in the dashboard to automatically route Subscription → API Key → Cheap → Free providers with zero downtime.
- Prompt Compression (5 Modes) — Enable in Dashboard → Settings → Compression; choose from Off, Lite (~15%), Standard/Caveman (~30%), Aggressive (~50%), or Ultra (~75%) to reduce token usage on every request transparently.
- 160+ Provider Support — Connect OAuth providers (Claude Code, Codex, Gemini CLI, GitHub Copilot, Cursor), API key providers (OpenAI, Anthropic, DeepSeek, Groq, xAI, Mistral, and 120+ more), and 11 genuinely free providers including Kiro, Qoder, Pollinations, and LongCat.
- Format Translation — OmniRoute automatically translates between OpenAI, Claude, Gemini, and Responses API formats so any OpenAI-compatible tool works with any provider.
- MCP Server (29 Tools) — Run omniroute --mcp for stdio transport, or connect via SSE/HTTP; supports 10 granular permission scopes and a SQLite-backed audit trail.
- A2A Protocol — Full JSON-RPC 2.0 + SSE streaming agent-to-agent protocol with task lifecycle management and an agent card at /.well-known/agent.json.
- 3-Level Proxy System — Configure global, per-provider, or per-API-key proxies in the dashboard; includes a built-in 1proxy free marketplace with 500+ community-shared proxies and TLS fingerprint spoofing.
- 10 Multi-Modal APIs — One endpoint covers chat completions, embeddings, image generation, video, music, audio TTS/STT, reranking, moderation, and web search.
- Full Observability — Unified 4-tab log dashboard (request, proxy, audit, console), p50/p95/p99 telemetry, per-token cost tracking, budget controls, and a built-in LLM evaluation framework.
- Multi-Platform — Runs as an npm global package, Docker image (AMD64 + ARM64), Electron desktop app, Android via Termux, or as a PWA; available in 40+ languages with RTL support.
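To put the compression percentages quoted above in concrete terms, here is a rough back-of-the-envelope sketch. The mode names and approximate savings come from the listing; the `estimateTokens` helper and the lowercase mode identifiers are hypothetical, and actual savings will vary from prompt to prompt.

```typescript
// Hypothetical identifiers for the five modes named in the listing.
type CompressionMode = "off" | "lite" | "standard" | "aggressive" | "ultra";

// Approximate savings as quoted in the listing; not measured values.
const APPROX_SAVINGS: Record<CompressionMode, number> = {
  off: 0,
  lite: 0.15,
  standard: 0.3,
  aggressive: 0.5,
  ultra: 0.75,
};

// Estimate how many input tokens remain after compression in a given mode.
function estimateTokens(inputTokens: number, mode: CompressionMode): number {
  return Math.round(inputTokens * (1 - APPROX_SAVINGS[mode]));
}

// Print the estimate for a 10,000-token prompt in each mode.
for (const mode of Object.keys(APPROX_SAVINGS) as CompressionMode[]) {
  console.log(`${mode}: ~${estimateTokens(10000, mode)} tokens`);
}
```

Under these assumed ratios, a 10,000-token prompt would shrink to roughly 8,500 tokens in Lite mode and roughly 2,500 in Ultra mode.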
Pricing
Open Source (Free)
Fully free and open-source under the MIT License. Self-host via npm, Docker, Electron, or Termux. All features included at no cost.
- 160+ AI provider integrations
- Smart 4-tier auto-fallback
- 5-mode prompt compression
- MCP Server (29 tools)
- A2A Protocol
Capabilities
Key Features
- Smart 4-tier auto-fallback (Subscription → API Key → Cheap → Free)
- 5-mode prompt compression pipeline (Off / Lite / Standard / Aggressive / Ultra)
- 160+ AI provider integrations
- Format translation (OpenAI ↔ Claude ↔ Gemini ↔ Responses API)
- MCP Server with 29 tools and 3 transports (stdio/SSE/HTTP)
- A2A Protocol (JSON-RPC 2.0 + SSE streaming)
- 3-level proxy system with TLS fingerprint spoofing
- 1proxy free marketplace (500+ community proxies)
- 10 multi-modal APIs (chat, embeddings, images, video, music, TTS, STT, reranking, moderation, web search)
- OAuth PKCE auto token refresh for 8 providers
- Multi-account round-robin per provider
- Circuit breakers with exponential backoff and anti-thundering herd
- Semantic + signature two-tier cache
- Unified observability dashboard with p50/p95/p99 telemetry
- Built-in LLM evaluation framework with golden set
- Cost analytics and budget controls
- 13 routing/balancing strategies
- Cloudflare / Tailscale / ngrok tunnel support
- Desktop app (Electron), Android (Termux), PWA
- 40+ language UI with RTL support
- 4,690+ automated tests across 517 files
- WebSocket bridge (/v1/ws)
- Request deduplication and idempotency
- OpenAPI 3.0 live spec with Try-It UI
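The feature list mentions circuit breakers with exponential backoff and anti-thundering-herd protection. The sketch below shows one common way to combine those ideas, using exponential backoff with random jitter and a simple failure-count breaker. It is a generic illustration, not OmniRoute's code: `backoffDelayMs`, `CircuitBreaker`, and all thresholds and timings are assumptions.

```typescript
// Exponential backoff with full jitter: the delay is drawn uniformly from
// [0, min(capMs, baseMs * 2^attempt)). Randomising the delay spreads retries
// out so many clients do not hit a recovering provider at the same instant.
function backoffDelayMs(
  attempt: number,
  baseMs: number = 250,
  capMs: number = 30000,
  rand: () => number = Math.random,
): number {
  const ceiling = Math.min(capMs, baseMs * 2 ** attempt);
  return Math.floor(rand() * ceiling);
}

// Minimal circuit breaker: opens after `threshold` consecutive failures and
// allows a probe request once `cooldownMs` has elapsed (half-open state).
class CircuitBreaker {
  private failures = 0;
  private openedAt: number | null = null;
  private readonly threshold: number;
  private readonly cooldownMs: number;

  constructor(threshold: number = 3, cooldownMs: number = 10000) {
    this.threshold = threshold;
    this.cooldownMs = cooldownMs;
  }

  // True when the circuit is closed, or open but past its cooldown.
  canRequest(now: number = Date.now()): boolean {
    if (this.openedAt === null) return true;
    return now - this.openedAt >= this.cooldownMs;
  }

  // A success closes the circuit and resets the failure count.
  recordSuccess(): void {
    this.failures = 0;
    this.openedAt = null;
  }

  // A failure at or past the threshold (re)opens the circuit.
  recordFailure(now: number = Date.now()): void {
    this.failures += 1;
    if (this.failures >= this.threshold) this.openedAt = now;
  }
}
```

A router built along these lines would check `canRequest()` before calling a provider, skip providers whose circuit is open, and wait `backoffDelayMs(attempt)` between retries of the same provider.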