LiteLLM

LiteLLM is an open-source LLM gateway (proxy server) and Python SDK that lets teams call 100+ model providers through the OpenAI API format. It adds platform features—load-balancing and fallbacks, spend tracking per key/user/team, budgets and RPM/TPM limits, virtual keys, and an admin UI. It integrates with observability stacks (Langfuse, LangSmith, OpenTelemetry, Prometheus) and supports logging to S3/GCS. Enterprise options layer on SSO/JWT auth and audit logs, plus fine-grained guardrails per project.

No discussions yet

Be the first to start a discussion about LiteLLM

Demo Video for LiteLLM

Developer

BerriAI

litellm.ai

BerriAI

𝕏LiteLLM

1 AI Tool

BerriAI is the team behind LiteLLM, an open-source LLM gateway and SDK that unifies access to 100+ providers with cost tracking, securi…read more

BerriAI developer profile

Pricing and Plans

(Open Source)

Open Source

Free

100+ provider integrations
OpenAI-compatible endpoints
Virtual keys, teams, budgets
Rate limits, load balancing, guardrails
Observability integrations (Langfuse, LangSmith, OTEL, Prometheus)

Enterprise (Cloud or Self-Hosted)

Contact for pricing

Everything in OSS
Enterprise support & custom SLAs
JWT/SSO, audit logs, advanced guardrails
Usage-based pricing; contact sales

System Requirements

Operating System

WINDOWS, MACOS, LINUX

Memory (RAM)

2GB minimum (4GB+ recommended for proxy + logs)

Processor

Modern 64-bit CPU (x86_64 or ARM64)

Disk Space

200MB+ for proxy binaries/config; additional for logs and Docker images

AI Capabilities

Unified OpenAI-format API for multiple providers

Provider routing and automatic fallbacks

Streaming responses

Spend tracking and usage attribution

Budgets, rate limiting, and quotas

Guardrails and moderation hooks

Observability and metrics export

Batching, caching, and prompt formatting

← Back to all tools

Stats on LiteLLM

Related Tools

Osaurus

Local Inference

Osaurus is a local-first AI runtime optimized for Apple Silicon that runs open-source models on Mac with privacy and no cloud dependency.

GitHub Copilot CLI

Command Line Assistants

GH Copilot graduated into its own CLI. Not a rename—an upgrade. It’s now an agent that can understand your repo, propose edits, run commands, and even open PRs—with approvals.

Cursor CLI

19d

Command Line Assistants

Terminal-based Cursor Agent you can run interactively or headless to write, review, and modify code from any shell or CI system.

Memex

22d

Vibe Coding

An Everything Builder platform for creating software without coding expertise.

Grep

22d

Code Intelligence

Effortlessly search for code, files, and paths across a million GitHub repositories.

Warp Code

28d

Vibe Coding

Agentic coding inside Warp's terminal: code editor, diff-first review, codebase indexing, and multi-agent control with top LLMs (GPT-5, Claude, Gemini).

Xcode 26

1mo

Development Environments

Apple's macOS IDE with built-in AI coding assistance and native ChatGPT and Claude account support.

DeepWiki

1mo

Documentation

AI documentation you can talk to for any public GitHub repo—architecture diagrams, source-linked pages, semantic search, and Q&A. Free for public repos; private repo support via Devin.

Prompt Engineering Guide

1mo

Prompt Engineering

Open-source MIT-licensed guide by DAIR.AI covering prompting techniques, model-specific tips, examples, and resources for building with LLMs.

mini-SWE-agent

1mo

AI Coding Assistants

100-line Python coding agent with CLI/TUI that fixes GitHub issues and automates repo tasks. Model-agnostic via LiteLLM, sandboxable (Docker/Podman/Bubblewrap), with an Inspector to browse trajectories.

Newsletter

Get the latest AI Dev Tools in your inbox

Curated tools, community insights, and AI news from EveryDev.ai

No spam — unsubscribe anytime

EveryDev.ai

Everywhere

You Scroll.

r/EveryDevAI

@everydev-ai

Threads

@everydev.ai

YouTube

@everydevai

Bluesky

@everydevai.bsky.social

Mastodon

@EveryDevAI

X / Twitter

@everydevai