Agenta
Open-source LLMOps platform for prompt management, evaluation, and observability for developer and product teams.
At a Glance
About Agenta
Agenta is an open-source LLMOps platform that helps developers and product teams build reliable LLM applications. It covers the LLM development lifecycle with tools for prompt management, evaluation, and observability. Agenta provides a web UI, APIs, and SDKs so teams can collaborate, run systematic evaluations, and monitor production behavior.
- Prompt management: Organize, version, and collaborate on prompts so subject-matter experts can edit prompts outside the codebase; use the playground to iterate and run side-by-side comparisons.
- Evaluation: Run automatic evaluations at scale, human annotation workflows, and online evaluation against production traffic to validate changes before and after deployment.
- Observability & tracing: Capture traces and user feedback via API, debug agent execution flows, and track cost and failure cases over time to add to test sets.
- Playground and test sets: Experiment with prompts and test sets inside the UI to reproduce and fix edge cases, then promote validated prompts to production.
- Integrations and SDKs: Integrate with major LLM providers and frameworks via API and SDKs; Agenta is OpenTelemetry-compliant for tracing and supports adding custom or self-hosted models.
To get started, sign up for the web product or self-host the MIT-licensed project, add your model providers and prompts, create test sets, and run evaluations from the UI or via the API/SDK.
Community Discussions
Be the first to start a conversation about Agenta
Share your experience with Agenta, ask questions, or help others learn from your insights.
Pricing
Free
Get started with Agenta at no cost with Unlimited prompts and 2 seats included.
- Unlimited prompts
- 2 seats included
- 20 evaluations per month included
- 5k traces per month included
- 30 days retention period
Pro
Includes additional seats, higher trace allowances, in-app support, and longer retention.
- 3 seats included
- Up to 10 seats
- Unlimited evaluations
- 10k traces per month included then $5 per 10k
- In-app support
- 90 days retention period
Business
Everything from Pro plus enterprise security, compliance, and extended retention.
- Unlimited seats
- Unlimited evaluations
- 1M traces per month included then $5 per 10k
- Role-based access control
- SOC 2 reports
- Private Slack channel
- 365 days retention period
Enterprise
Personalized service, enterprise security, and custom terms for large organizations.
- Everything from Business
- Volume pricing
- Audit logs
- Custom retention periods
- Bring Your Own Cloud
- Dedicated support and self-hosted deployment options
- Security reviews, custom SLA and DPA
Capabilities
Key Features
- Prompt management and versioning
- Playground for prompt experimentation
- Automatic evaluation at scale
- Human evaluation workflows
- Online evaluation for production
- Tracing and observability (OpenTelemetry-compatible)
- Test sets and A/B testing
- Cost tracking and retention controls
- Role-based access control and enterprise features
- Self-hostable MIT-licensed deployment
Integrations
Demo Video

