PandaProbe
Open source agent engineering platform providing traces, evals, metrics, and live monitoring to debug and improve AI agents.
At a Glance
For hobbyists getting started.
Engagement
Available On
Listed May 2026
About PandaProbe
PandaProbe is an open-source agent engineering platform built by Chirpz AI that gives developers the observability, evaluation, and monitoring tools needed to ship reliable AI agents with confidence. It supports the full agent development lifecycle — from first run to continuous improvement — with a single instrument() call. The platform is self-hostable under the Apache 2.0 license, with no vendor lock-in, and integrates seamlessly with all major agent frameworks and LLM providers.
- Tracing — Automatically captures every span (chains, agents, LLMs, tools) with one
instrument()call, tracking model types, params, token usage, and key metadata. - Evals & Metrics — Run trace-level and session-level evaluations to measure agent quality and catch regressions before they reach production.
- Live Monitoring — Monitor agents in real time to detect failures, latency spikes, and unexpected behaviors as they happen.
- Framework Integrations — Plug-and-play support for LangGraph, LangChain, CrewAI, Google ADK, Claude Agent SDK, and OpenAI Agents SDK via a Python SDK.
- LLM Provider Support — Works seamlessly with OpenAI, Anthropic, Google Gemini, and more through built-in wrappers.
- Human Annotation — Supports human-in-the-loop annotation for labeling and improving agent evaluation datasets.
- Self-Hosting — Deploy the full platform on your own infrastructure using the open-source repo; all core features and APIs are included at no cost.
- Scalable Cloud Option — Use PandaProbe Cloud for a managed experience with pay-as-you-go scaling beyond plan limits.
- Open by Default — Apache 2.0 licensed core platform built by a team of PhD researchers specializing in uncertainty and robustness in AI agents.
Community Discussions
Be the first to start a conversation about PandaProbe
Share your experience with PandaProbe, ask questions, or help others learn from your insights.
Pricing
Hobby
For hobbyists getting started.
- 100 base trace ingestion / mo
- 100 trace eval runs / mo
- 10 session eval runs / mo
- Human annotation
- 1 seat
Open Source
Self-host all core PandaProbe features for free without any limitations.
- Apache 2.0 license
- All core platform features and APIs
- Scalability of PandaProbe Cloud
- Deployment docs
- Community support
Pro
For developers and small teams.
- Everything in Hobby +
- 5k base traces / mo, then pay-as-you-go
- 5K trace eval runs / mo, then pay-as-you-go
- 100 session eval runs / mo, then pay-as-you-go
- 2 seats
- Email support
Startup
For scaling projects.
- Everything in Pro +
- 50k base traces / mo, then pay-as-you-go
- 50K trace eval runs / mo, then pay-as-you-go
- 1K session eval runs / mo, then pay-as-you-go
- 10 seats
- High rate limits
- Private Slack channel
- Data retention management
Enterprise
For large organizations.
- Everything in Startup +
- Alternative hosting options (hybrid & self-hosted)
- Custom SSO
- Access to dedicated engineering team
- Support SLA
- Team trainings & architectural guidance
- Unlimited seats
- Dedicated support
Capabilities
Key Features
- Agent tracing with one instrument() call
- Trace-level and session-level evaluations
- Live agent monitoring
- Human annotation support
- Pay-as-you-go scaling
- Self-hosting with Apache 2.0 license
- Token usage and metadata tracking
- Custom instrumentation support
- Data retention management
- Custom SSO (Enterprise)
