Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Podcasts
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    2,386+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1556
    • Coding1160
    • Infrastructure524
    • Marketing440
    • Design415
    • Projects378
    • Research350
    • Analytics327
    • Testing214
    • MCP207
    • Data201
    • Security186
    • Integration167
    • Learning154
    • Communication144
    • Prompts138
    • Extensions133
    • Commerce123
    • Voice122
    • DevOps97
    • Web74
    • Finance21
    1. Home
    2. Tools
    3. Respan
    Respan icon

    Respan

    Observability Platforms

    AI observability and evaluation platform for agents that traces, evaluates, and optimizes LLM behavior in production through a unified gateway and monitoring dashboard.

    Visit Website

    At a Glance

    Pricing
    Free tier available

    Free tier for getting started with the full platform.

    Team: $198/mo
    Enterprise: Custom/contact

    Engagement

    Available On

    Web
    API
    SDK
    CLI

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Observability PlatformsLLM EvaluationsPrompt Management

    Alternatives

    HumanloopHoneyHiveLangfuse
    Developer
    Keywords AI, Inc.Keywords AI, Inc. builds Respan, an LLMOps platform for AI a…

    Listed May 2026

    About Respan

    Respan (formerly Keywords AI) is an LLMOps platform built for teams shipping AI agents in production. Founded by Andy Li, Raymond Huang, and Hendrix Liu — who met at the University of Illinois — the company joined Y Combinator's Winter 2024 batch and has since rebuilt around a core insight: developers need deep visibility into how agents behave, not just routing. The platform covers the full loop from tracing and evaluation to prompt optimization, deployment, and monitoring.

    What It Is

    Respan is an AI observability and evaluation platform designed specifically for agent workflows. It captures every prompt, tool call, and model response in production, surfaces quality and performance signals automatically, and gives teams the controls to iterate without losing track of what changed. The platform positions itself as an alternative to tools like LangSmith, Langfuse, and Braintrust, with a unified system that combines tracing, evals, prompt management, and an AI gateway in one product.

    How the Five-Stage Workflow Fits Together

    Respan organizes its product around five connected stages:

    • Trace — End-to-end execution paths capture every step from input to output, with search, filter, and sort by content, latency, cost, quality, tags, and custom metadata. Production traces can be replayed in a playground to debug failures in full context.
    • Evaluate — Evaluation workflows combine human review, code checks, and LLM judges in a single flow. Teams define metrics first, then treat each judge as a function inside one evaluation system.
    • Optimize — Prompt, tool, model, and workflow changes are versioned so teams always know what changed and when. New versions are tested against prior baselines using the same production data.
    • Deploy — A single AI gateway routes across 500+ models with version control, rollout logic, and provider abstraction. Prompt and workflow versions can be promoted to production directly from the UI.
    • Monitor — Custom dashboards with 80+ graph types track quality, latency, cost, and product-specific signals. Alerts fire via Slack, email, or text when behavior drifts; automations can trigger dataset builds or follow-up evaluations from production signals.

    Integration Breadth

    Respan supports a wide range of frameworks and providers out of the box:

    • Frameworks: LangChain, LlamaIndex, Vercel AI SDK, Mastra, Haystack, BAML, Instructor, Agno, RubyLLM, OpenAI Agents SDK
    • Providers: OpenAI, Anthropic, Google Gemini, Azure OpenAI, AWS Bedrock, Groq, Fireworks, Together AI, Perplexity, OpenRouter, Mistral, DeepSeek, Cohere, xAI, and more
    • Ecosystem tools: Mem0, Cognee, AssemblyAI, Linkup, PostHog, Zapier, Cursor, Slack

    Python and JavaScript/TypeScript SDKs are available, and the platform supports OpenTelemetry for standardized instrumentation.

    Compliance and Security Posture

    Respan holds ISO 27001 certification and meets SOC 2, GDPR, and HIPAA requirements. A Business Associate Agreement is available for healthcare organizations. The platform offers cloud deployment by default, with self-hosted options available at the enterprise tier. PII masking, conditional log retention, and data retention management are included in higher tiers.

    Background: From Keywords AI to Respan

    The company announced its rebrand from Keywords AI to Respan in February 2026, describing the shift as a move from a routing-focused API toward a proactive observability platform built for the agent era. The about page states that the company started with an API to dynamically route LLM calls, then rebuilt around production observability after early customer feedback. The homepage claims the platform has processed 80 trillion+ tokens, attributing this figure to vendor-published marketing copy. The company is a 10-person team operating out of Alameda, California, and is backed by Gradient, Y Combinator, Hat-Trick Capital, and a range of angel investors from companies including Lovable, Retell AI, and Mem0.

    Respan - 1

    Community Discussions

    Be the first to start a conversation about Respan

    Share your experience with Respan, ask questions, or help others learn from your insights.

    Pricing

    FREE

    Pro

    Free tier for getting started with the full platform.

    • Full platform access
    • 100k logs
    • 1k scores
    • 5 datasets
    • 2 evaluators

    Team

    Popular

    For startups and growing teams. Billed yearly.

    $198/mo
    billed annually
    $239/mo monthly
    • Everything in Pro
    • Unlimited datasets
    • Unlimited evaluators
    • Unlimited prompts
    • 10k scores included
    • 30-day log retention
    • 8,400 requests/min proxy throughput
    • 5 members included
    • Private Slack channel
    • SOC 2 report
    • Email support (8h SLA)
    • 99.9% uptime SLA

    Enterprise

    For large organizations with custom needs.

    Custom
    contact sales
    • Everything in Team
    • Custom log and score packages
    • Volume discounts
    • Custom SLAs
    • Dedicated support engineer
    • HIPAA BAA
    • Self-hosted deployment option
    • SAML SSO
    • Custom log retention
    • 99.99% uptime SLA
    • Advanced admin roles

    HIPAA Compliance

    Add-on

    HIPAA compliance add-on for Team plan customers.

    $249
    per month
    Part of respan
    • HIPAA BAA
    • HIPAA compliance
    View official pricing

    Capabilities

    Key Features

    • End-to-end agent tracing
    • LLM evaluation workflows (human, code, and LLM judges)
    • Prompt versioning and management
    • AI gateway with 500+ model support
    • Custom monitoring dashboards (80+ graph types)
    • Production trace replay in playground
    • Dataset creation from production traces
    • Synthetic test case generation
    • Prompt and workflow deployment from UI
    • Model routing and load balancing
    • Request caching and auto retries
    • Spending and rate limits
    • Slack, email, and text alerts
    • OpenTelemetry support
    • Multi-modality support (image, voice)
    • PII masking
    • PostHog integration
    • Batch export (JSONL, CSV)
    • Scheduled webhooks
    • SSO (Google, SAML for enterprise)

    Integrations

    LangChain
    LlamaIndex
    Vercel AI SDK
    Mastra
    Haystack
    BAML
    Instructor
    Agno
    RubyLLM
    OpenAI Agents SDK
    OpenAI
    Anthropic
    Google Gemini
    Azure OpenAI
    AWS Bedrock
    Groq
    Fireworks
    Together AI
    Perplexity
    OpenRouter
    Mistral
    DeepSeek
    Cohere
    xAI
    Mem0
    Cognee
    AssemblyAI
    Linkup
    PostHog
    Zapier
    Cursor
    Slack
    Replicate
    Baseten
    AI21 Labs
    Nebius AI
    Novita AI
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate Respan and help others make informed decisions.

    Developer

    Keywords AI, Inc.

    Keywords AI, Inc. builds Respan, an LLMOps platform for AI agent observability, evaluation, and deployment. Founded by Andy Li, Raymond Huang, and Hendrix Liu — engineering students who met at the University of Illinois — the company joined Y Combinator's Winter 2024 batch and is backed by Gradient and Y Combinator. The team ships weekly from a hacker house in Alameda, California, and focuses on giving AI teams the tracing, evaluation, and prompt management infrastructure needed to operate agents reliably in production.

    Read more about Keywords AI, Inc.
    WebsiteGitHubLinkedInX / Twitter
    1 tool in directory

    Similar Tools

    Humanloop icon

    Humanloop

    An LLM evaluation and prompt management platform for enterprises that helps teams develop, evaluate, and ship trustworthy AI applications — now being acquired by Anthropic.

    HoneyHive icon

    HoneyHive

    AI observability and evaluation platform to monitor, evaluate, and govern AI agents and applications across any model, framework, or agent runtime.

    Langfuse icon

    Langfuse

    Open source LLM engineering platform for observability, prompt management, evaluation, and debugging of AI applications and agents.

    Browse all tools

    Related Topics

    Observability Platforms

    Comprehensive platforms that combine metrics, logs, and traces with AI-powered analytics to provide deep insights into complex distributed systems and application behavior.

    77 tools

    LLM Evaluations

    Platforms and frameworks for evaluating, testing, and benchmarking LLM systems and AI applications. These tools provide evaluators and evaluation models to score AI outputs, measure hallucinations, assess RAG quality, detect failures, and optimize model performance. Features include automated testing with LLM-as-a-judge metrics, component-level evaluation with tracing, regression testing in CI/CD pipelines, custom evaluator creation, dataset curation, and real-time monitoring of production systems. Teams use these solutions to validate prompt effectiveness, compare models side-by-side, ensure answer correctness and relevance, identify bias and toxicity, prevent PII leakage, and continuously improve AI product quality through experiments, benchmarks, and performance analytics.

    74 tools

    Prompt Management

    Tools for organizing, versioning, and managing AI prompts.

    35 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Discussions