EveryDev.ai
With AI, Everyone is a Dev. EveryDev.ai © 2026
    Langfuse

    LLM Evaluations

    Open source LLM engineering platform for observability, prompt management, evaluation, and debugging of AI applications and agents.

    At a Glance

    Pricing
    Open Source
    Free tier available

    Get started, no credit card required. Great for hobby projects and POCs.

    Core: $29/mo
    Pro: $199/mo
    Teams Add-on: $300/mo
    Enterprise: $2,499/mo

    Available On

    macOS
    Linux
    Web
    API
    SDK

    Resources

    Website, Docs, GitHub, llms.txt

    Topics

    LLM Evaluations, Observability Platforms, Prompt Management

    Alternatives

    Lunary, Langtrace, TensorZero
    Developer
    Langfuse, Inc.; Berlin, Germany; Est. 2022; $4.5M raised

    Updated May 2026

    About Langfuse

    Langfuse is an open source LLM engineering platform built by Langfuse GmbH (now part of ClickHouse) that helps teams develop, monitor, evaluate, and debug AI applications and agents. It is MIT-licensed for core features, self-hostable in minutes, and available as a managed cloud service. The platform is built on OpenTelemetry and supports over 80 integrations across model providers, agent frameworks, and languages.

    What It Is

    Langfuse sits in the LLMOps category, providing the full development lifecycle toolchain for teams building production-grade LLM applications. It combines LLM observability (tracing), prompt management, evaluation pipelines, a playground, and experiment datasets into one integrated platform. Engineers can use it standalone for tracing or adopt the full suite to power a continuous improvement loop from prototype to production.

    Core Platform Capabilities

    • LLM Observability: Hierarchical traces capture every LLM call, tool invocation, retrieval step, and agent action. Traces include session tracking, user tracking, token and cost tracking, and agent graph visualization.
    • Prompt Management: Version-controlled prompts with one-click deployments and rollbacks, server- and client-side caching, composability, and release labels, so prompt updates ship without code changes.
    • Evaluation: LLM-as-a-judge, user feedback, manual annotation queues, and custom evaluation pipelines via API/SDK. Evaluations run on production traces or against offline datasets.
    • Experiments & Datasets: Define test cases, run structured experiments, and compare results side-by-side in the UI or via SDK.
    • Playground: Test prompts on real production inputs and compare models side-by-side directly from a trace.
    • Metrics & Dashboards: Monitor cost, latency, and quality with dashboards and automated alerts.
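To make the idea of a hierarchical trace concrete, here is a minimal sketch in plain Python. It is not the Langfuse SDK or its data model: the `Observation` class, its field names, and the example values are all hypothetical and only illustrate how nested LLM calls, tool invocations, and retrieval steps can roll token and cost totals up to the root of a trace.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Observation:
    """One node in a trace tree (hypothetical model, not the Langfuse schema)."""
    name: str
    kind: str  # e.g. "agent", "generation", "tool", "retrieval"
    input_tokens: int = 0
    output_tokens: int = 0
    cost_usd: float = 0.0
    children: List["Observation"] = field(default_factory=list)

    def total_tokens(self) -> int:
        """Tokens used by this node plus all of its descendants."""
        own = self.input_tokens + self.output_tokens
        return own + sum(child.total_tokens() for child in self.children)

    def total_cost(self) -> float:
        """Cost of this node plus all of its descendants."""
        return self.cost_usd + sum(child.total_cost() for child in self.children)

    def render(self, depth: int = 0) -> List[str]:
        """Return an indented outline of the trace, one line per node."""
        lines = [f"{'  ' * depth}{self.kind}: {self.name}"]
        for child in self.children:
            lines.extend(child.render(depth + 1))
        return lines


def build_example_trace() -> Observation:
    """Build a small agent trace with made-up token counts and costs."""
    tool = Observation("lookup-order", "tool")
    plan = Observation("plan", "generation", 300, 50, 0.0011, children=[tool])
    retrieval = Observation("vector-search", "retrieval")
    answer = Observation("draft-answer", "generation", 900, 150, 0.0042)
    return Observation("support-agent", "agent", children=[plan, retrieval, answer])
```

Calling `build_example_trace().total_tokens()` sums the two generation nodes (350 and 1,050 tokens), and `render()` yields the nested outline that a trace view would display.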

    Integration Breadth and Stack Compatibility

    Langfuse is built on OpenTelemetry, which means it works with any language that supports OTel instrumentation (Python, TypeScript, Go, Java, .NET, Ruby, PHP, Swift). Native SDKs exist for Python and JavaScript/TypeScript. The platform lists 80+ integrations including:

    • Agent frameworks: LangChain, LlamaIndex, Vercel AI SDK, CrewAI, Pydantic AI, Google ADK, OpenAI Agents SDK, Mastra, AutoGen, DSPy, and more
    • Model providers: OpenAI, Anthropic, Amazon Bedrock, Azure OpenAI, Mistral AI, Google Gemini, xAI, vLLM, Groq, Ollama
    • No-code tools: Dify, Langflow, OpenWebUI, n8n
    • Analytics: PostHog, Mixpanel

    Deployment Model

    Langfuse offers two deployment paths: a managed cloud (Langfuse Cloud) with US, EU, and JP data regions, and a fully self-hosted option. Self-hosting is supported via Docker Compose, Kubernetes (Helm), and Terraform templates for AWS, GCP, and Azure. The core platform is MIT-licensed and all product features are available in the self-hosted version. The architecture uses a ClickHouse OLAP database, async ingestion via a Redis queue, and S3/blob storage for large payloads, and it is designed to handle billions of observations per month.
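The ingestion pattern described above (accept events quickly, process them asynchronously, and keep large payloads out of the analytical store) can be sketched with the Python standard library. This is an illustration of the general pattern only, not Langfuse's implementation: an in-memory `queue.Queue` stands in for the Redis queue, a dict for S3/blob storage, a list for the ClickHouse table, and the `LARGE_PAYLOAD_BYTES` threshold is an arbitrary assumption.

```python
import queue
import threading
from typing import Dict, List

LARGE_PAYLOAD_BYTES = 1024  # hypothetical cutoff for offloading to blob storage


class IngestionPipeline:
    """Toy async ingestion: enqueue on the request path, process in a worker."""

    def __init__(self) -> None:
        self.events = queue.Queue()            # stand-in for the Redis queue
        self.blob_store: Dict[str, str] = {}   # stand-in for S3/blob storage
        self.rows: List[dict] = []             # stand-in for the OLAP table
        self._worker = threading.Thread(target=self._drain, daemon=True)
        self._worker.start()

    def ingest(self, event_id: str, payload: str) -> None:
        """Enqueue the event and return immediately."""
        self.events.put({"id": event_id, "payload": payload})

    def _drain(self) -> None:
        """Worker loop: store small payloads inline, large ones by reference."""
        while True:
            event = self.events.get()
            try:
                if event is None:  # sentinel: stop the worker
                    return
                row = {"id": event["id"]}
                if len(event["payload"].encode("utf-8")) > LARGE_PAYLOAD_BYTES:
                    key = f"payloads/{event['id']}"
                    self.blob_store[key] = event["payload"]
                    row["payload_ref"] = key
                else:
                    row["payload"] = event["payload"]
                self.rows.append(row)
            finally:
                self.events.task_done()

    def close(self) -> None:
        """Process everything already queued, then stop the worker."""
        self.events.put(None)
        self._worker.join()
```

The point of the design is that `ingest` never waits on storage, so a slow database write cannot add latency to the instrumented application.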

    Update: Joining ClickHouse and Recent Activity

    In January 2026, Langfuse joined ClickHouse, the open-source database company, to accelerate development. The latest OSS release as of the source data is v3.174.1 (published May 13, 2026), with the repository showing active daily releases. Recent changelog entries include "Sign in with ClickHouse Cloud" (May 18, 2026), "Introducing Langfuse Academy" (May 14, 2026), and "Self-Service Enterprise SSO Setup" (May 8, 2026). The homepage states the project has over 27,400 GitHub stars and 300+ contributors.

    Enterprise Scale and Security

    The platform is designed for high-volume LLM workloads. According to the Langfuse website, it processes over 10 billion observations per month and has over 50 million SDK installs per month. Security certifications listed on the site include SOC 2 Type II, ISO 27001, GDPR compliance, and HIPAA eligibility. Enterprise deployments support SCIM, audit logs, fine-grained RBAC, enterprise SSO (Okta, AzureAD/EntraID), custom rate limits, and uptime SLAs.

    Pricing

    Hobby (Free)

    Get started, no credit card required. Great for hobby projects and POCs.

    • All platform features (with limits)
    • 50k units / month included
    • 30 days data access
    • 2 users
    • Community support via GitHub

    Core

    For production projects. Longer data access and unlimited users.

    $29 per month
    • Everything in Hobby
    • 100k units / month included
    • Additional usage at $8/100k units (volume discounts available)
    • 90 days data access
    • Unlimited users
    • In-app support
    • 3 annotation queues
    • Ingestion throughput: 4,000 requests/min
    • 48h response time SLO

    Pro

    For scaling projects. Unlimited history, high rate limits, all features.

    $199 per month
    • Everything in Core
    • 100k units / month included
    • Additional usage at $8/100k units (volume discounts available)
    • 3 years data access
    • Data retention management
    • Unlimited annotation queues
    • High rate limits (20,000 requests/min ingestion)
    • SOC2 & ISO27001 reports
    • BAA available (HIPAA)
    • Prioritized in-app support
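As a worked example of the Core and Pro list prices above, the following sketch estimates a monthly bill from a unit count. It assumes list pricing with no volume discounts, and it is an assumption that overage is billed proportionally rather than in whole 100k blocks (the `round_up_blocks` flag covers the other case). The function and constant names are hypothetical.

```python
import math

# List prices taken from the plan descriptions above.
PLANS = {
    "core": {"base_usd": 29, "included_units": 100_000},
    "pro": {"base_usd": 199, "included_units": 100_000},
}
OVERAGE_USD_PER_100K = 8


def estimate_monthly_cost(plan: str, units: int, round_up_blocks: bool = False) -> float:
    """Estimate a monthly bill in USD, ignoring volume discounts."""
    details = PLANS[plan]
    extra_units = max(0, units - details["included_units"])
    blocks = extra_units / 100_000
    if round_up_blocks:
        # Assumption: partial blocks are billed as full 100k blocks.
        blocks = math.ceil(blocks)
    return details["base_usd"] + blocks * OVERAGE_USD_PER_100K
```

For instance, 350,000 units on Core comes to $29 + 2.5 × $8 = $49 under proportional billing, or $53 if partial blocks are rounded up. Actual invoices may differ because volume discounts are available.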

    Teams Add-on

    Optional add-on for Pro plan: Enterprise SSO, fine-grained RBAC, dedicated Slack/MS Teams support.

    $300 per month
    • Enterprise SSO (e.g. Okta)
    • SSO enforcement
    • Fine-grained RBAC
    • Support via Dedicated Slack / MS Teams Channel

    Enterprise

    For large scale teams. Enterprise-grade support and security.

    $2,499 per month
    • Everything in Pro + Teams
    • 100k units / month included
    • Audit Logs
    • SCIM API
    • Custom rate limits
    • Uptime SLA
    • Support SLA
    • Dedicated support engineer
    • Onboarding & architectural guidance
    • Custom volume pricing (with yearly commitment)
    • Billing via AWS Marketplace (yearly commitment)
    • Billing via Invoice
    Capabilities

    Key Features

    • LLM Observability & Tracing
    • Hierarchical trace views with cost and latency
    • Session and user tracking
    • Agent graph visualization
    • Prompt Management with version control
    • One-click prompt deployments and rollbacks
    • Prompt caching (server and client)
    • LLM Playground
    • LLM-as-a-judge evaluations
    • Human annotation queues
    • Datasets and offline experiments
    • Custom evaluation scores via API/SDK
    • Cost and token tracking
    • Dashboards and metrics
    • OpenTelemetry native
    • 80+ framework and model integrations
    • REST API and Query SDK
    • S3/blob storage export
    • Self-hosting (Docker, Kubernetes, Terraform)
    • SOC 2 Type II and ISO 27001 compliance
    • HIPAA eligible
    • Enterprise SSO (Okta, AzureAD)
    • SCIM API
    • Audit logs
    • Fine-grained RBAC
    • MCP server and CLI for coding agents
    • SKILL.md agent skill
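The prompt-management features listed above (version control with one-click deployments and rollbacks) rest on a simple idea: versions are immutable, and a movable label decides which version the application fetches. The sketch below shows that idea in plain Python. `PromptRegistry` and its methods are hypothetical and are not the Langfuse API; the `"production"` label name is used here only as an example.

```python
from typing import Dict, List


class PromptRegistry:
    """Toy prompt store: append-only versions plus movable labels."""

    def __init__(self) -> None:
        self._versions: Dict[str, List[str]] = {}
        self._labels: Dict[str, Dict[str, int]] = {}

    def create_version(self, name: str, template: str) -> int:
        """Append a new immutable version and return its 1-based number."""
        versions = self._versions.setdefault(name, [])
        versions.append(template)
        return len(versions)

    def set_label(self, name: str, label: str, version: int) -> None:
        """Point a label at a version; deploying and rolling back both use this."""
        if not 1 <= version <= len(self._versions.get(name, [])):
            raise ValueError(f"unknown version {version} for prompt {name!r}")
        self._labels.setdefault(name, {})[label] = version

    def get(self, name: str, label: str = "production") -> str:
        """Fetch the template the label currently points at."""
        version = self._labels[name][label]
        return self._versions[name][version - 1]


registry = PromptRegistry()
v1 = registry.create_version("greeting", "Hello {{name}}!")
v2 = registry.create_version("greeting", "Hi {{name}}, how can I help?")
registry.set_label("greeting", "production", v2)   # deploy version 2
deployed = registry.get("greeting")
registry.set_label("greeting", "production", v1)   # roll back to version 1
rolled_back = registry.get("greeting")
```

Because the application always asks for a label rather than a version number, moving the label changes what is served without a code change or redeploy.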

    Integrations

    OpenAI
    Anthropic
    Amazon Bedrock
    Azure OpenAI
    Google Gemini
    Google Vertex AI
    Mistral AI
    xAI / Grok
    vLLM
    Groq
    Ollama
    LangChain
    LlamaIndex
    Vercel AI SDK
    LiteLLM
    CrewAI
    Pydantic AI
    Google ADK
    OpenAI Agents SDK
    AutoGen
    DSPy
    Mastra
    Spring AI
    Haystack
    Dify
    Langflow
    OpenWebUI
    n8n
    PostHog
    Mixpanel
    Cursor
    Claude Code
    Promptfoo
    Ragas
    LiveKit
    Temporal
    Amazon AgentCore
    Strands Agents
    Microsoft Agent Framework
    Developer

    Langfuse, Inc.

    Langfuse is a Y Combinator-backed company (W23), founded in 2022, that builds open-source LLM engineering and observability tools. Backed by Lightspeed Venture Partners and La Famiglia, Langfuse helps developers and teams build, monitor, and improve production-grade LLM applications with tools for tracing, prompt management, quality evaluation, and analytics.

    Founded 2022
    Berlin, Germany
    $4.5M raised
    19 employees

    Used by

    Intuit
    Twilio
    Khan Academy

    Similar Tools

    Lunary

    Open-source platform to monitor, improve, and secure AI chatbots with observability, prompt management, evaluations, and analytics.

    Langtrace

    Open-source, OpenTelemetry-based observability and evaluations platform for LLM applications, supporting real-time tracing, metrics, and debugging across popular LLMs, frameworks, and vector databases.

    TensorZero

    An open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation in a single self-hosted stack.

    Related Topics

    LLM Evaluations

    Platforms and frameworks for evaluating, testing, and benchmarking LLM systems and AI applications. These tools provide evaluators and evaluation models to score AI outputs, measure hallucinations, assess RAG quality, detect failures, and optimize model performance. Features include automated testing with LLM-as-a-judge metrics, component-level evaluation with tracing, regression testing in CI/CD pipelines, custom evaluator creation, dataset curation, and real-time monitoring of production systems. Teams use these solutions to validate prompt effectiveness, compare models side-by-side, ensure answer correctness and relevance, identify bias and toxicity, prevent PII leakage, and continuously improve AI product quality through experiments, benchmarks, and performance analytics.

    89 tools

    Observability Platforms

    Comprehensive platforms that combine metrics, logs, and traces with AI-powered analytics to provide deep insights into complex distributed systems and application behavior.

    95 tools

    Prompt Management

    Tools for organizing, versioning, and managing AI prompts.

    41 tools