
    Galileo

    LLM Evaluations

    End-to-end platform for generative AI evaluation, observability, and real-time protection that helps teams test, monitor, and guard production AI applications.

    At a Glance

    Pricing

    Free tier available

    Developer tier for experimenting, iterating, and building with Galileo.

    Pro: $100/mo
    Enterprise: custom (contact sales)

    Available On

    Web
    API
    SDK

    Resources

    Website · Docs · GitHub · llms.txt

    Topics

    LLM Evaluations · Observability Platforms · Application Security

    Alternatives

    Confident AI · Patronus AI · Opik

    Developer

    Galileo · San Francisco, CA · Est. 2021 · $68M raised

    Updated Feb 2026

    About Galileo

    Galileo provides an enterprise-focused GenAI Studio for evaluating, observing, and protecting generative AI systems in development and production. It centralizes evaluation metrics, real-time observability, and runtime guardrails so teams can iterate on prompts, models, and retrieval strategies with measurable feedback. Galileo runs low-latency evaluators (Luna models), hosts inference for live monitoring, and offers SDKs and APIs to integrate logging and traces into existing applications.

    • Evaluate — Rapidly run and compare combinations of prompts, models, embedding params, and chain nodes to find the right configuration; use the platform UI or SDKs to log experiments and golden test sets.
    • Observe — Monitor live traffic, traces, and sessions with pre-built and custom metrics to detect drift, latency, and accuracy issues in production systems; connect via SDKs or API to stream logs and traces.
    • Protect — Intercept requests and responses in real time with guardrail policies and threat detection to block harmful outputs or attacks before they reach users.
    • Luna evaluation models — Use Galileo’s low-latency evaluator models to run automated judgements (<200ms typical) for production monitoring and inexpensive continuous evaluation.
    • SDKs & integrations — Install Python or TypeScript SDKs, initialize with an API key, and add the log decorator or GalileoLogger to capture prompts, responses, traces, and spans.

    Getting started: sign up for the hosted console, install the Python or TypeScript SDK, set GALILEO_API_KEY, and either use the log decorator or GalileoLogger to begin sending traces and running evaluations.
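The getting-started flow above centers on a log decorator that captures prompts, responses, and latency as traces. The sketch below is a conceptual stand-in, not the real Galileo SDK: the `log` decorator, the trace fields, and the local `captured_traces` buffer are illustrative assumptions (the actual SDK reads `GALILEO_API_KEY` from the environment and ships traces to the hosted console).

```python
import functools
import time

# Illustrative stand-in for an SDK-style log decorator: it captures a
# function's inputs, output, and latency as a trace record. The real
# SDK would send these to a hosted console; here they are collected
# in a local list so the sketch is self-contained.
captured_traces = []

def log(func):
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.monotonic()
        output = func(*args, **kwargs)
        captured_traces.append({
            "name": func.__name__,
            "input": {"args": args, "kwargs": kwargs},
            "output": output,
            "latency_ms": (time.monotonic() - start) * 1000,
        })
        return output
    return wrapper

@log
def answer(prompt: str) -> str:
    # Placeholder for a real LLM call.
    return f"echo: {prompt}"

answer("What does Galileo's Observe module do?")
```

Once wrapped this way, every call to `answer` produces a trace record with the prompt, response, and timing, which mirrors the span-capture pattern the description attributes to the SDK.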

    Pricing

    Free

    Developer tier for experimenting, iterating, and building with Galileo.

    • 5,000 traces per month
    • Unlimited users
    • Unlimited custom evals

    Pro (Popular)

    Plan for teams launching production AI with higher trace quotas and enterprise features.

    $100 per month
    • Everything in Free
    • 50,000 traces per month
    • Standard RBAC
    • Advanced analytics & insights
    • Real-time guardrails
    • Dedicated support via Slack

    Enterprise

    Custom plans for large teams that need unlimited scale, security, and premium support.

    Custom (contact sales)
    • Unlimited traces
    • Custom rate limits
    • Deploy: Hosted, VPC, or on-prem
    • Enterprise-grade security (RBAC, SSO)
    • Dedicated CSM and 24/7 support
    • Low-latency dedicated inference servers
    View official pricing

    Capabilities

    Key Features

    • AI evaluation workflows for prompts, models, and RAG systems
    • Real-time observability of traces, sessions, and metrics
    • Runtime protection and guardrail policies
    • Prebuilt and custom evaluator metrics (including hallucination detection)
    • Luna low-latency evaluation models and hosted inference server
    • Python and TypeScript SDKs and a public API
    • Auto-tune evaluators with continuous learning (CLHF)
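To make "custom evaluator metrics" concrete, here is a deliberately naive, self-contained sketch of one: a lexical groundedness score. Real hallucination detection (e.g. the Luna evaluators) is model-based; the token-overlap heuristic below only illustrates the evaluator-as-a-function shape such custom metrics take, and the function name and scoring rule are assumptions, not platform APIs.

```python
# Naive custom evaluator sketch: score how much of an answer's
# vocabulary appears in the retrieved context. A high score suggests
# the answer is grounded in the context; a low score flags possible
# hallucination. This is a lexical heuristic for illustration only.
def groundedness(answer: str, context: str) -> float:
    answer_tokens = set(answer.lower().split())
    context_tokens = set(context.lower().split())
    if not answer_tokens:
        return 0.0
    return len(answer_tokens & context_tokens) / len(answer_tokens)

score = groundedness(
    "paris is the capital of france",
    "france's capital city is paris",
)
```

A metric with this signature (answer plus context in, score in [0, 1] out) can be run per-trace in production monitoring or in batch over a golden test set.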

    Integrations

    OpenAI (wrapper)
    NVIDIA NeMo
    Python SDK
    TypeScript SDK
    API Available

    Demo Video

    Galileo Demo Video
    Watch on YouTube

    Developer

    Galileo Team

    Galileo builds an end-to-end GenAI Studio that centers reliability for production AI systems through evaluation, observability, and real-time protection. The team includes founders and engineers with backgrounds in large language models, speech recognition, and production AI systems from companies like Google and Uber. Galileo focuses on combining research-backed guardrail metrics with low-latency evaluators and developer-friendly SDKs to help teams ship safer AI.

    Founded 2021
    San Francisco, CA
    $68M raised
    175 employees

    Used by

    Comcast
    Twilio
    HP
    Magid
    +7 more
    Website · GitHub · X / Twitter
    1 tool in directory

    Similar Tools

    Confident AI

    End-to-end platform for LLM evaluation and observability that benchmarks, tests, monitors, and traces LLM applications to prevent regressions and optimize performance.

    Patronus AI

    Automated evaluation and monitoring platform that scores, detects failures, and optimizes LLMs and AI agents using evaluation models, experiments, traces, and an API/SDK ecosystem.

    Opik

    Open-source platform for evaluating, testing, and monitoring LLM applications with tracing and observability features.


    Related Topics

    LLM Evaluations

    Platforms and frameworks for evaluating, testing, and benchmarking LLM systems and AI applications. These tools provide evaluators and evaluation models to score AI outputs, measure hallucinations, assess RAG quality, detect failures, and optimize model performance. Features include automated testing with LLM-as-a-judge metrics, component-level evaluation with tracing, regression testing in CI/CD pipelines, custom evaluator creation, dataset curation, and real-time monitoring of production systems. Teams use these solutions to validate prompt effectiveness, compare models side-by-side, ensure answer correctness and relevance, identify bias and toxicity, prevent PII leakage, and continuously improve AI product quality through experiments, benchmarks, and performance analytics.

    48 tools
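The "LLM-as-a-judge" metrics mentioned above follow a simple pattern: a judge model is prompted with a rubric, the question, and a candidate answer, and returns a reply parsed into a numeric score. A minimal sketch of that pattern, with a stubbed judge in place of a real LLM call (the `judge_model` stub and the `score: N` reply format are assumptions for illustration):

```python
def judge_model(judge_prompt: str) -> str:
    # Stub standing in for a real LLM call; a production judge would
    # send judge_prompt to a model API and return its reply.
    return "score: 4"

def llm_as_judge(question: str, answer: str, rubric: str) -> int:
    # Build the judge prompt from rubric, question, and answer,
    # then parse the numeric score out of the judge's reply.
    judge_prompt = (
        f"Rubric: {rubric}\n"
        f"Question: {question}\n"
        f"Answer: {answer}\n"
        "Rate the answer from 1 to 5. Reply exactly as 'score: N'."
    )
    reply = judge_model(judge_prompt)
    return int(reply.split("score:")[1].strip())

score = llm_as_judge(
    "What is RAG?",
    "Retrieval-augmented generation combines search with an LLM.",
    "Correctness and relevance",
)
```

Evaluation platforms wrap this loop with dataset curation, side-by-side model comparison, and regression tracking so the same judge can score thousands of traces consistently.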

    Observability Platforms

    Comprehensive platforms that combine metrics, logs, and traces with AI-powered analytics to provide deep insights into complex distributed systems and application behavior.

    48 tools

    Application Security

    AI tools for securing software applications and identifying vulnerabilities.

    39 tools
    With AI, Everyone is a Dev. EveryDev.ai © 2026