Datadog
Datadog is an AI-powered observability and security platform that provides unified monitoring for infrastructure, applications, logs, and security across any stack at any scale.
About Datadog
Datadog is a cloud-scale platform that gives engineering teams end-to-end visibility into infrastructure, applications, logs, and security from a single interface. It supports more than 1,000 integrations and provides AI-driven insights, anomaly detection, and automated alerting to help teams resolve issues faster. The platform spans observability, digital experience monitoring, software delivery, service management, and AI observability, with these product areas integrated for full-stack correlation.
- Infrastructure Monitoring — Sign up for a free trial and install the Datadog Agent on your hosts to start collecting metrics, events, and service checks within minutes.
- Application Performance Monitoring (APM) — Instrument your services with Datadog's tracing libraries to get distributed traces, service dependency maps, and RED metrics with 15-month retention.
- Log Management — Route logs to Datadog via the Agent or integrations; use Logging without Limits™ to ingest, process, and archive logs with flexible retention tiers.
- Cloud Security — Enable Cloud Security Posture Management (CSPM), Workload Protection, and Cloud SIEM to detect misconfigurations, threats, and vulnerabilities in real time.
- LLM Observability — Instrument LLM applications with the Python or Node SDK to trace every LLM call, run offline experiments, and monitor token usage, latency, and cost in production.
- Synthetic Monitoring & Real User Monitoring (RUM) — Create browser and API tests to proactively detect issues, and capture real user sessions with Session Replay for frontend performance analysis.
- Bits AI Agents — Use the built-in AI SRE and Security Analyst agents to automatically investigate alerts, identify root causes, and surface remediation steps in natural language.
- Workflow Automation & App Builder — Build no-code automation workflows and internal apps that trigger actions across your stack directly from Datadog dashboards.
- 1,000+ Integrations — Connect AWS, Azure, GCP, Kubernetes, databases, CI/CD tools, and more via out-of-the-box integrations and OpenTelemetry support.
- Dashboards & Alerts — Create custom dashboards combining metrics, logs, traces, and RUM data; set up monitors with machine learning-based anomaly and forecast alerting.
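As a rough illustration of the metric collection described under Infrastructure Monitoring, the sketch below formats a custom metric as a DogStatsD datagram and sends it over UDP to a locally running Agent. It assumes the Agent's DogStatsD listener is enabled on its default port 8125. The metric name, tags, and helper function names are hypothetical and chosen only for this example.

```python
import socket


def format_dogstatsd(name, value, metric_type="g", tags=None, sample_rate=None):
    """Build one DogStatsD datagram: <name>:<value>|<type>[|@<rate>][|#<tag>,<tag>]."""
    parts = [f"{name}:{value}", metric_type]
    if sample_rate is not None:
        parts.append(f"@{sample_rate}")
    if tags:
        parts.append("#" + ",".join(tags))
    return "|".join(parts)


def send_metric(datagram, host="127.0.0.1", port=8125):
    """Send a datagram to the Agent over UDP (fire-and-forget, no delivery guarantee)."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(datagram.encode("utf-8"), (host, port))


# Hypothetical gauge metric with two tags.
payload = format_dogstatsd(
    "checkout.queue_depth", 42, "g", tags=["env:staging", "service:checkout"]
)

if __name__ == "__main__":
    print(payload)  # checkout.queue_depth:42|g|#env:staging,service:checkout
    send_metric(payload)
```

Because UDP is connectionless, the send succeeds even when no Agent is listening, so this approach suits low-overhead metrics rather than data that must not be lost.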
Pricing
Infrastructure Free
Core collection and visualization features for up to 5 hosts with 1-day metric retention.
- 1-day metric retention
- Up to 5 hosts
- Out-of-the-box dashboards
- 1,000+ integrations
- Host and Container Maps
Infrastructure Pro
Centralize monitoring of systems, services, and serverless functions with 15-month metric retention.
- 1,000+ integrations
- Out-of-the-box dashboards
- 15-month metric retention
- 100 custom metrics included per host
- 5 containers included per host
- Single Sign-On with SAML
- Outlier Detection
Infrastructure Enterprise
Advanced features and administrative controls, including ML-based alerts and Live Processes.
- Machine learning-based alerts
- Live Processes
- Governance Console
- 200 custom metrics included per host
- 10 containers included per host
- Anomaly Detection
- Forecast Monitoring
- Watchdog automated insights
- SCIM
- IP Allowlist
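The per-host figures listed for Infrastructure Pro and Infrastructure Enterprise can be turned into an account-level estimate. The sketch below is a minimal calculation that assumes the included custom metrics and containers are pooled across all hosts on the plan. That pooling rule is not stated in this listing, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InfraPlan:
    name: str
    custom_metrics_per_host: int
    containers_per_host: int


# Per-host figures taken from the plan descriptions above.
PRO = InfraPlan("Infrastructure Pro", 100, 5)
ENTERPRISE = InfraPlan("Infrastructure Enterprise", 200, 10)


def included_allotment(plan, hosts):
    """Total included usage, assuming allotments pool across all hosts."""
    return {
        "custom_metrics": plan.custom_metrics_per_host * hosts,
        "containers": plan.containers_per_host * hosts,
    }


def overage(plan, hosts, custom_metrics_used, containers_used):
    """Usage above the pooled allotment (never negative)."""
    allot = included_allotment(plan, hosts)
    return {
        "custom_metrics": max(0, custom_metrics_used - allot["custom_metrics"]),
        "containers": max(0, containers_used - allot["containers"]),
    }


if __name__ == "__main__":
    print(included_allotment(PRO, 40))   # {'custom_metrics': 4000, 'containers': 200}
    print(overage(PRO, 40, 4500, 180))   # {'custom_metrics': 500, 'containers': 0}
```

Under that assumption, a 40-host fleet that emits 4,500 custom metrics exceeds the Pro allotment by 500 but fits within the Enterprise one.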
APM
Resolve issues faster with end-to-end distributed traces and service health metrics.
- End-to-end distributed tracing
- Service dependency visualizations
- 15-minute live trace search
- 15-day historical search
- RED (rate, errors, duration) metrics with 15-month retention
- Universal Service Monitoring
- Dynamic Instrumentation
APM Pro
Everything in APM plus Data Streams Monitoring for streaming data pipeline visibility.
- All APM features
- Data Streams Monitoring
- Automatic dependency mapping of queues
- End-to-end pipeline latency metrics
- Consumer lag metrics
- Faulty queue detection
APM Enterprise
Everything in APM Pro plus Continuous Profiler for code-level performance optimization.
- All APM Pro features
- Continuous Profiler
- Code-level tracing
- CPU and memory code profiles
- Code performance comparisons across versions
- Automatic code analysis
LLM Observability
Monitor, evaluate, and secure LLM applications with end-to-end tracing and experiments.
- End-to-end LLM tracing
- Offline experiments
- Online evaluations
- Token usage and cost monitoring
- Sensitive Data Scanner included (1 GB per 10,000 requests)
- Playground for prompt testing
- Dataset versioning
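The Sensitive Data Scanner allowance listed for LLM Observability works out to a simple ratio of scanned data to LLM requests. The sketch below estimates the included volume and any excess. The listing does not say whether the allowance is prorated or granted in whole 10,000-request steps, so the `prorate` flag is an assumption that covers both readings, and the function names are hypothetical.

```python
REQUESTS_PER_INCLUDED_GB = 10_000  # 1 GB of scanning per 10,000 LLM requests


def included_scan_gb(llm_requests, prorate=True):
    """Included Sensitive Data Scanner volume in GB for a given request count."""
    if llm_requests < 0:
        raise ValueError("llm_requests must be non-negative")
    if prorate:
        # Assumption: partial blocks of requests earn a proportional share.
        return llm_requests / REQUESTS_PER_INCLUDED_GB
    # Alternative reading: only complete 10,000-request blocks count.
    return float(llm_requests // REQUESTS_PER_INCLUDED_GB)


def scan_overage_gb(llm_requests, scanned_gb, prorate=True):
    """Scanned volume beyond the included allowance (never negative)."""
    return max(0.0, scanned_gb - included_scan_gb(llm_requests, prorate))


if __name__ == "__main__":
    print(included_scan_gb(250_000))        # 25.0
    print(scan_overage_gb(250_000, 30.0))   # 5.0
```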
Log Management - Ingest
Ingest, process, enrich, live tail, and archive all your logs.
- Out-of-the-box parsing for 200+ log sources
- Enrich and tag logs to support role-based access control (RBAC)
- Generate log-based metrics
- Self-hosted archives
- Dynamic routing to retention tiers
Bits AI SRE Investigations
Resolve incidents faster with autonomous alert investigations. Billed annually per 20 investigations.
- Automatic alert investigations with zero setup
- Root causes delivered in minutes
- Chat-based explanations in natural language
- Enterprise-grade RBAC and data controls
- Slack, Jira, GitHub, ServiceNow integrations
Code Coverage
Track, enforce, and improve test coverage across your entire codebase.
- Unified visibility into test coverage across repositories
- Automated PR quality gates
- Line-level annotations for untested code
- Coverage threshold enforcement
- Bits AI Dev Agent test suggestions
Capabilities
Key Features
- Infrastructure Monitoring
- Application Performance Monitoring (APM)
- Log Management
- Cloud Security Posture Management (CSPM)
- Workload Protection
- Cloud SIEM
- Real User Monitoring (RUM)
- Synthetic Monitoring
- LLM Observability
- Database Monitoring
- Network Monitoring
- Serverless Monitoring
- Continuous Profiler
- Error Tracking
- Incident Response
- Workflow Automation
- Bits AI SRE Investigations
- Dashboards & Alerts
- 1,000+ Integrations
- OpenTelemetry Support
- Session Replay
- Feature Flags
- CI Visibility
- Code Coverage
- Sensitive Data Scanner
