Defending Code Reference Harness

Name: Defending Code Reference Harness
Availability: OnlineOnly
Author: Anthropic, Inc.

An open-source reference implementation for autonomous vulnerability discovery and remediation using Claude, covering threat modeling, scanning, triage, patching, and a sandboxed autonomous pipeline.

Visit Website

At a Glance

Pricing

Open Source

Free to use, modify, and distribute under the Apache 2.0 license.

Engagement

Available On

Android

iOS

API

SDK

CLI

Anthropic, Inc.San Francisco, CAEst. 2021$65.7B raised

Listed Jun 2026

About Defending Code Reference Harness

The Defending Code Reference Harness is an open-source toolkit published by Anthropic under the Apache 2.0 license, designed to help security teams use Claude for autonomous vulnerability discovery and remediation. It provides both interactive Claude Code skills and a fully autonomous pipeline that walks through recon, finding, verification, reporting, and patching stages. The repository was created in May 2026 and is described by Anthropic as a reference implementation, not a maintained product.

What It Is

The harness is a Python-based reference pipeline that enables security engineers to run Claude-powered agents against source code repositories to find, verify, triage, and patch vulnerabilities. It ships with a set of Claude Code skills (/quickstart, /threat-model, /vuln-scan, /triage, /patch, /customize) for interactive use, plus an autonomous harness/ pipeline configured out of the box for C/C++ memory vulnerabilities using Docker and AddressSanitizer (ASAN). The repository README explicitly states it is "a reference, not a product" and is not accepting contributions.

How the Pipeline Works

The autonomous pipeline runs through seven distinct stages:

Build: Compiles the target into a Docker image with ASAN enabled.
Recon: A lightweight agent reads source code in a network-isolated container and proposes a partition of input-parsing subsystems for parallel exploration.
Find: Multiple agents run in parallel, each in its own isolated container, crafting malformed inputs and running the ASAN binary until a crash reproduces reliably.
Verify: A separate grader agent reproduces each crash in a fresh container using only the proof-of-concept produced by the find agent.
Dedupe: A judge agent compares verified crashes against previously reported bugs to identify new, improved, or duplicate findings.
Report: A report agent writes a structured exploitability analysis per unique bug, covering primitive class, reachability, escalation path, and severity.
Patch: A patch agent proposes a fix, and a grader agent confirms the fix builds, the original proof-of-concept no longer crashes, the test suite passes, and a fresh find agent cannot bypass the fix.

Sandboxing and Security Model

The harness enforces a strict sandboxing model. The interactive skills (/quickstart, /threat-model, /vuln-scan, /triage) are read- and write-only and are safe to run without a sandbox as long as tool uses are reviewed in Claude Code. The autonomous pipeline, however, executes target code and refuses to run outside a gVisor sandbox unless explicitly overridden. A setup script (scripts/setup_sandbox.sh) installs gVisor, builds agent images, and verifies isolation. Each agent runs inside a gVisor container with egress restricted to the Claude API.

Customization and Portability

The /customize skill allows teams to port the pipeline to languages and vulnerability classes beyond C/C++. Porting requires answering three questions for the target stack: what signals a finding (e.g., ASAN crash, exception, DNS callback), what a proof of concept looks like (e.g., crashing input file, HTTP request sequence), and how the target is built and run. The README provides a structured ramp-up path spanning from Day 1 interactive skills through Week 2 autonomous scanning, triage, and patching at scale.

Update: Repository Launch (May 2026)

The repository was created on May 22, 2026, and last pushed on June 2, 2026. It was published alongside the Claude Mythos Preview (Glasswing) announcement and an accompanying blog post on using LLMs to secure source code. Anthropic also references a companion cookbook on the Claude platform for a lightweight SDK-only walkthrough of the same vulnerability detection loop. The README notes that Anthropic offers a hosted managed version called Claude Security for teams that want a productized option rather than the self-hosted reference implementation.

Community Discussions

Be the first to start a conversation about Defending Code Reference Harness

Share your experience with Defending Code Reference Harness, ask questions, or help others learn from your insights.

Pricing

OPEN SOURCE

Open Source

Free to use, modify, and distribute under the Apache 2.0 license.

Interactive Claude Code skills for threat modeling, scanning, triage, and patching
Autonomous 7-stage vulnerability pipeline
gVisor sandboxing support
Customizable for any language or vulnerability class
Companion cookbook and documentation

Capabilities

Key Features

Interactive Claude Code skills for threat modeling, scanning, triage, and patching
Autonomous 7-stage pipeline: recon, find, verify, dedupe, report, patch
C/C++ memory vulnerability detection using ASAN and Docker
gVisor sandboxing with egress restricted to Claude API
Parallel find agents for broad vulnerability surface coverage
Cross-run triage and deduplication via /triage skill
Customizable pipeline for other languages and vulnerability classes via /customize
Candidate patch generation and automated fix verification
Structured exploitability reports per unique bug
Companion Claude Code skills that are read/write-only and safe to run unsandboxed

Integrations

Claude API

Claude Code

Docker

gVisor

ASAN (AddressSanitizer)

Amazon Bedrock

Google Cloud Vertex AI

Azure

API Available

View Docs

Ratings & Reviews

No ratings yet

Be the first to rate Defending Code Reference Harness and help others make informed decisions.

Developer

Anthropic, Inc.

Anthropic is an American artificial intelligence startup and public benefit corporation focused on developing safe and reliable AI systems that are beneficial for humanity. ## Overview Anthropic is a public benefit corporation dedicated to building AI systems that are reliable, interpretable, and steerable. Founded in 2021, the company emphasizes AI alignment and constitutional AI principles to ensure beneficial outcomes for humanity. Known for its Claude family of large language models, Anthropic serves businesses, nonprofits, developers, and civil society groups with emphasis on AI safety and alignment. ## History Anthropic was founded in 2021 by former OpenAI executives and researchers, including siblings Dario Amodei and Daniela Amodei, along with Jack Clark, Sam McCandlish, and Tom Brown. The founders left OpenAI due to differing views on commercialization and safety priorities, aiming to build AI with stronger emphasis on long-term safety. Key milestones in the company's evolution include: - **January 2026**: Secured another $10 billion in funding led by Singapore's GIC and Coatue Management, pushing valuation to $350 billion - **Late 2025**: Expanded internationally, opening offices in Seoul (third in Asia-Pacific) alongside locations in the US, Europe, and other regions - **November 2025**: Released Claude Opus 4.5, acclaimed as the world's best model for coding, agents, and computer use - **October 2025**: Released Claude Haiku 4.5, achieving a 73.3% score on SWE-bench Verified - **September 2025**: Raised $13 billion Series F round at a $183 billion post-money valuation to fuel enterprise expansion, safety research, and global growth - **May 2025**: Launch of Claude Opus 4 and Claude Sonnet 4, setting new standards in coding, advanced reasoning, AI agents, extended thinking with tool use, and enhanced API capabilities - **2022-2023**: Initially bootstrapped, attracted major investments including partnerships with Amazon (up to $4 billion commitment) and Google (up to $2 billion) - **2021**: Founded by former OpenAI executives with focus on AI safety and alignment The company's trajectory has been marked by rapid innovation in AI models while maintaining a commitment to ethical development. Revenue surged from $1 billion to over $5 billion within eight months following the Series F round. ## Products & Services Anthropic's core offerings revolve around its Claude AI family, which are designed for safety, helpfulness, and honesty. The latest releases as of early 2026 include: ### Claude Model Family - **Claude Opus 4.5** (November 2025): The flagship hybrid reasoning model excelling in coding, agents, and computer use. Recognized as the world's best model for complex tasks like advanced coding and agentic workflows, with improved token efficiency. - **Claude Sonnet 4.5** (September 2025): A balanced hybrid reasoning model designed for advanced agentic workflows, supporting up to 64K output tokens. Sets benchmarks in coding, reasoning, and computer use; noted as the most aligned AI available. Includes the Claude Agent SDK for building autonomous AI agents. Demonstrated a 44% reduction in average vulnerability intake time and a 25% accuracy improvement for HackerOne's Hai security agents. - **Claude Haiku 4.5** (October 2025): A lightweight, cost-efficient model optimized for fast and efficient code generation. Excels in coding and real-time tasks, matching state-of-the-art capabilities at lower costs. Delivers efficient code generation comparable in quality to Sonnet 4 but at faster speeds, powering GitHub Copilot. - **Claude Code**: Supports background coding tasks via GitHub Actions, approaching $1 billion in annual recurring revenue (ARR). - **Claude Developer Platform**: Provides API access to Claude models. ### Key Features Services extend to enterprise solutions, including: - API access via Amazon Bedrock, Google Cloud's Vertex AI, and Microsoft Foundry - Custom fine-tuning for specialized tasks - Enterprise integrations for businesses - Constitutional AI for value-aligned responses - Agent Skills for customization - Extensive 200K token context window Key capabilities include integrations into Microsoft 365 Copilot, Microsoft Foundry, Snowflake, and Accenture for scalable AI deployment. ## Leadership Anthropic's leadership team combines expertise in AI research, product development, and global operations. Key members as of January 2026 include: | Name | Title | Background | |------|-------|------------| | Dario Amodei | Co-founder and CEO | Oversees strategic vision and operations. Former VP of Research at OpenAI. | | Daniela Amodei | Co-founder and President | Manages day-to-day operations and policy. Former OpenAI executive. | | Mike Krieger | Chief Product Officer | Leads product strategy. Co-founder of Instagram. | | Jan Leike | Co-lead of Alignment Science | Focuses on AI safety. Former OpenAI safety lead. | | Chris Ciauri | Managing Director of International | Drives global expansion with 25+ years from Google and Salesforce. Appointed September 2025. | | Thiyagu Ramasamy | Public Sector Leader | Supports government AI initiatives. | The team includes researchers, engineers, and policy experts, with no major leadership changes reported in early 2026. ## Business & Revenue Model Anthropic monetizes through API usage priced per million tokens processed, with tiered pricing based on the model. ### Pricing Tiers | Product | Input Price | Output Price | Notes | |---------|-------------|--------------|-------| | Claude Opus 4.5 | $5 per million tokens | $25 per million tokens | Flagship model for complex tasks | | Claude Sonnet 4.5 | $3 per million tokens | $15 per million tokens | Balanced model with 64K output tokens | | Claude Haiku 4.5 | $1 per million tokens | $5 per million tokens | Cost-efficient, fast model | ### Subscription Plans Available plans include: - **Pro**: Individual professional use - **Max**: Enhanced capabilities for power users - **Team**: Collaborative workspace features - **Enterprise**: Custom solutions with dedicated support ### Revenue Performance - **September 2025**: $1 billion revenue - **Early 2026**: Over $5 billion revenue (8-month growth period) - **Current ARR**: Claude Code approaching $1 billion - **2026 Target**: $26 billion annualized revenue ## Target Markets & Customers Anthropic serves enterprises requiring advanced coding and complex agent workflows, developers building AI solutions, and financial services firms leveraging AI for financial analysis. Notable customers and use cases include: - **Cursor**: Uses Claude for coding assistance - **GitHub**: Employs Claude for complex codebase understanding (Claude Haiku powers GitHub Copilot) - **HackerOne**: Achieved 44% reduction in vulnerability intake time and 25% accuracy improvement for Hai security agents - **Enterprise Clients**: Businesses, nonprofits, and civil society groups requiring AI safety and alignment ### Key Industries - Software Development & Engineering - Cybersecurity - Financial Services - Enterprise Technology - Government & Public Sector ## Competitive Advantages & Differentiators Anthropic distinguishes itself through: ### Constitutional AI Employs Constitutional AI training methods to guide model outputs, ensuring safety and alignment with human values. ### Agent Skills & Capabilities Enables customization for specialized tasks with advanced agentic workflows. Claude Sonnet 4.5 is noted as the most aligned AI available. ### Superior Context & Output Claude models feature an extensive 200K token context window, with Sonnet supporting up to 64K output tokens for complex tasks. ### Performance Metrics - **Claude Opus 4.5**: Recognized as the world's best model for coding, agents, and computer use as of November 2025 - **Claude Haiku 4.5**: 73.3% score on SWE-bench Verified, ranking among the world's top coding models - **Claude Sonnet 4.5**: 44% reduction in average vulnerability intake time for HackerOne's Hai security agents - **Performance**: Haiku delivers code generation comparable to Sonnet 4 at faster speeds and lower costs ## Partnerships & Integrations Strategic partnerships include: | Partner | Type | Description | Date | |---------|------|-------------|------| | Accenture | Enterprise Deployment | Multi-year collaboration to transition enterprises from AI pilots to production | December 2025 | | Snowflake | Agentic AI Platform | $200 million partnership to deploy agentic AI globally | December 2025 | | Allianz | Global Partnership | Advance AI initiatives | Date unspecified | | Amazon | Cloud & Investment | Up to $4 billion commitment providing cloud credits and strategic support | 2022-2023 | | Google | Cloud & Investment | Up to $2 billion providing cloud credits and strategic support | 2022-2023 | ### Platform Integrations Claude integrates with: - Amazon Bedrock - Google Cloud's Vertex AI - Microsoft Foundry - Microsoft 365 Copilot - GitHub (powers Copilot via Claude Haiku) - Snowflake - Accenture platforms ## Financial Information ### Funding & Valuation Anthropic has raised a total of $23 billion as of January 7, 2026. Key funding milestones include: | Round | Date | Amount | Valuation | Lead Investors | |-------|------|--------|-----------|----------------| | Early Partnerships | 2022-2023 | ~$6 billion | N/A | Amazon ($4B commitment), Google ($2B commitment) | | Series F | September 2025 | $13 billion | $183 billion | N/A | | Growth Round | January 2026 | $10 billion | $350 billion | GIC, Coatue Management | ### Major Investors - GIC (Singapore sovereign wealth fund) - Coatue Management - Amazon - Google - Microsoft - Nvidia ### Financial Projections - **2026**: $26 billion revenue target (annualized) - **2027**: Projecting profitability (three years ahead of competitors like OpenAI) - **2028**: Up to $70 billion in revenue with $17 billion in cash flow; reaching breakeven ## Future Plans Anthropic is positioned for explosive growth, projecting profitability by 2027—three years ahead of competitors like OpenAI. ### Strategic Initiatives - **Global Expansion**: Further international growth with new offices and partnerships to support AI adoption in regions like Korea - **Frontier AI Development**: Plans to advance frontier AI, potentially creating systems by summer 2026 that make users feel like they're in a "parallel world" due to rapid economic acceleration - **Enterprise Focus**: Emphasis on enterprise AI, agentic capabilities, and collaborations like those with Accenture for production-scale deployments - **AI Infrastructure**: Scaling AI infrastructure and safety research ### Expansion Plans Future initiatives include: - International office openings beyond current US, Europe, and Asia-Pacific locations - Enhanced agentic AI capabilities and autonomous systems - Production-scale enterprise deployments - Continued AI safety and alignment research ### Vision & Goals Co-founder Dario Amodei has predicted the emergence of one-person billion-dollar companies enabled by AI by 2026. The company aims to maintain its commitment to AI safety while scaling infrastructure and research capabilities. ## Market Position & Industry Context ### Market Segment Anthropic operates within the AI research and development sector, with specialized focus on AI safety, alignment, and advanced reasoning capabilities. ### Market Size & Growth The company targets $26 billion in revenue by the end of 2026, reflecting rapid growth in AI adoption across industries. This positions Anthropic as one of the most valuable AI startups. ### Competitive Landscape Anthropic's product suite competes with other advanced AI models and platforms including OpenAI, Google DeepMind, and others. Claude Haiku 4.5 matches Sonnet 4's performance on coding, computer use, and agent tasks, positioning Anthropic strongly in the coding and AI agent markets. ### Challenges & Opportunities Agent models face challenges managing multiple context windows. Claude Sonnet 4.5 and Haiku 4.5 exhibited awareness of evaluation scenarios during testing, highlighting ongoing complexities in AI behavior. The broader AI landscape poses unprecedented risks requiring careful management. However, the rapid adoption and revenue growth indicate strong market demand and opportunity. ## Recent News & Updates - **January 2026**: Secured $10 billion in funding led by GIC and Coatue Management, pushing valuation to $350 billion - **December 9, 2025**: Accenture and Anthropic announced a multi-year partnership to accelerate enterprise AI adoption - **December 3, 2025**: Snowflake and Anthropic unveiled a $200 million partnership to bring agentic AI to global enterprises - **November 24, 2025**: Release of Claude Opus 4.5, acclaimed as the best model globally for coding, agents, and computer use - **October 2025**: Claude Haiku 4.5 achieved a 73.3% score on SWE-bench Verified, ranking it among the world's top coding models - **September 2, 2025**: Completion of a $13 billion Series F funding round at a $183 billion valuation - **May 22, 2025**: Launch of Claude Opus 4 and Claude Sonnet 4, setting new standards in coding, advanced reasoning, AI agents, extended thinking with tool use, and enhanced API capabilities ## User Growth & Adoption - **January 2024**: 2.9 million users - **Early 2025**: 18.9 million users - **Growth rate**: Explosive growth reflecting strong market traction and adoption ## Additional Information ### Corporate Structure Anthropic is structured as a public benefit corporation, prioritizing societal impact alongside financial returns. This legal structure reflects the company's commitment to AI safety and beneficial outcomes for humanity. ### Public Offerings Anthropic is preparing for an initial public offering (IPO) as early as the first half of 2026, potentially one of the largest in history. The company has hired law firm Wilson Sonsini to handle preparations, with expected valuations exceeding $300 billion and annualized revenue nearing $26 billion. As of January 2026, no official filing has occurred, but investor enthusiasm is high amid the AI sector's momentum. ### Industry Influence The company plays a significant role in shaping US AI policy through CEO Dario Amodei's discussions on innovation and leadership. Anthropic's emphasis on AI safety and alignment has influenced industry conversations around responsible AI development. ### Recognition & Awards - Claude Opus 4.5 recognized as the world's best model for coding, agents, and computer use (November 2025) - Claude Haiku 4.5 ranked among world's top coding models with 73.3% SWE-bench Verified score - Claude Sonnet 4.5 noted as the most aligned AI available --- *Last Updated: January 11, 2026*

Founded 2021

San Francisco, CA

$65.7B raised

Used by

U.S. Department of Defense

U.S. intelligence and defense agencies

U.S. national security agencies

Iceland Ministry of Education and…

+23 more

Similar Tools

SkillShield

Security-scored AI skills directory that scans, scores, and verifies public GitHub and GitLab repositories containing SKILL.md files.

AISLE

Autonomous AI-powered cybersecurity platform that finds, fixes, and verifies vulnerabilities at superhuman speed and scale.

Gecko Security

AI-powered security engineer that detects and fixes business logic and multi-step vulnerabilities with low false positives.

Browse all tools

What It Is

How the Pipeline Works

The autonomous pipeline runs through seven distinct stages:

Build: Compiles the target into a Docker image with ASAN enabled.
Recon: A lightweight agent reads source code in a network-isolated container and proposes a partition of input-parsing subsystems for parallel exploration.
Find: Multiple agents run in parallel, each in its own isolated container, crafting malformed inputs and running the ASAN binary until a crash reproduces reliably.
Verify: A separate grader agent reproduces each crash in a fresh container using only the proof-of-concept produced by the find agent.
Dedupe: A judge agent compares verified crashes against previously reported bugs to identify new, improved, or duplicate findings.
Report: A report agent writes a structured exploitability analysis per unique bug, covering primitive class, reachability, escalation path, and severity.
Patch: A patch agent proposes a fix, and a grader agent confirms the fix builds, the original proof-of-concept no longer crashes, the test suite passes, and a fresh find agent cannot bypass the fix.

Defending Code Reference Harness