How Claude Mythos found a 15-year-old bug in Mozilla Firefox | Brian Grinstead
Open original episodeMozilla distinguished engineer Brian Grinstead breaks down the agentic bug-finding pipeline his team built on top of Claude that shipped a record month of Firefox security fixes—covering the full harness architecture: LLM-based file scoring, verifier subagents to kill false positives, and goal-loop retry patterns any developer can replicate with Claude Code or Codex and a single prompt flag.
Why this matters: Demonstrates a concrete, reproducible pattern for deploying LLM agents against real production codebases at scale, with honest analysis of where models vs. harness design each deserve credit.