EveryDev.ai

    Canary

    Automated Testing

    A QA harness built for Claude Code that reads code diffs, identifies affected UI flows, and tests them in real browser instances with full session recordings.

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under the MIT license. Install via npm and use without restrictions.

    Available On

    CLI
    API


    Topics

    Automated Testing, Browser Automation, Agent Harness

    Alternatives

    Glance, mochi.js, Expect CLI
    Developer
    wizenheimer

    Listed Jun 2026

    About Canary

    Canary is an open-source QA harness purpose-built for coding agents like Claude Code, Cursor, and Codex. It reads code diffs, identifies affected UI flows, and drives real browser instances using a QuickJS WASM sandbox that exposes the full Playwright API. The project is MIT-licensed and hosted on GitHub under the wizenheimer/canary repository.
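The listing does not describe how Canary maps a diff to UI flows, so the following is only a hypothetical illustration of the general idea, not Canary's implementation. It pulls changed file paths out of a unified diff and looks them up in a hand-written table. The `FlowMap` type, the path prefixes, and the flow names are all assumptions made for the example.

```typescript
// Hypothetical sketch: Canary's real diff analysis is not documented in this listing.

// Maps a source path prefix to the UI flows assumed to depend on it.
type FlowMap = Record<string, string[]>;

// Collect post-change file paths from unified diff headers ("+++ b/<path>").
// Deleted files appear as "+++ /dev/null" and are skipped.
function changedFiles(diff: string): string[] {
  const files: string[] = [];
  for (const line of diff.split("\n")) {
    if (line.startsWith("+++ b/")) {
      files.push(line.slice("+++ b/".length).trim());
    }
  }
  return files;
}

// Return the de-duplicated flows whose prefix matches at least one changed file.
function affectedFlows(diff: string, flowMap: FlowMap): string[] {
  const flows = new Set<string>();
  for (const file of changedFiles(diff)) {
    for (const prefix of Object.keys(flowMap)) {
      if (file.startsWith(prefix)) {
        flowMap[prefix].forEach((name) => flows.add(name));
      }
    }
  }
  return Array.from(flows);
}

// Sample input: one changed file under a hypothetical checkout directory.
const sampleDiff = [
  "diff --git a/src/checkout/Cart.tsx b/src/checkout/Cart.tsx",
  "--- a/src/checkout/Cart.tsx",
  "+++ b/src/checkout/Cart.tsx",
  "@@ -1,3 +1,4 @@",
  "+import { useCoupon } from './coupon';",
].join("\n");

const sampleFlowMap: FlowMap = {
  "src/checkout/": ["add to cart", "apply coupon"],
  "src/auth/": ["sign in"],
};

console.log(affectedFlows(sampleDiff, sampleFlowMap));
```

A static prefix table is the simplest possible mapping. An agent-driven tool would more plausibly infer the affected flows from the code itself, which this sketch does not attempt.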

    What It Is

    Canary sits at the intersection of browser automation and AI coding agents. Rather than forcing a choice between opaque agent runs that cannot be reproduced and raw Playwright scripts that must be written and maintained by hand, Canary combines the two: the agent performs the QA and hands back a reproducible Playwright script. Every session captures screen recordings, console logs, network requests, HAR files, and Playwright traces in a self-contained report.html that opens without a server or build step.

    How the Agent Workflow Works

    The core loop is straightforward: describe a UI flow in plain language, and the agent drives a real browser, then returns both a human-readable report and the exact Playwright script behind it. In Claude Code, Canary registers as a first-class plugin with slash commands (/canary:verify, /canary:session, /canary:run, /canary:review) and subagents. Cursor and Codex integrations are also available via their respective plugin marketplaces, all pointing at the same shared skills/, agents/, and commands/ directories.
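To make the "exact Playwright script" idea concrete, here is a minimal sketch of what such a script could look like. The listing does not show Canary's actual output format, so the function name `verifyLoginPage`, the URL, the selector, and the screenshot path are hypothetical. `PageLike` is a small stand-in modeled on the Playwright Page methods named in this listing (goto, waitForSelector, evaluate, screenshot), which lets the example run without a browser.

```typescript
// Stand-in for the subset of Playwright Page methods used below.
// Assumption: a script run for real would receive a Playwright Page instead.
interface PageLike {
  goto(url: string): Promise<unknown>;
  waitForSelector(selector: string): Promise<unknown>;
  evaluate(expression: string): Promise<unknown>;
  screenshot(options: { path: string }): Promise<unknown>;
}

// Hypothetical flow: open a login page, wait for the form, read the title,
// and capture a screenshot as evidence.
async function verifyLoginPage(page: PageLike, baseUrl: string): Promise<string> {
  await page.goto(`${baseUrl}/login`);
  await page.waitForSelector("form#login");
  const title = String(await page.evaluate("document.title"));
  await page.screenshot({ path: "login.png" });
  return title;
}

// Fake page that records each call so the flow can be exercised without a browser.
function recordingPage(log: string[]): PageLike {
  return {
    goto: async (url) => { log.push(`goto ${url}`); },
    waitForSelector: async (selector) => { log.push(`waitForSelector ${selector}`); },
    evaluate: async (expression) => { log.push(`evaluate ${expression}`); return "Sign in"; },
    screenshot: async ({ path }) => { log.push(`screenshot ${path}`); },
  };
}

const calls: string[] = [];
verifyLoginPage(recordingPage(calls), "http://localhost:3000").then((title) => {
  console.log(title, calls);
});
```

Because the script is ordinary code with explicit calls, it can be replayed later without involving the agent again.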

    Architecture: Three Tools, One Runtime

    Canary ships as a pnpm + Turborepo monorepo with three user-facing tools sharing a single background daemon:

    • @usecanary/cli (canary) — the main session orchestrator; records capture-enabled QA sessions and renders reports
    • @usecanary/browser (canary-browser) — a lightweight engine for quick, one-off browser automation with no recording overhead
    • @usecanary/ui (canary-viewer) — a local Astro-based viewer for browsing, searching, and replaying recorded sessions

    The daemon runs Playwright and a QuickJS WASM sandbox. Scripts execute in a sandboxed environment with no arbitrary host access — no Node.js module system, no direct filesystem or network access from script context, and enforced memory and CPU limits.

    What Gets Captured

    Every Canary session records a comprehensive evidence trail by default:

    • Video replay with a per-step filmstrip and scrubbing
    • Playwright trace (trace.zip) decodable with npx playwright show-trace
    • Network HAR with per-request headers, payloads, and response inspection
    • Console log filterable by level (errors, warnings, info, logs) with source URLs
    • Reproducible Playwright script — the exact calls (goto, waitForSelector, evaluate, screenshot) with params and timing
    • Self-contained report.html — one file, no server, committable and shareable

    Individual capture streams can be disabled with --no-trace, --no-video, --no-har, or --no-console.
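A captured HAR is plain JSON, so it can be inspected with a few lines of code. The sketch below lists failed requests from an already-parsed HAR object. It relies only on fields from the HAR 1.2 format (`log.entries[].request`, `response.status`, `time`). Treating a non-positive status as "no response recorded" is an assumption, and where Canary stores the HAR inside a session is not stated in this listing.

```typescript
// Subset of the HAR 1.2 structure needed for this summary.
interface HarEntry {
  request: { method: string; url: string };
  response: { status: number };
  time: number; // total elapsed time of the request, in milliseconds
}

interface Har {
  log: { entries: HarEntry[] };
}

// List failed requests: HTTP status 400 or higher, or a non-positive status,
// assumed here to mean that no response was recorded.
function failedRequests(har: Har): string[] {
  return har.log.entries
    .filter((entry) => entry.response.status >= 400 || entry.response.status <= 0)
    .map((entry) => `${entry.response.status} ${entry.request.method} ${entry.request.url}`);
}

// Sample data standing in for JSON.parse(<contents of a HAR file>).
const sampleHar: Har = {
  log: {
    entries: [
      { request: { method: "GET", url: "https://example.test/" }, response: { status: 200 }, time: 35 },
      { request: { method: "POST", url: "https://example.test/api/cart" }, response: { status: 500 }, time: 120 },
    ],
  },
};

console.log(failedRequests(sampleHar));
```

The same pattern extends to slow requests by filtering on `time` instead of `response.status`.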

    Setup Path

    Installation requires Node 20+ and pnpm. The quickest path is:

    npm i -g @usecanary/cli @usecanary/ui
    canary install   # one-time: downloads Chromium + runtime (~150 MB) into ~/.canary
    

    A guided wizard (npm create canary@latest) handles the full setup interactively. All commands also run one-off via npx without a global install. Agent plugin installation uses each agent's own marketplace mechanism — Claude Code via /plugin marketplace add wizenheimer/canary, Cursor via its Marketplace UI, and Codex via codex marketplace add wizenheimer/canary.

    Current Status

    The repository was created in June 2026 and had 344 stars and 19 forks as of mid-June 2026, with active development reflected in recent pushes. The project is MIT-licensed, with portions derived from MIT-licensed work by Sawyer Hood. GitHub metadata reports the repository's license as "NOASSERTION", although the LICENSE file itself contains standard MIT license text.

    Pricing

    Open Source

    Fully free and open-source under the MIT license. Install via npm and use without restrictions.

    • Full CLI session orchestration
    • Browser automation engine
    • Local session viewer
    • Claude Code, Cursor, and Codex plugin integrations
    • Playwright trace, video, HAR, and console capture

    Capabilities

    Key Features

    • Reads code diffs and identifies affected UI flows
    • Drives real browser instances via Playwright
    • QuickJS WASM sandbox with full Playwright Page API
    • Full session recordings with video replay
    • Playwright trace capture and decoding
    • Network HAR capture with per-request inspection
    • Console log capture filterable by level
    • Reproducible Playwright scripts generated from every run
    • Self-contained report.html with no server required
    • Claude Code plugin with slash commands and subagents
    • Cursor and Codex plugin integrations
    • Background daemon with automatic lifecycle management
    • One-off browser automation via canary-browser
    • Local session viewer via canary-viewer
    • Sandboxed script execution with memory and CPU limits
    • CI-ready script replay with zero inference cost
    • Attach to existing Chrome via remote debugging port

    Integrations

    Claude Code
    Cursor
    Codex
    Playwright
    Chromium
    QuickJS WASM
    pnpm
    Turborepo
    Biome (Ultracite)
    pino (structured logging)

    Developer

    wizenheimer

    wizenheimer builds Canary, an open-source QA harness designed for AI coding agents like Claude Code, Cursor, and Codex. The project focuses on making agent-driven browser automation reproducible by combining real Playwright session capture with reusable script generation. Canary is developed as a pnpm + Turborepo monorepo and distributed via npm under the `@usecanary` scope.

    Similar Tools

    Glance

    An open-source MCP server that gives AI agents real browser control — navigate, click, screenshot, and run tests with Claude seeing your actual app.

    mochi.js

    A Bun-native, raw-CDP browser automation framework that generates relationally-coherent fingerprints to evade detection probes, with behavioral synthesis and Chromium-native fetch.

    Expect CLI

    A CLI tool that scans code changes, generates AI-powered test plans, and runs them against a live browser with video recording of every bug found.

    Related Topics

    Automated Testing

    AI-powered platforms that automate end-to-end testing processes with intelligent test case generation, execution, and reporting for faster, more reliable software delivery.

    94 tools

    Browser Automation

    AI-powered agents that autonomously navigate and interact with web applications to automate repetitive tasks, extract data, fill forms, and perform web-based workflows using intelligent understanding of page structure and content.

    88 tools

    Agent Harness

    Infrastructure, orchestrators, and task runners that wrap around LLM coding agents — covering session management, context delivery, worktree isolation, architecture enforcement, and issue-to-PR pipelines.

    95 tools