Browser Harness
A self-healing, minimal browser automation harness that gives LLMs complete freedom to complete any browser task, built directly on Chrome DevTools Protocol (CDP).
At a Glance
About Browser Harness
Browser Harness is a lightweight, self-healing browser automation harness (~592 lines of Python) that connects LLMs directly to Chrome via a single WebSocket using the Chrome DevTools Protocol (CDP). It gives AI agents complete freedom to complete any browser task without frameworks, recipes, or rails. When a helper function is missing mid-task, the agent writes it itself — making the harness genuinely self-improving over time. It supports both local browsers and free remote browsers with proxies and captcha solving.
- Self-healing architecture: When the agent encounters a missing tool or helper function, it edits
helpers.pyon the fly and writes the missing capability, then continues the task without interruption. - Direct CDP connection: Built on a single WebSocket to Chrome with no intermediary framework — just
run.py,helpers.py, and a daemon bridge (admin.py+daemon.py). - Domain skills system: Agents generate and save reusable skill files under
domain-skills/for sites like GitHub, LinkedIn, and Amazon, capturing selectors and flows so they don't have to rediscover them. - Free remote browsers: Integrates with cloud.browser-use.com for up to 3 concurrent remote browsers with proxies and captcha solving — no credit card required.
- Minimal footprint: The entire harness is ~592 lines of Python across a handful of files, making it easy to audit, fork, and extend.
- LLM-agnostic setup: Paste the setup prompt into Claude Code or Codex to have the agent install and configure the harness against your real browser automatically.
- Open source (MIT): Freely available on GitHub under the MIT License — PRs, domain skill contributions, and bug fixes are all welcome.
Community Discussions
Be the first to start a conversation about Browser Harness
Share your experience with Browser Harness, ask questions, or help others learn from your insights.
Pricing
Open Source (MIT)
Fully free and open-source under the MIT License. Use, modify, and distribute freely.
- Full source code access
- Self-healing browser harness
- Domain skills system
- Direct CDP WebSocket connection
- Community contributions welcome
Capabilities
Key Features
- Self-healing helper functions written by the agent mid-task
- Direct Chrome DevTools Protocol (CDP) WebSocket connection
- Domain skills system for reusable site-specific knowledge
- Free remote browsers with proxies and captcha solving
- Minimal ~592-line Python codebase
- LLM-agnostic (works with Claude Code, Codex, etc.)
- Agent-generated skill files for GitHub, LinkedIn, Amazon, and more
- No framework, no recipes, no rails
