Babysitter

Name: Babysitter
Availability: OnlineOnly
Author: a5c.ai

An open-source orchestration layer that wraps AI coding agents in quality gates, convergent loops, and deterministic workflows to ensure reliable, verified task completion.

Visit Website

At a Glance

Pricing

Open Source

Fully free and open-source under the MIT License. Self-hosted, no telemetry, no licensing fees.

Engagement

Available On

Web

API

VS Code

SDK

CLI

a5c.aiTel Aviv, IsraelEst. 2024

Listed Jun 2026

About Babysitter

Babysitter is an open-source orchestration layer built by a5c.ai that wraps AI coding agents in quality gates, convergent loops, and an immutable event-sourced journal. Licensed under MIT and available on GitHub, it enforces deterministic, hallucination-free self-orchestration so that "done" means gates passed — not just agent confidence. The project was created in January 2026 and has accumulated over 1,100 GitHub stars.

What It Is

Babysitter sits between a developer and their AI coding agent, acting as a process enforcement layer rather than a simple prompt wrapper. Workflows are defined as JavaScript code; the agent can only execute what the process permits. Every task, gate, and decision is recorded in an immutable journal stored in .a5c/runs/, enabling deterministic replay and resumption from any point. The homepage describes it as "CI/CD for AI agents" — your agent codes, Babysitter verifies.

How the Enforcement Architecture Works

The core mechanism is a mandatory stop-and-check loop. After every step, the agent is forced to halt; the process code then decides what is permitted next. Quality gates are real code logic — tests, lints, validators — not configuration suggestions. If a gate fails, the agent cannot progress until it passes. Key architectural properties include:

Process as Code: Workflows are JavaScript functions, not YAML config
Mandatory Stop Hook: Claude (or any harness) cannot "keep running" past a gate
Event-Sourced Journal: All state is persisted, enabling resume from any checkpoint
Cryptographic Completion Proof: A secret emits only when all gates have passed
4-Layer Token Compression: Built-in subsystem the project claims reduces context window usage by 50–67% while maintaining 99% fact retention

Supported Harnesses and Platforms

Babysitter ships plugins for multiple AI coding agents. Claude Code is the recommended and natively supported harness. Additional harness plugins — some labeled experimental — cover Codex CLI, Cursor IDE, Gemini CLI, GitHub Copilot, Pi, Oh-My-Pi, and OpenCode. An internal harness (--harness internal) runs processes programmatically without any external AI agent, making it suitable for CI/CD pipelines and headless automation. The SDK is published to npm as @a5c-ai/babysitter-sdk.

The Process Library and Plugin System

The project ships with what it describes as 2,000+ built-in processes — executable blueprints covering development (TDD, refactoring, code review), security (pentesting, compliance audits), research (scientific discovery pipelines), and operations (CI/CD, incident response). Community methodologies such as BMAD, CC10X, and Gas Town can be integrated via /babysitter:assimilate. The plugin system distributes natural language instruction sets (markdown) and deterministic coded processes (JS) that an AI agent reads and executes at install time, including setup for security tooling, testing frameworks, deployment configs, and CI/CD pipelines.

Operating Modes

Four primary modes give teams control over autonomy level:

Interactive (/babysitter:call): Pauses at breakpoints for human approval
Plan First (/babysitter:plan): Research and architect before executing
YOLO (/babysitter:yolo): Full autonomy, no interruptions
Forever (/babysitter:forever): Continuous operation for monitoring or periodic tasks

Update: v0.0.187

The latest release is v0.0.187, published April 4, 2026. The repository was last pushed to on June 1, 2026, indicating active development. The project was created in January 2026 and has grown to 1,133 stars and 71 forks. The enterprise page at a5c.ai/enterprise describes an "Org AI Transformation" offering focused on autonomous workflow orchestration, governance, and an internal AI Task Force engagement model.

Community Discussions

Be the first to start a conversation about Babysitter

Share your experience with Babysitter, ask questions, or help others learn from your insights.

Pricing

OPEN SOURCE

Open Source

Fully free and open-source under the MIT License. Self-hosted, no telemetry, no licensing fees.

Full orchestration layer with quality gates
2,000+ built-in processes
Multi-harness support (Claude Code, Codex, Cursor, Gemini, Copilot)
Internal harness for CI/CD automation
4-layer token compression

Capabilities

Key Features

Quality gates that block agent progression until tests/lints/validators pass
Convergent loops that retry until gates are satisfied
Immutable event-sourced journal for deterministic replay and resume
Cryptographic completion proof (secret emits only when truly done)
2,000+ built-in battle-tested processes
Four operating modes: Interactive, Plan First, YOLO, Forever
4-layer token compression subsystem (50-67% context reduction claimed)
Plugin system for security, testing, deployment, CI/CD, and themes
Internal harness for headless CI/CD automation without external AI agent
Multi-harness support: Claude Code, Codex, Cursor, Gemini CLI, GitHub Copilot
Real-time monitoring via /babysitter:observe
Human-in-the-loop breakpoints with structured approval context
Parallel task execution with dependency management
Run resumption from any checkpoint
Zero telemetry — no data sent home

Integrations

Claude Code

Codex CLI

Cursor IDE

Gemini CLI

GitHub Copilot

Oh-My-Pi

OpenCode

LiteLLM

GPT (via LiteLLM)

Local models (via LiteLLM)

GitHub Actions

Terraform

Helm

Docker

Vitest

Playwright

pytest

ESLint

Gitleaks

npm

API Available

View Docs

Back to all tools Suggest an edit