Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    2,111+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1228
    • Coding1045
    • Infrastructure455
    • Marketing414
    • Design374
    • Projects340
    • Analytics319
    • Research306
    • Testing200
    • Data171
    • Integration169
    • Security169
    • MCP164
    • Learning146
    • Communication131
    • Prompts122
    • Extensions120
    • Commerce116
    • Voice107
    • DevOps92
    • Web73
    • Finance19
    1. Home
    2. Tools
    3. Rocky
    Rocky icon

    Rocky

    Data Processing
    Featured

    A Rust-based open-source control plane for warehouse pipelines with branches, replay, column-level lineage, compile-time safety, and per-model cost attribution.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open source under the Apache License 2.0. All features included.

    Engagement

    Available On

    Windows
    macOS
    Linux
    API
    VS Code

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Data ProcessingData GovernanceWorkflow Automation

    Alternatives

    DagsterBruinBenchling
    Developer
    rocky-datarocky-data builds Rocky, an open-source Rust-based control p…

    Listed Apr 2026

    About Rocky

    Rocky is a trust system for data pipelines — a Rust-based control plane that brings branches, replay, column-level lineage, compile-time safety, and per-model cost attribution to your existing data warehouse. It works alongside Databricks or Snowflake, handling the DAG orchestration layer without requiring you to replace your warehouse. Rocky ships as a CLI binary, a Dagster integration, and a VS Code extension, and runs locally on DuckDB for zero-credential playground exploration.

    • Schema drift detection: Rocky diffs source vs. target on every run, automatically dropping and recreating targets when upstream column types change — no silent data corruption.
    • Compile-time data contracts: Missing required columns, protected column removals, or unsafe type changes surface as diagnostic codes (E010, E013) before a single row is written.
    • Named branches: Create isolated schema branches for risk-free experiments, inspect results, then drop or promote — with column-level lineage showing downstream blast radius before shipping.
    • Column-level lineage: Trace a single column from a downstream fact table all the way back to its seed, enabling precise blast-radius analysis without reading every model.
    • AI model generation: Describe a transformation in plain English; Rocky generates a Rocky DSL model, compiles it, and retries automatically on parse failure.
    • PR-time lineage diff: rocky lineage-diff compares two git refs and outputs per-changed-column downstream consumer readouts as Markdown, ready to drop into a GitHub PR comment.
    • Classification, masking, and compliance: Tag PII columns in model sidecars, bind tags to mask strategies per environment, and gate CI with rocky compliance --fail-on exception.
    • Incremental loads with watermark state: Use strategy = "incremental" with a timestamp_column to persist high-water marks and only process deltas on subsequent runs.
    • Dagster integration: The dagster-rocky PyPI wheel wraps the Rocky CLI as a Dagster resource and component for orchestration workflows.
    • VS Code extension: An LSP client and command palette for AI-assisted model generation and pipeline navigation directly in the editor.
    • Adapter SDK: Build custom warehouse adapters (ClickHouse, Trino, Redshift, etc.) using the documented Rust-native adapter skeleton.

    Community Discussions

    Be the first to start a conversation about Rocky

    Share your experience with Rocky, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open source under the Apache License 2.0. All features included.

    • Schema drift detection and auto-recovery
    • Compile-time data contract enforcement
    • Named branches for isolated experiments
    • Column-level lineage tracing
    • AI model generation

    Capabilities

    Key Features

    • Schema drift detection and auto-recovery
    • Compile-time data contract enforcement
    • Named branches for isolated pipeline experiments
    • Column-level lineage tracing
    • AI model generation with compile-validate loop
    • PR-time lineage diff with blast-radius analysis
    • PII classification, masking, and compliance gating
    • Incremental loads with persistent watermark state
    • Per-model cost attribution
    • Dagster integration via dagster-rocky wheel
    • VS Code extension with LSP client
    • Adapter SDK for custom warehouse backends
    • Local DuckDB playground with no credentials required

    Integrations

    Databricks
    Snowflake
    DuckDB
    Dagster
    VS Code
    GitHub Actions
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate Rocky and help others make informed decisions.

    Developer

    rocky-data

    rocky-data builds Rocky, an open-source Rust-based control plane for data warehouse pipelines. The project delivers compile-time safety, column-level lineage, schema drift detection, and branching for teams using Databricks or Snowflake. Rocky is free and open source under the Apache 2.0 license, with sponsorship available to sustain continued development.

    Read more about rocky-data
    WebsiteGitHub
    1 tool in directory

    Similar Tools

    Dagster icon

    Dagster

    A unified control plane for teams to build, scale, and observe AI and data pipelines with confidence.

    Bruin icon

    Bruin

    AI data analyst that answers data questions directly in Slack, backed by an end-to-end open-source ELT/ETL data platform with SQL & Python pipelines.

    Benchling icon

    Benchling

    Cloud-based R&D platform for life sciences that unites data, automation, and AI to accelerate biotech research and discovery.

    Browse all tools

    Related Topics

    Data Processing

    AI-enhanced ETL (Extract, Transform, Load) tools and data pipelines that automate the processing, cleaning, and transformation of large datasets with intelligent optimizations.

    80 tools

    Data Governance

    AI-driven tools for managing data quality, privacy, compliance, and security across organizations with automated monitoring and policy enforcement.

    8 tools

    Workflow Automation

    Agents designed to automate complex business and development workflows.

    210 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Discussions