    lakeFS

    Version Control

    lakeFS is a data version control platform that brings Git-like branching, merging, and rollback capabilities to data lakes, enabling AI and data teams to manage data lifecycle, provenance, and access at scale.

    At a Glance

    Pricing

    Open Source
    Free tier available

    Free forever open-source version of lakeFS with core data version control features.

    Enterprise: Custom/contact

    Available On

    Web
    API
    Linux
    macOS
    Windows

    Resources

    Website, Docs, GitHub, llms.txt

    Topics

    Version Control, Data Governance, AI Infrastructure

    Listed Mar 2026

    About lakeFS

    lakeFS is a scalable data version control system built by Treeverse that applies proven software engineering practices to data lake management. It enables teams to branch, merge, commit, and roll back data just like code, providing isolated environments for testing, reproducible ML experiments, and atomic data promotion. Trusted by organizations like Netflix, Volvo, Lockheed Martin, and Amazon, lakeFS integrates with virtually every major data and AI stack without moving data out of your storage. It is available as an open-source project and as a managed Enterprise offering with advanced security and governance features.

    • Data Branching & Merging: Create zero-copy branches of your data lake for isolated testing and experimentation, then atomically merge changes back to production.
    • Format-Agnostic Version Control: Works with any data format—Parquet, CSV, Avro, JSON, Delta Lake, Iceberg, Hudi, and unstructured data like images and video.
    • Data CI/CD with Hooks: Enforce data quality and compliance standards automatically using lakeFS hooks before changes reach production.
    • Instant Rollback: Recover from data incidents immediately by reverting to any previous commit without duplicating data.
    • Audit Trail & Lineage: Gain full visibility into data history with built-in audit logs to satisfy model governance and compliance requirements.
    • Role-Based Access Control (RBAC): Enterprise plan includes RBAC, SSO, SCIM, and IAM Roles for fine-grained, secure access management across teams.
    • lakeFS Mount: Virtually mount remote lakeFS repositories as a local filesystem for high-performance deep learning workloads.
    • Transactional Mirroring: Replicate repositories to remote regions for disaster recovery and data locality without data inconsistency.
    • Broad Integrations: Connects natively with Spark, Databricks, Airflow, Kafka, Flink, Airbyte, dbt, MLflow, Kubeflow, AWS SageMaker, and many more tools.
    • Cloud & Storage Agnostic: Supports AWS S3, Azure Blob, Google Cloud Storage, MinIO, Ceph, Dell EMC, and on-premises storage via the S3 interface.
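
The branching, merging, hook, and rollback features listed above can be illustrated with a small toy model. This is a sketch of the general idea only, not the lakeFS API or its internal design: the names `ToyRepo`, `Commit`, `pre_merge_hooks`, and the object addresses are hypothetical. It assumes a commit is an immutable map from logical paths to object addresses, so a branch is just a pointer and creating one copies no data.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


# Toy model only: NOT the lakeFS API. A commit is an immutable snapshot that
# maps logical paths to addresses of objects in the underlying store.
@dataclass(frozen=True)
class Commit:
    id: int
    parent: Optional[int]
    objects: Dict[str, str]  # logical path -> physical object address
    message: str


class ToyRepo:
    def __init__(self) -> None:
        self.commits: List[Commit] = [Commit(0, None, {}, "init")]
        self.branches: Dict[str, int] = {"main": 0}  # branch name -> commit id
        self.pre_merge_hooks: List[Callable[[Dict[str, str]], bool]] = []

    def head(self, branch: str) -> Commit:
        return self.commits[self.branches[branch]]

    def create_branch(self, name: str, source: str) -> None:
        # Zero-copy: the new branch is another pointer to the same commit.
        self.branches[name] = self.branches[source]

    def commit(self, branch: str, changes: Dict[str, Optional[str]], message: str) -> Commit:
        # Only the path map is copied; a value of None deletes a path.
        objects = dict(self.head(branch).objects)
        for path, address in changes.items():
            if address is None:
                objects.pop(path, None)
            else:
                objects[path] = address
        new = Commit(len(self.commits), self.branches[branch], objects, message)
        self.commits.append(new)
        self.branches[branch] = new.id
        return new

    def merge(self, source: str, dest: str) -> Commit:
        # Simplification: source entries win and deletions are ignored.
        # A real system would do a three-way diff and detect conflicts.
        merged = {**self.head(dest).objects, **self.head(source).objects}
        for hook in self.pre_merge_hooks:
            if not hook(merged):
                raise ValueError("pre-merge hook rejected the merge")
        return self.commit(dest, merged, f"merge {source} into {dest}")

    def rollback(self, branch: str, commit_id: int) -> None:
        # Simplification: move the branch pointer back; no data is rewritten.
        self.branches[branch] = commit_id


repo = ToyRepo()
repo.commit("main", {"sales/2024.parquet": "obj-a1"}, "load sales")
repo.create_branch("experiment", source="main")
repo.commit("experiment", {"sales/2025.parquet": "obj-b2"}, "add 2025 data")

# Hypothetical quality gate: only Parquet files may reach main.
repo.pre_merge_hooks.append(lambda objs: all(p.endswith(".parquet") for p in objs))
repo.merge("experiment", "main")
```

In this model the work on `experiment` is invisible to `main` until the merge, and `repo.rollback("main", 1)` would restore the state from before the merge by moving a single pointer.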

    To get started, run lakeFS locally using the quickstart guide at docs.lakefs.io, or sign up for lakeFS Cloud. Connect your existing object storage, create a repository, and begin branching your data just like a Git workflow.
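
As a rough illustration of what "branching your data" looks like from a client's point of view, the sketch below builds object addresses under the convention described in the lakeFS documentation, where the repository takes the place of the bucket and the branch (or other ref) is the first path component. The helper functions, the repository name `example-repo`, and the file paths are hypothetical and exist only for this example; check the official docs for the exact form your client expects.

```python
def lakefs_uri(repository: str, ref: str, path: str) -> str:
    """Build a lakefs:// style address: lakefs://<repository>/<ref>/<path>."""
    return f"lakefs://{repository}/{ref}/{path.lstrip('/')}"


def s3_gateway_uri(repository: str, ref: str, path: str, scheme: str = "s3") -> str:
    """Build an S3-style address in which the repository acts as the bucket
    and the ref (branch, tag, or commit) is the first key component."""
    return f"{scheme}://{repository}/{ref}/{path.lstrip('/')}"


# The same logical file, addressed on two branches of a hypothetical repository.
prod = s3_gateway_uri("example-repo", "main", "tables/events/part-0.parquet")
dev = s3_gateway_uri("example-repo", "experiment", "tables/events/part-0.parquet")
```

Under this convention, switching a job from production data to an isolated branch is a matter of changing one path component rather than copying files.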

    Pricing

    FREE

    Free Plan Available

    Free forever open-source version of lakeFS with core data version control features.

    • Format-Agnostic Data Version Control
    • Cloud-Agnostic
    • Zero-copy clones for isolated environments (via branches)
    • Atomic Data Promotion (via merges)
    • Data Stays in One Place

    Enterprise

    Full-featured enterprise plan with unlimited seats, advanced security, governance, and SLA support.

    Custom pricing (contact sales)
    • All Open Source features
    • Role-Based Access Control (RBAC)
    • Single Sign On (SSO)
    • SCIM Support
    • IAM Roles
    • Mount Capability
    • Audit Logs
    • Transactional Mirroring
    • Iceberg REST Catalog
    • Metadata Search
    • Multiple Storage Backends Support
    • Simplified Garbage Collection (Managed or Standalone)
    • SOC2
    • Support SLA
    • Unlimited seats
    Capabilities

    Key Features

    • Data branching and merging (zero-copy)
    • Atomic data promotion via merges
    • Data CI/CD using lakeFS Hooks
    • Instant rollback from data incidents
    • Built-in audit trail and data lineage
    • Role-Based Access Control (RBAC)
    • Single Sign-On (SSO)
    • SCIM Support
    • IAM Roles authentication
    • lakeFS Mount for local filesystem access
    • Transactional Mirroring (cross-region)
    • Configurable Garbage Collection
    • Metadata Search
    • Iceberg REST Catalog
    • Multiple Storage Backends Support
    • Format-agnostic version control
    • Cloud-agnostic deployment
    • Private-link support
    • SOC2 compliance

    Integrations

    Amazon S3
    Azure Blob Storage
    Google Cloud Storage
    MinIO
    Ceph
    Dell EMC
    VastData
    Apache Spark
    Trino
    Presto
    Databricks
    Snowflake
    AWS Glue
    StarBurst
    Apache Hive
    AWS EMR
    GCP DataProc
    Cloudera
    Azure Synapse
    AWS Athena
    Dremio
    DuckDB
    Apache Kafka
    Apache Flink
    Airbyte
    Fivetran
    AWS Kinesis
    GCP PubSub
    Delta Lake
    Apache Iceberg
    Apache Hudi
    Apache Airflow
    Argo Workflows
    Dagster
    Prefect
    Kubeflow
    Metaflow
    dbt
    AWS SageMaker
    MLflow
    Weights & Biases
    Ray
    Dask
    Jupyter
    Pandas
    Great Expectations
    Monte Carlo Data
    Labelbox
    API Available

    Developer

    Treeverse

    Treeverse builds lakeFS, a scalable data version control platform that brings Git-like operations to data lakes. Founded in 2020 by Oz Katz and Dr. Einat Orr, the company applies proven software engineering practices to data management challenges at scale. Treeverse serves thousands of organizations worldwide, from Fortune 100 enterprises to fast-growing AI teams, helping them deliver data and ML projects faster and with greater confidence. The company is backed by Norwest Venture Partners, Zeev Ventures, Dell Technologies Capital, and Maor Investments.

    Founded 2020
    Campbell, CA
    $51 raised
    37 employees

    Used by

    Volvo
    Netflix
    Lockheed Martin
    Overture Maps
    +3 more