Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
Create
Sign In
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    1,547+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    Categories
    • Coding733
    • Agents640
    • Marketing302
    • Infrastructure298
    • Design239
    • Analytics228
    • Research224
    • Projects207
    • Integration148
    • Testing129
    • Data125
    • Learning115
    • MCP113
    • Security107
    • Extensions94
    • Prompts79
    • Communication73
    • Voice71
    • Commerce70
    • Web59
    • DevOps46
    • Finance12
    Sign In
    1. Home
    2. Tools
    3. DVC (Data Version Control)
    DVC (Data Version Control) icon

    DVC (Data Version Control)

    Version Control

    DVC is an open-source Git extension that brings version control to data, models, and ML pipelines, enabling reproducible data science workflows.

    Visit Website

    At a Glance

    Pricing

    Open Source

    Free and open-source Git extension for data version control, ML pipelines, and experiment tracking.

    Engagement

    Available On

    Windows
    macOS
    Linux
    VS Code
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Version ControlAI InfrastructureData Processing

    Listed Mar 2026

    About DVC (Data Version Control)

    DVC (Data Version Control) is a free, open-source tool that applies Git-like version control to datasets, machine learning models, and experiment pipelines. It works as a Git extension, allowing data scientists and ML engineers to track large files, manage experiments, and reproduce results without changing their existing Git workflows. DVC is used by thousands of teams ranging from individual data scientists to Fortune 500 companies, and is now part of the lakeFS family for enterprise-scale data versioning.

    • Git-like data versioning: Track datasets and model files using .dvc pointer files committed to Git, while actual data is stored in remote storage (S3, GCS, Azure, SSH, etc.).
    • ML pipeline management: Define and run reproducible ML pipelines with dvc run and dvc repro, automatically caching intermediate stages.
    • Experiment tracking: Compare, switch between, and reproduce experiments using dvc exp commands without leaving the terminal.
    • Remote storage support: Push and pull data to/from cloud storage backends including Amazon S3, Google Cloud Storage, Azure Blob Storage, and more.
    • VS Code extension: Use the DVC VS Code extension for a graphical interface to manage experiments, plots, and pipelines directly in the editor.
    • Language and framework agnostic: Works with any programming language or ML framework — Python, R, Julia, and beyond.
    • Open source and community-driven: Actively maintained on GitHub with 15,000+ stars and a vibrant Discord community for support.
    • Enterprise scaling via lakeFS: For large-scale AI/ML infrastructure needs, DVC integrates with lakeFS for petabyte-scale multimodal object stores and data lakes.

    To get started, install DVC via pip (pip install dvc), initialize it in a Git repo with dvc init, and begin tracking data files with dvc add. The documentation at doc.dvc.org provides comprehensive guides for all major workflows.

    DVC (Data Version Control) - 1

    Community Discussions

    Be the first to start a conversation about DVC (Data Version Control)

    Share your experience with DVC (Data Version Control), ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Free and open-source Git extension for data version control, ML pipelines, and experiment tracking.

    • Git-like data versioning
    • ML pipeline management
    • Experiment tracking
    • Remote storage support (S3, GCS, Azure, SSH)
    • VS Code extension
    View official pricing

    Capabilities

    Key Features

    • Git-like data versioning
    • ML pipeline management
    • Experiment tracking and comparison
    • Remote storage support (S3, GCS, Azure, SSH)
    • VS Code extension
    • Language and framework agnostic
    • Reproducible ML workflows
    • Data caching
    • Open source

    Integrations

    Git
    Amazon S3
    Google Cloud Storage
    Azure Blob Storage
    SSH
    HDFS
    VS Code
    lakeFS
    GitHub
    GitLab
    Bitbucket
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate DVC (Data Version Control) and help others make informed decisions.

    Developer

    Iterative (lakeFS)

    Iterative builds open-source tools for data version control and ML experiment management, now operating under the lakeFS umbrella. The team created DVC to bring software engineering best practices to data science and ML workflows. lakeFS provides enterprise-scale data versioning infrastructure for complex AI operations and big data environments. Together, they serve thousands of users from individual data scientists to Fortune 500 companies.

    Founded 2020
    Tel Aviv, Israel
    $43+ raised
    45 employees

    Used by

    Netflix
    Volvo
    Lockheed Martin
    Paige.ai
    +3 more
    Read more about Iterative (lakeFS)
    WebsiteGitHubX / Twitter
    1 tool in directory

    Similar Tools

    lakeFS icon

    lakeFS

    lakeFS is a data version control platform that brings Git-like branching, merging, and rollback capabilities to data lakes, enabling AI and data teams to manage data lifecycle, provenance, and access at scale.

    xmloxide icon

    xmloxide

    xmloxide is an open-source Rust library for parsing and manipulating XML documents with a focus on performance and safety.

    DoltHub icon

    DoltHub

    DoltHub is a collaboration platform for Dolt, the world's first version-controlled SQL database that combines Git-like branching and merging with a MySQL-compatible interface.

    Browse all tools

    Related Topics

    Version Control

    AI tools that enhance version control systems and code management.

    16 tools

    AI Infrastructure

    Infrastructure designed for deploying and running AI models.

    152 tools

    Data Processing

    AI-enhanced ETL (Extract, Transform, Load) tools and data pipelines that automate the processing, cleaning, and transformation of large datasets with intelligent optimizations.

    61 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Sign in
    0views
    0upvotes
    0discussions