    VOID: Video Object and Interaction Deletion

    VOID removes objects from videos along with all physical interactions they induce on the scene, such as objects falling when a person is removed.

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under Apache License 2.0. Download model checkpoints from HuggingFace and run locally.

    Available On

    CLI
    API
    SDK

    Resources

    Website, Docs, GitHub, llms.txt

    Topics

    Image, Video, Content Generation

    Alternatives

    SkyReels V4, Vision Agents, Google Flow
    Developer
    Netflix, Los Gatos, Est. 1997, $3.1B+ raised

    Listed Apr 2026

    About VOID: Video Object and Interaction Deletion

    VOID (Video Object and Interaction Deletion) is an open-source video inpainting model developed by Netflix Research. It removes objects from videos and also eliminates the physical interactions those objects cause: not only visual artifacts such as shadows, but dynamic effects such as objects falling or being displaced. Built on CogVideoX and fine-tuned for interaction-aware video inpainting, VOID uses a quadmask conditioning scheme with four values to distinguish primary objects, overlap regions, affected regions, and background. Inference runs in two sequential passes: Pass 1 performs base inpainting, and Pass 2 refines temporal consistency using warped noise, which improves results on longer video clips.

    • Interaction-aware removal — Uses a 4-value quadmask (primary object, overlap, affected region, background) to model and remove physical interactions like falling objects, not just the target object itself.
    • Two-pass inference pipeline — Pass 1 performs base inpainting; Pass 2 applies optical flow-warped latent initialization for improved temporal consistency on longer clips.
    • VLM-powered mask generation — The VLM-MASK-REASONER pipeline uses SAM2 segmentation and Gemini for automated reasoning about interaction-affected regions, generating quadmasks from raw video.
    • Manual mask refinement GUI — An included GUI editor allows frame-by-frame refinement of quadmasks with brush tools, grid toggles, and undo/redo support.
    • Synthetic training data pipelines — Provides two data generation pipelines: HUMOTO (human-object interaction via Blender/motion capture) and Kubric (physics-based object interaction), both producing paired counterfactual videos.
    • Google Colab notebook — A ready-to-run notebook handles setup, model download, and inference on sample videos; requires a GPU with 40GB+ VRAM (e.g., A100).
    • HuggingFace model hosting — Both VOID Pass 1 and Pass 2 checkpoints are available on HuggingFace for direct download and use.
    • Apache 2.0 licensed — Fully open-source; free to use, modify, and distribute under the Apache License 2.0.
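
    To make the quadmask idea from the list above concrete, here is a minimal sketch of how two binary masks (one for the primary object, one for the interaction-affected region) could be merged into a single 4-value mask. The function name `build_quadmask` and the gray levels 0, 63, 127, and 255 are assumptions chosen for illustration; this page does not state the values VOID actually uses. Plain nested lists are used to keep the example dependency-free, whereas a real pipeline would most likely operate on image arrays.

```python
# Assumed gray levels for the four classes (illustrative, not confirmed by this page).
PRIMARY, OVERLAP, AFFECTED, BACKGROUND = 0, 63, 127, 255


def build_quadmask(primary, affected):
    """Merge two same-sized binary masks (nested lists of 0/1) into one 4-value mask."""
    same_shape = len(primary) == len(affected) and all(
        len(p_row) == len(a_row) for p_row, a_row in zip(primary, affected)
    )
    if not same_shape:
        raise ValueError("masks must have the same shape")

    quad = []
    for p_row, a_row in zip(primary, affected):
        row = []
        for p, a in zip(p_row, a_row):
            if p and a:
                row.append(OVERLAP)      # pixel belongs to both masks
            elif p:
                row.append(PRIMARY)      # object to remove
            elif a:
                row.append(AFFECTED)     # region changed by the object's interaction
            else:
                row.append(BACKGROUND)   # untouched scene content
        quad.append(row)
    return quad


# Hypothetical 2x4 frame: the object covers the left columns,
# and the affected region partly overlaps it in the first row.
primary = [[1, 1, 0, 0],
           [1, 1, 0, 0]]
affected = [[0, 1, 1, 0],
            [0, 0, 0, 0]]
quad = build_quadmask(primary, affected)
```

    The overlap class is kept separate so that pixels covered by both the object and its side effects are not silently assigned to one or the other.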

    Pricing

    • Apache License 2.0
    • Full source code access on GitHub
    • VOID Pass 1 and Pass 2 model checkpoints on HuggingFace
    • VLM-MASK-REASONER pipeline
    • Training data generation pipelines (HUMOTO + Kubric)

    Capabilities

    Key Features

    • Interaction-aware video object removal
    • Two-pass inference pipeline (base + warped-noise refinement)
    • Quadmask conditioning (4-value semantic mask)
    • VLM-powered automated mask generation (SAM2 + Gemini)
    • Manual quadmask refinement GUI
    • HUMOTO-based synthetic training data generation (Blender)
    • Kubric-based synthetic training data generation
    • Google Colab notebook for quick start
    • HuggingFace model hosting (Pass 1 and Pass 2 checkpoints)
    • Batch inference support
    • Optical flow-based temporal consistency (Pass 2)
    • DeepSpeed ZeRO stage 2 training support
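
    The optical flow-based temporal consistency feature listed above relies on warped noise. As a rough intuition only, the toy sketch below moves a noise grid along a flow field so that the same scene content receives the same noise sample in consecutive frames. Everything here is an assumption made for illustration: the helper names `make_noise` and `warp_noise`, the integer `(dy, dx)` flow format, and the nearest-neighbor lookup. An actual implementation would work on latent tensors with sub-pixel flow and would take care to preserve the noise statistics.

```python
import random


def make_noise(height, width, seed=0):
    """Return a height x width grid of standard Gaussian noise samples."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(height)]


def warp_noise(prev_noise, flow, seed=1):
    """Warp the previous frame's noise along an integer flow field.

    flow[y][x] = (dy, dx) is the assumed displacement that carried content
    from the previous frame to pixel (y, x), so the source pixel is
    (y - dy, x - dx). Pixels whose source lies outside the frame get fresh noise.
    """
    rng = random.Random(seed)
    height, width = len(prev_noise), len(prev_noise[0])
    warped = []
    for y in range(height):
        row = []
        for x in range(width):
            dy, dx = flow[y][x]
            src_y, src_x = y - dy, x - dx
            if 0 <= src_y < height and 0 <= src_x < width:
                row.append(prev_noise[src_y][src_x])   # reuse noise for the same content
            else:
                row.append(rng.gauss(0.0, 1.0))        # newly revealed area
        warped.append(row)
    return warped


# Hypothetical example: the whole scene shifts one pixel to the right.
noise = make_noise(2, 3)
flow = [[(0, 1)] * 3 for _ in range(2)]
warped = warp_noise(noise, flow)
```

    Because the noise follows the motion instead of being resampled per frame, a denoiser starting from it has less reason to produce flicker between frames.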

    Integrations

    CogVideoX
    SAM2
    Gemini (Google AI API)
    HuggingFace
    Google Colab
    Blender
    Kubric
    DeepSpeed
    HUMOTO
    ffmpeg
    VideoX-Fun
    API Available

    Developer

    Netflix

    Netflix builds streaming entertainment products and invests heavily in applied research across machine learning, computer vision, and generative AI. The Netflix Research team develops open-source models and tools — including VOID — that push the boundaries of video understanding and synthesis. The team behind VOID includes researchers from Netflix and INSAIT, Sofia University, combining expertise in computer vision, generative models, and video production technology.

    Founded 1997
    121 Albright Way, Los Gatos, CA 95032
    $3.1B+ raised
    12,800 employees

    Used by

    Individual consumers (260M+ global…
    1 tool in directory

    Similar Tools

    SkyReels V4

    SkyReels V4 is an AI video generation model that creates 1080p videos up to 15 seconds long from text or image inputs, with full commercial licensing.

    Vision Agents

    Open-source Video AI framework for building real-time voice and video applications with built-in AI integrations.

    Google Flow

    Google Flow is an AI-powered generative media tool from Google Labs for creating and experimenting with video and creative content.

    Related Topics

    Image

    AI tools that generate or edit still images — illustrations, photos, logos, icons, and graphics — from text prompts, references, or existing images.

    78 tools

    Video

    AI tools that generate or edit video — from text-to-video and animation to avatars, dubbing, and short-form clips.

    70 tools

    Content Generation

    Advanced LLM-based tools that create high-quality, engaging marketing content, articles, and copy tailored to specific audiences, tones, and campaign objectives with minimal human input.

    245 tools