EveryDev.ai
Sign inSubscribe
Explore AI Tools
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Podcasts
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    Home
    Tools

    2,424+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1573
    • Coding1176
    • Infrastructure524
    • Marketing445
    • Design422
    • Projects381
    • Research354
    • Analytics328
    • Testing219
    • MCP210
    • Data203
    • Security192
    • Integration168
    • Learning154
    • Communication145
    • Prompts140
    • Extensions135
    • Commerce123
    • Voice122
    • DevOps98
    • Web76
    • Finance21
    1. Home
    2. Tools
    3. STARFlow
    STARFlow icon

    STARFlow

    Video Generation

    STARFlow is Apple's open-source transformer autoregressive flow model for high-quality text-to-image and text-to-video generation, combining autoregressive models with normalizing flows.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Freely available open-source code and pretrained model weights on GitHub and Hugging Face.

    Engagement

    Available On

    CLI
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Video GenerationImage GenerationGenerative Media

    Alternatives

    PoseCutLeonardo.AIFiddl.art
    Developer
    AppleApple Park, CaliforniaEst. 1976$1300+ raised

    Listed May 2026

    About STARFlow

    STARFlow is Apple's official open-source release of a novel transformer autoregressive flow architecture for high-quality image and video generation. The project, hosted on GitHub under the apple organization, covers both STARFlow (text-to-image) and STARFlow-V (text-to-video), with pretrained model checkpoints available on Hugging Face.

    What It Is

    STARFlow is a generative AI research framework that combines the expressiveness of autoregressive models with the efficiency of normalizing flows. Rather than relying on diffusion-based approaches, it introduces a "deep-shallow" transformer block architecture that processes latent representations through normalizing flow layers. The result is a family of models capable of generating high-resolution images and temporally consistent videos from text prompts.

    Architecture and Model Family

    The project ships two primary model variants:

    • STARFlow (3B parameters): Text-to-image generation at 256×256 resolution. Uses a 6-block deep-shallow architecture, T5-XL text encoder, SD-VAE, and RoPE positional encoding.
    • STARFlow-V (7B parameters): Text-to-video generation at up to 640×480 (480p). Supports up to 481 frames (~30 seconds at 16 FPS) with causal temporal attention and WAN2.2-VAE.
    • STARFlow2 and NTM (Normalizing Trajectory Models): Two follow-on research directions with papers published but code listed as "TBD."

    A key inference optimization is block-wise Jacobi iteration, which accelerates sampling by enabling parallel convergence across token blocks rather than strictly sequential decoding.

    Research Lineage and Recognition

    The STARFlow paper (arXiv:2506.06276) was accepted as a NeurIPS 2025 Spotlight, and STARFlow-V (arXiv:2511.20462) received a CVPR 2026 Highlight designation, according to the repository's own badges and citations. The project cites four arXiv papers in total, reflecting an active research program at Apple spanning image synthesis, video generation, and unified multimodal generation.

    Setup and Usage Path

    The repository targets ML researchers and practitioners comfortable with Python and distributed training. Setup involves:

    1. Cloning the repo and creating a conda environment via scripts/setup_conda.sh or pip install -r requirements.txt
    2. Downloading pretrained checkpoints from Hugging Face into a local ckpts/ directory
    3. Running inference via torchrun with provided shell scripts for both image and video generation

    Training is supported via FSDP (Fully Sharded Data Parallel) for large-scale distributed runs, with gradient checkpointing available to reduce memory usage. The repository includes separate training scripts for image and video tasks, along with dry-run validation flags.

    Update: Active Development as of May 2026

    The repository was created in October 2025 and last pushed to in May 2026, with 563 stars and 39 forks as of the latest metadata. The codebase covers STARFlow and STARFlow-V with full training and inference support, while STARFlow2 and NTM remain paper-only releases with code marked as forthcoming. The project is licensed under a custom Apple license (separate LICENSE and LICENSE_MODEL files), not a standard OSI-approved license.

    STARFlow - 1

    Community Discussions

    Be the first to start a conversation about STARFlow

    Share your experience with STARFlow, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Freely available open-source code and pretrained model weights on GitHub and Hugging Face.

    • Full source code access
    • Pretrained model checkpoints via Hugging Face
    • Text-to-image generation (3B model)
    • Text-to-video generation (7B model)
    • Training scripts with FSDP support

    Capabilities

    Key Features

    • Text-to-image generation (256×256)
    • Text-to-video generation (up to 480p, ~30 seconds)
    • Text-image-to-video (TI2V) generation
    • Transformer autoregressive flow architecture
    • Block-wise Jacobi iteration for fast sampling
    • FSDP support for distributed training
    • Variable-length video generation
    • Classifier-free guidance
    • RoPE positional encoding
    • Causal temporal attention for video
    • Gradient checkpointing for memory efficiency
    • Configurable aspect ratios and resolutions

    Integrations

    Hugging Face (model checkpoints)
    T5-XL (text encoder)
    SD-VAE
    WAN2.2-VAE
    PyTorch
    torchrun
    conda
    wandb (training logging)
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate STARFlow and help others make informed decisions.

    Developer

    Apple

    Apple publishes select open-source tools and research artifacts, including Embedding Atlas for exploring and visualizing embeddings.

    Founded 1976
    Apple Park
    $1300+ raised
    166,000 employees

    Used by

    HIMO Group (26M+ customers, using Mac…
    Rituals (retail)
    Laura Canada (retail)
    H-E-B (retail)
    +8 more
    Read more about Apple
    WebsiteGitHubX / Twitter
    3 tools in directory

    Similar Tools

    PoseCut icon

    PoseCut

    PoseCut is an AI-powered creative studio for generating studio-quality images and cinematic videos, with 14+ editing tools and 437+ artistic styles.

    Leonardo.AI icon

    Leonardo.AI

    A creator-first generative AI platform for producing high-quality images, videos, and edited visuals from text prompts or custom-trained models.

    Fiddl.art icon

    Fiddl.art

    AI art generation platform where creators can generate images and videos, share creations, and earn points when others engage with their work.

    Browse all tools

    Related Topics

    Video Generation

    AI-powered platforms for creating, synthesizing, and generating video content including realistic scenes, animations, and visual effects.

    28 tools

    Image Generation

    AI-powered tools that create original images, illustrations, and visual assets from text prompts, reference images, or conceptual descriptions using generative models.

    42 tools

    Generative Media

    AI platforms providing comprehensive generative capabilities across multiple media types including images, video, audio, and 3D content.

    88 tools
    Browse all topics
    Back to all tools
    Discussions