Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    2,025+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1104
    • Coding995
    • Infrastructure429
    • Marketing408
    • Design354
    • Projects323
    • Analytics311
    • Research297
    • Testing194
    • Data166
    • Integration164
    • Security162
    • MCP152
    • Learning143
    • Communication126
    • Extensions118
    • Commerce112
    • Prompts109
    • Voice105
    • DevOps89
    • Web73
    • Finance19
    1. Home
    2. Tools
    3. llamafile
    llamafile icon

    llamafile

    Local Inference

    llamafile lets you distribute and run LLMs with a single self-contained executable file, with no installation required, across most operating systems and CPU architectures.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under Apache 2.0. Download and run LLMs locally with no cost.

    Engagement

    Available On

    Windows
    macOS
    Linux
    API
    CLI

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Local InferenceLLM OrchestrationSpeech Recognition

    Alternatives

    SyntheticOrKaBodega Inference Engine
    Developer
    Mozilla AISan Francisco, CAEst. 2023$30000000 raised

    Listed Apr 2026

    About llamafile

    llamafile is a Mozilla Builders project that collapses the complexity of running large language models into a single-file executable. It combines llama.cpp with Cosmopolitan Libc so that one file runs locally on most operating systems and CPU architectures without any installation. The project also includes whisperfile, a single-file speech-to-text tool built on whisper.cpp using the same packaging approach. llamafile is fully open source under the Apache 2.0 license and is actively maintained by Mozilla.ai.

    • Single-file distribution — Download one .llamafile executable and run it directly; no Python environment, Docker, or package manager needed.
    • Cross-platform support — The same file runs on macOS, Linux, Windows, BSD, and multiple CPU architectures thanks to Cosmopolitan Libc.
    • Built on llama.cpp — Inherits broad model compatibility and GPU acceleration support from the widely-used llama.cpp inference engine.
    • whisperfile included — A companion single-file speech-to-text tool built on whisper.cpp for audio transcription and translation, requiring no installation.
    • Local inference — All computation runs on your own hardware; no data is sent to external servers.
    • Pre-built model files — Ready-to-run llamafiles for popular models (e.g., Qwen, LLaVA) are hosted on Hugging Face for immediate download.
    • Quick start — Download a .llamafile, mark it executable (chmod +x), and run it; Windows users rename with .exe extension.
    • Versioned releases — Stable and legacy releases are available on GitHub; pre-built llamafiles indicate which server version they bundle.
    • Open source — Apache 2.0 licensed core; llama.cpp and whisper.cpp modifications are MIT licensed for upstream compatibility.
    llamafile - 1

    Community Discussions

    Be the first to start a conversation about llamafile

    Share your experience with llamafile, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open-source under Apache 2.0. Download and run LLMs locally with no cost.

    • Single-file LLM execution
    • Cross-platform support
    • whisperfile speech-to-text
    • Local inference
    • No installation required

    Capabilities

    Key Features

    • Single-file LLM executable
    • No installation required
    • Cross-platform (Windows, macOS, Linux, BSD)
    • Multi-architecture CPU support
    • Built on llama.cpp
    • whisperfile speech-to-text tool
    • Local inference
    • Pre-built model files on Hugging Face
    • GPU acceleration support
    • Open source (Apache 2.0)

    Integrations

    llama.cpp
    whisper.cpp
    Cosmopolitan Libc
    Hugging Face
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate llamafile and help others make informed decisions.

    Developer

    Mozilla AI

    Mozilla AI is a subsidiary of Mozilla focused on building open-source tools and infrastructure for trustworthy, transparent, and controllable AI. The team develops products and libraries including llamafile, any-agent, any-llm, Lumigator, and cq, contributing to an open AI ecosystem that prioritizes developer empowerment and responsible development practices.

    Founded 2023
    San Francisco, CA
    $30000000 raised
    50 employees

    Used by

    Mission-driven startups in the 'Rebel…
    Open-source developer community
    Read more about Mozilla AI
    WebsiteGitHubLinkedIn
    3 tools in directory

    Similar Tools

    Synthetic icon

    Synthetic

    AI platform providing access to multiple LLMs with subscription or usage-based pricing, offering both UI and API access.

    OrKa icon

    OrKa

    Open-source tool for building AI workflows using YAML configuration instead of Python code, with built-in memory and local LLM support.

    Bodega Inference Engine icon

    Bodega Inference Engine

    Enterprise-grade local LLM inference engine built specifically for Apple Silicon, featuring a multi-model registry, OpenAI-compatible API, and high-throughput continuous batching.

    Browse all tools

    Related Topics

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    78 tools

    LLM Orchestration

    Platforms and frameworks for designing, managing, and deploying complex LLM workflows with visual interfaces, allowing for the coordination of multiple AI models and services.

    86 tools

    Speech Recognition

    AI tools that convert spoken language into text.

    33 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Discussions