Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Developers

    1,995+ AI companies

    • Radar
    • Trending
    1. Home
    2. Developers
    3. vLLM

    vLLM

    To grow vLLM as the world's leading AI inference engine and provide a universal inference layer that makes AI serving fast, cheap, and accessible.

    Visit Website

    At a Glance

    1Tool Listed
    2Products
    22Tool Views
    6Capabilities
    Discussions
    San Francisco, CAHeadquarters
    2025Est.
    25Employees
    $150MRaised
    Focus Areas
    Local Inference
    AI Infrastructure
    Deployment Automation
    Connect
    Latest News
    Inferact Launches With $150 Million Funding At $800M Valuation To Commercialize vLLMJan 22, 2026
    Investing in Inferact: The Team Behind vLLMJan 22, 2026
    Markets
    • AI Startups
    • Enterprise AI Teams
    • Cloud Service Providers
    • Open Source AI Community

    AI Tools by vLLM

    (1)
    View vLLM
    vLLM tool icon

    vLLM

    Open Source LLM Inference Library

    Local InferenceAI InfrastructureDeploy Automation

    Discussions

    No discussions yet

    Be the first to start a discussion about vLLM

    Latest News

    01/22/2026

    Inferact Launches With $150 Million Funding At $800M Valuation To Commercialize vLLM

    Pulse 2.0 / SiliconANGLE
    01/22/2026

    Investing in Inferact: The Team Behind vLLM

    Andreessen Horowitz (a16z) Blog
    12/01/2025

    vLLM Project in Talks for Major Funding Round

    Forbes
    10/01/2023

    vLLM Paper on PagedAttention Presented at SOSP 2023

    ACM Digital Library / UC Berkeley

    Products & Services

    2
    vLLM
    2023-06-20

    Open-source library for high-throughput and memory-efficient LLM inference and serving using PagedAttention.

    Inferact Platform
    2026

    A commercial, managed, and serverless version of the vLLM inference engine, designed to provide a universal inference layer.

    Market Position

    Positions itself as the 'universal inference layer' building on the most popular open-source inference engine (vLLM) to compete with proprietary inference stacks.

    Leadership

    Founders

    SM

    Simon Mo

    Co-creator of vLLM, core maintainer, and former PhD student at UC Berkeley Sky Computing Lab. CEO of Inferact.

    WK

    Woosuk Kwon

    Co-creator of vLLM and PhD candidate at UC Berkeley Sky Lab. CTO of Inferact.

    KY

    Kaichao You

    Core maintainer of vLLM, Ph.D. from Tsinghua University. Chief Scientist at Inferact.

    RW

    Roger Wang

    Core maintainer of vLLM, previously a software engineer at NVIDIA, Microsoft, and Amazon.

    IS

    Ion Stoica

    Professor at UC Berkeley, Director of Sky Computing Lab. Co-founder of Databricks and Anyscale. Board member and co-founder of Inferact.

    JG

    Joseph Gonzalez

    Professor at UC Berkeley, co-founder of Databricks and Anyscale. Previously co-founded Turi (acquired by Apple).

    Executive Team

    SM

    Simon Mo

    CEO

    Co-creator and maintainer of vLLM.

    WK

    Woosuk Kwon

    CTO

    Co-creator of vLLM and lead researcher.

    Board of Directors

    IS
    Ion Stoica
    Co-Founder and Board Member
    BM
    Bucky Moore
    Partner at Lightspeed Venture Partners, Board Member
    MB
    Matt Bornstein
    Partner at Andreessen Horowitz, Board Member/Observer

    Founding Story

    Founded by the creators of the vLLM project at UC Berkeley's Sky Computing Lab to commercialize the popular open-source inference engine and provide enterprise-grade infrastructure.

    Business Model

    Revenue
    Early stage; Forbes reported 'little revenue' as of late 2025 prior to the $150M seed launch.

    Revenue Model

    Managed service (SaaS), serverless AI inference platform, and potential enterprise support/licensing.

    Pricing Tiers

    vLLM Open Source
    Free

    Community-supported open-source version.

    Inferact Managed
    Usage-based / TBD

    Serverless managed version of vLLM (in roadmap/early access).

    Private

    Target Markets

    Industries & Segments
    • AI Startups
    • Enterprise AI Teams
    • Cloud Service Providers
    • Open Source AI Community
    Use Cases
    • High-throughput LLM serving
    • Cost-efficient AI inference
    • Real-time chat and agentic workflows
    • Enterprise AI infrastructure
    Notable Customers
    • Meta
    • Google
    • Character.ai
    • DoorDash

    Quick Facts

    Headquarters
    San Francisco, CA
    Founded
    2025
    Entity Type
    Inc.
    Employees
    25
    Total Funding
    $150M
    Investors
    Andreessen Horowitz, Lightspeed Venture Partners
    Office Locations
    San Francisco
    Berkeley

    Funding History

    Seed$150,000,000
    January 22, 2026
    $800,000,000 valuation
    Andreessen Horowitz (a16z)
    Lightspeed Venture Partners

    History & Milestones

    January 22, 2026

    Inferact launches with $150 million in seed funding to commercialize vLLM.

    June 20, 2023

    Initial open-source release of vLLM and the PagedAttention technique.

    October 2023

    vLLM paper 'Efficient Memory Management for Large Language Model Serving with PagedAttention' accepted at SOSP '23.

    Key Capabilities

    6
    PagedAttention memory management
    Continuous batching
    Speculative decoding
    Multi-node scaling
    Quantization support (FP8, INT8, AWQ, etc.)
    Tensor and Pipeline parallelism

    Integrations & Partnerships

    Platform Integrations

    • Kubernetes
    • Hugging Face
    • OpenAI-compatible API
    • Docker

    Key Partnerships

    NVIDIA (Hardware optimization)
    AMD (GPU support)
    Databricks (Strategic investment/integration)

    Connect

    Website
    docs.vllm.ai
    GitHub
    vllm-project
    X / Twitter
    vllm_project

    AI Topics

    3

    vLLM focuses on these topics:

    Local Inference(1)
    AI Infrastructure(1)
    Deployment Automation(1)
    Back to all developers
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026