Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Podcasts
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Developers

    2,113+ AI companies

    • Radar
    • Trending
    1. Home
    2. Developers
    3. Cactus Compute

    Cactus Compute

    Develop high-performance kernels and an AI inference engine for phone chips to enable low-latency AI on mobile devices and wearables.

    Visit Website

    At a Glance

    1Tool Listed
    2Products
    5Capabilities
    Discussions
    San Francisco, CAHeadquarters
    2025Est.
    8Employees
    $1000000Raised
    Focus Areas
    Local Inference
    Agent Frameworks
    AI Development Libraries
    Connect
    Latest News
    Cactus Compute introduces Needle, a 26M parameter model for mobile tool-calling.Apr 1, 2026
    Cactus Team announces multiple papers accepted at ICLR 2026 workshops.Mar 15, 2026
    Markets
    • Mobile App Developers
    • Hardware Manufacturers
    • Wearable Tech Companies

    AI Tools by Cactus Compute

    (1)
    View Needle
    Needle tool icon

    Needle

    Tiny LLM for Edge Devices

    Local InferenceAgent FrameworksAI Dev Libraries

    Discussions

    No discussions yet

    Be the first to start a discussion about Cactus Compute

    Latest News

    04/01/2026

    Cactus Compute introduces Needle, a 26M parameter model for mobile tool-calling.

    github.com
    03/15/2026

    Cactus Team announces multiple papers accepted at ICLR 2026 workshops.

    LinkedIn
    11/20/2025

    DeepMind x Cactus Compute Hackathon results released.

    Hacker News / Twitter
    06/01/2025

    Cactus Compute joins Y Combinator S25 batch.

    Y Combinator

    Products & Services

    2
    Cactus Inference Engine
    2025

    An open-source, high-performance inference engine and unified cross-platform framework for running AI models locally on mobile devices and wearables.

    Needle
    2026

    A specialized 26M parameter model distilled from Gemini, optimized for single-shot function calling and tool-use on mobile devices. MIT-licensed.

    Market Position

    Positioned as a more efficient, low-level alternative to generic mobile AI frameworks, focusing on kernel optimization and tiny, specialized models (26M parameters) like Needle.

    Leadership

    Founders

    RS

    Roman Shemet

    Former quant and economist with a background in product and data. Graduated from the University of Oxford. Experience in corporate finance, financial econometrics, and machine learning.

    HN

    Henry Ndubuaku

    Background in EECS, Robotics, and AI. MS in AI from the University of Pennsylvania (UPenn). Author of 4 ICLR papers and specialized in on-device AI and robotics.

    Executive Team

    RS

    Roman Shemet

    Co-Founder

    Former quant & economist, Oxford graduate.

    HN

    Henry Ndubuaku

    Co-Founder

    Robotics and AI expert, MS AI from UPenn, 4x ICLR author.

    Board of Directors

    AM
    Andrew Miklas
    Investor/Advisor

    Founding Story

    Cactus Compute was started to solve the latency and efficiency bottlenecks of running AI models on edge devices. The founders leveraged their backgrounds in finance (econometrics) and robotics/AI to build a specialized engine for phone chips.

    Business Model

    Revenue
    Reported reaching $500,000 in ARR within four weeks of launch.

    Revenue Model

    Open core model with enterprise licensing (Pro Key) and potentially support/services. Free for hobbyists.

    Pricing Tiers

    Hobbyist/Personal
    Free

    Open-source access to models and framework for non-commercial or personal use.

    Enterprise/Pro
    Not Public

    Requires a 'Pro Key' and enterprise licensing for commercial deployment and support.

    Private

    Target Markets

    Industries & Segments
    • Mobile App Developers
    • Hardware Manufacturers
    • Wearable Tech Companies
    Use Cases
    • On-device mobile AI assistants
    • Wearable device AI features
    • Function calling for mobile agents
    • Private and offline AI inference
    Notable Customers
    • Hobbyist developers
    • Mobile AI engineers

    Quick Facts

    Headquarters
    San Francisco, CA
    Founded
    2025
    Entity Type
    Inc.
    Employees
    8
    Total Funding
    $1,000,000
    Investors
    Y Combinator, Andrew Miklas
    Office Locations
    San Francisco
    London

    Funding History

    Seed (YC Deal)$500,000
    2025-08-01
    Y Combinator

    History & Milestones

    2026

    Released 'Needle', a 26M parameter single-shot tool-calling model distilled from Gemini.

    2026

    Had multiple papers accepted at ICLR 2026 workshops regarding on-device AI and inference.

    2025

    Cactus Compute founded to build low-latency AI engines for mobile devices.

    2025

    Accepted into the Y Combinator Summer 2025 (S25) batch.

    2025

    Hosted the DeepMind x Cactus Compute Hackathon.

    Key Capabilities

    5
    Low-latency on-device inference
    Optimized kernels for phone chips
    Unified cross-platform framework (Android, Kotlin)
    Function-calling specialized models (Needle)
    MIT-licensed open source models

    Integrations & Partnerships

    Platform Integrations

    • Android
    • Jetpack Compose
    • Kotlin Multiplatform
    • ARM/Mobile Phone Chips

    Key Partnerships

    DeepMind (Hackathon partnership)
    Y Combinator
    ASU AI Society

    Connect

    Website
    cactuscompute.com
    GitHub
    cactus-compute

    AI Topics

    3

    Cactus Compute focuses on these topics:

    Local Inference(1)
    Agent Frameworks(1)
    AI Development Libraries(1)
    Back to all developers
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026