Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    2,226+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1228
    • Coding1045
    • Infrastructure455
    • Marketing414
    • Design374
    • Projects340
    • Analytics319
    • Research306
    • Testing200
    • Data171
    • Integration169
    • Security169
    • MCP164
    • Learning146
    • Communication131
    • Prompts122
    • Extensions120
    • Commerce116
    • Voice107
    • DevOps92
    • Web73
    • Finance19
    1. Home
    2. Tools
    3. SubQ
    SubQ icon

    SubQ

    LLM Orchestration

    SubQ is a sub-quadratic LLM built for 12M-token reasoning, enabling agents to work across full repositories, long histories, and persistent state at one-fifth the cost of leading LLMs.

    Visit Website

    At a Glance

    Pricing
    Paid
    API: Custom/contact
    SubQ Code: Custom/contact

    Engagement

    Available On

    API

    Resources

    WebsiteDocsllms.txt

    Topics

    LLM OrchestrationAI InfrastructureAI Coding Assistants

    Alternatives

    InterfazeAlibaba Cloud Model StudioSynthetic
    Developer
    Subquadratic Inc.Miami, FLEst. 2026$29M raised

    Listed May 2026

    About SubQ

    SubQ is the first large language model built on a fully sub-quadratic sparse-attention architecture, designed specifically for long-context tasks at scale. Unlike transformer-based models that process every possible token relationship at O(n²) complexity, SubQ operates at O(n) by focusing only on the relationships that matter — reducing attention compute by nearly 1,000× at 12M tokens. It delivers 12M-token reasoning at 150 tokens per second, at one-fifth the cost of other leading LLMs, with no quality loss. SubQ is available as a developer API and as a coding agent integration layer.

    • 12M Token Context Window — Process entire codebases, months of pull request history, and long-running agent state in a single prompt.
    • Sub-Quadratic Architecture — Built on Sparse Structured Attention (SSA), SubQ scales linearly rather than quadratically, making long-context inference practical and affordable.
    • OpenAI-Compatible API — Drop-in API endpoints with streaming and tool use support, making integration straightforward for existing developer workflows.
    • SubQ Code Integration — A long-context layer for coding agents that plugs into Claude Code, Codex, and Cursor, delivering ~25% lower bills and 10× faster codebase exploration via a one-line install.
    • Auto-Redirect for Expensive Turns — SubQ Code automatically redirects token-heavy questions away from expensive frontier models, optimizing cost without changing agent behavior.
    • Benchmark-Validated Performance — SubQ 1M-Preview achieves 81.8% on SWE-Bench Verified and 95.0% on RULER @ 128K, with results third-party validated.
    • Enterprise API Access — Full-context API for enterprise teams to process full repositories and pipeline states in a single call at linear cost.
    • Research-Driven Team — Built by researchers from Meta, Google, Oxford, Cambridge, and BYU, pushing foundational change at the model architecture level.
    SubQ - 1

    Community Discussions

    Be the first to start a conversation about SubQ

    Share your experience with SubQ, ask questions, or help others learn from your insights.

    Pricing

    API

    Full-context API for developers and enterprise teams. Process full repositories and pipeline states in a single API call at linear cost.

    Custom
    contact sales
    • 12M token context window
    • Streaming + tool use
    • OpenAI-compatible endpoints

    SubQ Code

    Long-context layer for coding agents. Plug into Claude Code, Codex, and Cursor to map codebases and answer token-heavy questions faster.

    Custom
    contact sales
    • ~25% lower bill
    • 10× faster codebase exploration
    • Auto-redirects expensive model turns
    • One-line install
    View official pricing

    Capabilities

    Key Features

    • 12M token context window
    • Sub-quadratic sparse attention architecture
    • O(n) linear scaling vs O(n²) transformer
    • 150 tokens per second inference speed
    • 1/5 cost of leading LLMs
    • OpenAI-compatible API endpoints
    • Streaming and tool use support
    • SubQ Code for coding agent integration
    • Auto-redirect for expensive model turns
    • One-line install for coding agents
    • SWE-Bench Verified 81.8% score
    • RULER @ 128K 95.0% score
    • Third-party validated benchmarks

    Integrations

    Claude Code
    Codex
    Cursor
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate SubQ and help others make informed decisions.

    Developer

    Subquadratic Inc.

    Subquadratic builds frontier AI research and infrastructure, developing a new class of LLMs on a fully sub-quadratic architecture. The team, drawn from Meta, Google, Oxford, Cambridge, and BYU, pushes foundational change at the model architecture level rather than incremental transformer improvements. Their flagship model, SubQ, enables large-context and multi-modal inference that scales efficiently where transformers cannot.

    Founded 2026
    Miami, FL
    $29M raised
    15 employees

    Used by

    Enterprises in early access beta
    Read more about Subquadratic Inc.
    WebsiteLinkedInX / Twitter
    1 tool in directory

    Similar Tools

    Interfaze icon

    Interfaze

    An AI model built on a hybrid DNN/CNN + LLM architecture for deterministic developer tasks like OCR, web scraping, STT, translation, and classification with 98–99% structured output accuracy.

    Alibaba Cloud Model Studio icon

    Alibaba Cloud Model Studio

    Alibaba Cloud's platform for deploying and scaling Qwen, Wan, and other leading AI foundation models with enterprise-grade security.

    Synthetic icon

    Synthetic

    AI platform providing access to multiple LLMs with subscription or usage-based pricing, offering both UI and API access.

    Browse all tools

    Related Topics

    LLM Orchestration

    Platforms and frameworks for designing, managing, and deploying complex LLM workflows with visual interfaces, allowing for the coordination of multiple AI models and services.

    107 tools

    AI Infrastructure

    Infrastructure designed for deploying and running AI models.

    212 tools

    AI Coding Assistants

    AI tools that help write, edit, and understand code with intelligent suggestions.

    420 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Discussions