Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    2,051+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1104
    • Coding995
    • Infrastructure429
    • Marketing408
    • Design354
    • Projects323
    • Analytics311
    • Research297
    • Testing194
    • Data166
    • Integration164
    • Security162
    • MCP152
    • Learning143
    • Communication126
    • Extensions118
    • Commerce112
    • Prompts109
    • Voice105
    • DevOps89
    • Web73
    • Finance19
    1. Home
    2. Tools
    3. OpenMythos
    OpenMythos icon

    OpenMythos

    AI Development Libraries

    An open-source, theoretical PyTorch implementation of a Recurrent-Depth Transformer (RDT) inspired by the suspected Claude Mythos architecture, featuring MoE, MLA/GQA attention, and LTI-stable looped inference.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under the MIT License. Free to use, modify, and distribute.

    Engagement

    Available On

    Linux
    API
    SDK
    CLI

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    AI Development LibrariesLocal InferenceLLM Orchestration

    Alternatives

    BitNetQwen3Vercel AI SDK
    Developer
    Kye GomezKye Gomez builds open-source AI research tools and framework…

    Listed Apr 2026

    About OpenMythos

    OpenMythos is an open-source, community-driven theoretical reconstruction of the Claude Mythos model architecture, built from publicly available research. It implements a Recurrent-Depth Transformer (RDT) with three stages: a Prelude of standard transformer blocks, a looped Recurrent Block run up to max_loop_iters times, and a final Coda. The project is not affiliated with or endorsed by Anthropic and is intended purely for research and exploration of compute-adaptive, depth-variable reasoning.

    • Recurrent-Depth Transformer (RDT) — implements a Prelude → Looped Recurrent Block → Coda architecture where the same weights are reused across loop iterations for implicit multi-hop reasoning in continuous latent space
    • Switchable Attention — supports both Multi-head Latent Attention (MLA) and Grouped Query Attention (GQA), configurable via attn_type in MythosConfig
    • Sparse Mixture of Experts (MoE) — feed-forward layers use fine-grained routed experts plus always-on shared experts, enabling broad domain coverage with low per-token activation cost
    • LTI-Stable Injection — injection parameters are constrained so the spectral radius ρ(A) < 1 by construction, preventing residual explosion and enabling stable training at high learning rates
    • Pre-configured Model Variants — factory functions (mythos_1b through mythos_1t) provide ready-to-use MythosConfig objects spanning 1B to 1T parameters
    • Training Script Included — a 3B model training script on FineWeb-Edu is provided, supporting single-GPU and multi-GPU (DDP via torchrun) setups with bfloat16/float16 precision
    • Adaptive Computation Time (ACT) — architecture supports variable loop depth per input, allowing harder inputs to receive more compute while simpler ones halt early
    • LoRA Depth Adaptation — optional depth-wise LoRA modules allow each loop iteration to adapt behavior slightly while preserving the compactness of weight sharing
    • Install via pip — get started with pip install open-mythos, then import OpenMythos and MythosConfig from open_mythos.main to instantiate and run the model
    OpenMythos - 1

    Community Discussions

    Be the first to start a conversation about OpenMythos

    Share your experience with OpenMythos, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source (MIT)

    Fully free and open-source under the MIT License. Free to use, modify, and distribute.

    • Full source code access
    • All model variants (1B–1T)
    • Training scripts included
    • MIT License — commercial use allowed

    Capabilities

    Key Features

    • Recurrent-Depth Transformer (RDT) architecture
    • Switchable MLA and GQA attention
    • Sparse Mixture of Experts (MoE) with routed and shared experts
    • LTI-stable injection parameters (spectral radius < 1)
    • Pre-configured model variants from 1B to 1T parameters
    • Training script for 3B model on FineWeb-Edu
    • Single-GPU and multi-GPU (DDP) training support
    • Adaptive Computation Time (ACT) halting mechanism
    • Depth-wise LoRA adaptation per loop iteration
    • bfloat16/float16 mixed precision training
    • Continuous Depth-wise Batching for variable inference compute
    • Loop-index positional embedding support

    Integrations

    PyTorch
    HuggingFace Datasets (FineWeb-Edu)
    torchrun (PyTorch DDP)
    openai/gpt-oss-20b tokenizer
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate OpenMythos and help others make informed decisions.

    Developer

    Kye Gomez

    Kye Gomez builds open-source AI research tools and frameworks, with a focus on large language model architectures and multi-agent systems. He maintains several high-profile repositories exploring cutting-edge model designs, including theoretical reconstructions of frontier AI architectures. His work emphasizes reproducibility, community collaboration, and making advanced AI research accessible via pip-installable Python packages.

    Read more about Kye Gomez
    WebsiteGitHubX / Twitter
    1 tool in directory

    Similar Tools

    BitNet icon

    BitNet

    Microsoft's official implementation of BitNet, enabling efficient 1-bit large language model inference on CPUs without requiring GPUs.

    Qwen3 icon

    Qwen3

    Qwen3 is a family of open-weight large language models by Alibaba Cloud's Qwen team, featuring both dense and Mixture-of-Experts architectures with seamless thinking and non-thinking modes.

    Vercel AI SDK icon

    Vercel AI SDK

    A free, open-source TypeScript toolkit from the creators of Next.js for building AI-powered applications with a unified provider API and streaming support.

    Browse all tools

    Related Topics

    AI Development Libraries

    Programming libraries and frameworks that provide machine learning capabilities, model integration, and AI functionality for developers.

    140 tools

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    82 tools

    LLM Orchestration

    Platforms and frameworks for designing, managing, and deploying complex LLM workflows with visual interfaces, allowing for the coordination of multiple AI models and services.

    91 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    Discussions