Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    1,819+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    Categories
    • Agents891
    • Coding869
    • Infrastructure377
    • Marketing357
    • Design302
    • Research276
    • Projects271
    • Analytics266
    • Testing160
    • Integration157
    • Data150
    • Security131
    • MCP125
    • Learning124
    • Extensions108
    • Communication107
    • Prompts100
    • Voice90
    • Commerce89
    • DevOps70
    • Web66
    • Finance17
    1. Home
    2. Tools
    3. Qwen3
    Qwen3 icon

    Qwen3

    Local Inference

    Qwen3 is a family of open-weight large language models by Alibaba Cloud's Qwen team, featuring both dense and Mixture-of-Experts architectures with seamless thinking and non-thinking modes.

    Visit Website

    At a Glance

    Pricing
    Open Source

    All Qwen3 open-weight models are free to download and use under the Apache 2.0 license.

    Engagement

    Available On

    macOS
    Linux
    Android
    iOS
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Local InferenceAI Development LibrariesLLM Orchestration

    Alternatives

    BitNetArcee AIIBM Granite Playground
    Developer
    Alibaba GroupNo. 969 West Wen Yi Road, HangzhouEst. 1999$105M+ raised

    Listed Apr 2026

    About Qwen3

    Qwen3 is a series of open-weight large language models developed by the Qwen team at Alibaba Cloud, available in dense and Mixture-of-Experts (MoE) variants ranging from 0.6B to 235B parameters. The models support seamless switching between a thinking mode (for complex reasoning, math, and coding) and a non-thinking mode (for efficient general-purpose chat). Qwen3 supports 100+ languages and dialects and achieves state-of-the-art performance among open-weight models on reasoning, coding, and agent benchmarks. The latest Qwen3-2507 update extends long-context understanding to 256K tokens, extendable to 1 million tokens.

    • Dense and MoE model sizes: Available in 0.6B, 1.7B, 4B, 8B, 14B, 32B (dense) and 30B-A3B, 235B-A22B (MoE) to fit a wide range of hardware budgets.
    • Thinking and non-thinking modes: Switch between deep reasoning mode and fast chat mode using enable_thinking flags or /think//no_think instructions in the prompt.
    • Long-context support: Handles up to 256K tokens natively, extendable to 1 million tokens with updated Qwen3-2507 model variants.
    • Multilingual capability: Supports 100+ languages and dialects with strong multilingual instruction following and translation.
    • Agent and tool use: Integrates with Qwen-Agent for tool use and MCP support, enabling precise function calling in both thinking and non-thinking modes.
    • Broad inference framework support: Run with Transformers, vLLM, SGLang, TensorRT-LLM, llama.cpp, Ollama, LM Studio, MLX LM, OpenVINO, ExecuTorch, and MNN for flexible local and cloud deployment.
    • Finetuning support: Compatible with Axolotl, UnSloth, Swift, and LLaMA-Factory for SFT, DPO, and GRPO training workflows.
    • Quantization: Supports GPTQ, AWQ, and GGUF quantization for efficient deployment on consumer hardware.
    • Apache 2.0 license: All open-weight models are freely available for commercial and research use.
    Qwen3 - 1

    Community Discussions

    Be the first to start a conversation about Qwen3

    Share your experience with Qwen3, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    All Qwen3 open-weight models are free to download and use under the Apache 2.0 license.

    • All model sizes (0.6B to 235B)
    • Dense and MoE architectures
    • Thinking and non-thinking modes
    • Apache 2.0 license
    • Commercial use allowed

    Capabilities

    Key Features

    • Dense and MoE model architectures
    • Thinking and non-thinking mode switching
    • 256K token long-context support (extendable to 1M)
    • 100+ language and dialect support
    • Agent and tool use with MCP support
    • Supports vLLM, SGLang, TensorRT-LLM, llama.cpp, Ollama, LM Studio
    • GPTQ, AWQ, and GGUF quantization
    • Finetuning with Axolotl, UnSloth, Swift, LLaMA-Factory
    • OpenAI-compatible API server
    • Apache 2.0 open-weight license

    Integrations

    Hugging Face Transformers
    vLLM
    SGLang
    TensorRT-LLM
    llama.cpp
    Ollama
    LM Studio
    MLX LM
    OpenVINO
    ExecuTorch
    MNN
    ModelScope
    Qwen-Agent
    Axolotl
    LLaMA-Factory
    UnSloth
    Swift
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate Qwen3 and help others make informed decisions.

    Developer

    Alibaba Group

    Alibaba Group is a global technology company developing open models and infrastructure, including the Qwen series for AI applications.

    Founded 1999
    No. 969 West Wen Yi Road, China
    $105M+ raised
    124,320 employees

    Used by

    SAP
    Bosch
    Ford
    IHG (InterContinental Hotels Group)
    +36 more
    Read more about Alibaba Group
    WebsiteGitHub
    4 tools in directory

    Similar Tools

    BitNet icon

    BitNet

    Microsoft's official implementation of BitNet, enabling efficient 1-bit large language model inference on CPUs without requiring GPUs.

    Arcee AI icon

    Arcee AI

    US-based open intelligence lab building open-weight foundation models that run anywhere - on edge, on-prem, or cloud.

    IBM Granite Playground icon

    IBM Granite Playground

    Interactive playground for testing and experimenting with IBM's Granite family of open-source AI foundation models.

    Browse all tools

    Related Topics

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    60 tools

    AI Development Libraries

    Programming libraries and frameworks that provide machine learning capabilities, model integration, and AI functionality for developers.

    125 tools

    LLM Orchestration

    Platforms and frameworks for designing, managing, and deploying complex LLM workflows with visual interfaces, allowing for the coordination of multiple AI models and services.

    72 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026