EveryDev.ai
With AI, Everyone is a Dev. EveryDev.ai © 2026

    Dia

    Voice Synthesis

    Dia is an open-source text-to-speech model by Nari Labs that generates realistic dialogue audio with multiple speakers, emotions, and non-verbal sounds from transcripts.

    At a Glance

    Pricing
    Open Source

    Fully open-source model available for free download and local use.

    Available On

    Windows
    macOS
    Linux
    Web
    API

    Resources

    Website, Docs, GitHub, llms.txt

    Topics

    Voice Synthesis, Local Inference, Audio

    Alternatives

    Miso TTS 8B, VibeVoice, Papla Media

    Developer

    Nari Labs: Nari Labs builds open-source AI models focused on speech and…

    Listed Mar 2026

    About Dia

    Dia is an open-source, 1.6B-parameter text-to-speech model developed by Nari Labs, designed to generate realistic dialogue directly from transcripts. It supports multi-speaker audio generation, non-verbal cues such as laughter and coughing, and fine-grained control over emotion and tone. Dia can also clone a voice from an audio reference, which makes it useful for content creators, researchers, and developers building conversational AI applications.

    • Multi-speaker dialogue generation: Generate realistic conversations between multiple speakers directly from a text transcript using speaker tags.
    • Non-verbal audio support: Include sounds like laughter, coughing, and sighs in generated audio by adding special tokens in the transcript.
    • Emotion and tone control: Guide the emotional delivery of speech through natural language descriptions embedded in the transcript.
    • Voice cloning: Provide an audio reference clip to clone a specific voice and use it in generated dialogue.
    • Open-source model weights: Download and run the 1.6B parameter model locally via Hugging Face or the GitHub repository.
    • Gradio demo: Try Dia instantly through the hosted Hugging Face Spaces demo without any local setup.
    • Python API: Integrate Dia into your own applications using the provided Python package and inference scripts.
    • Local inference: Run the model on your own hardware for full control over privacy and customization.
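
The speaker-tag and non-verbal-token features above can be illustrated with a small transcript builder. This is only a sketch: the listing does not specify the tag syntax, so the `[S1]`/`[S2]` speaker tags and the parenthesized `(laughs)` cue are assumptions, and `build_transcript` is a hypothetical helper, not part of the Dia package. It only prepares the text; passing the result to the model is left out.

```python
from typing import Iterable, Tuple


def build_transcript(turns: Iterable[Tuple[int, str]], max_speakers: int = 2) -> str:
    """Join (speaker_number, line) pairs into one tagged transcript string.

    Assumed format (not confirmed by the listing): each turn is prefixed
    with a tag such as [S1], and non-verbal cues are written inline in
    parentheses, for example (laughs).
    """
    parts = []
    for speaker, line in turns:
        if not 1 <= speaker <= max_speakers:
            raise ValueError(f"speaker must be between 1 and {max_speakers}, got {speaker}")
        # Collapse repeated whitespace so the transcript stays on one line.
        text = " ".join(line.split())
        if not text:
            continue  # skip empty turns rather than emitting a bare tag
        parts.append(f"[S{speaker}] {text}")
    return " ".join(parts)


if __name__ == "__main__":
    turns = [
        (1, "Did you listen to the demo?"),
        (2, "I did. (laughs) It sounded like a real conversation."),
    ]
    print(build_transcript(turns))
```

Keeping transcript assembly separate from inference makes it easy to validate speaker numbers and cues before spending time on audio generation.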

    Pricing

    Open Source

    Fully open-source model available for free download and local use.

    • 1.6B parameter TTS model
    • Multi-speaker dialogue generation
    • Voice cloning
    • Non-verbal audio cues
    • Emotion control

    Capabilities

    Key Features

    • Multi-speaker dialogue generation
    • Non-verbal audio cues (laughter, coughing, sighs)
    • Emotion and tone control via transcript
    • Voice cloning from audio reference
    • 1.6B parameter open-source model
    • Hugging Face Spaces demo
    • Python API
    • Local inference support

    Integrations

    Hugging Face
    Gradio
    API Available

    Developer

    Nari Labs

    Nari Labs builds open-source AI models focused on speech and audio generation. The team develops cutting-edge text-to-speech technology, including Dia, a 1.6B parameter model capable of generating realistic multi-speaker dialogue with emotional nuance and non-verbal sounds. Nari Labs releases model weights publicly to empower researchers and developers worldwide.

    1 tool in directory

    Similar Tools

    Miso TTS 8B

    An 8-billion parameter open-source text-to-speech model designed for high-quality, highly emotive conversational speech generation with voice cloning support.

    VibeVoice

    An open-source family of frontier voice AI models from Microsoft, including long-form TTS, multi-speaker speech synthesis, real-time streaming TTS, and long-form ASR with speaker diarization.

    Papla Media

    AI voice generator that converts text to natural-sounding speech with voice cloning capabilities from just 10 seconds of audio.

    Related Topics

    Voice Synthesis

    AI tools that generate human-like speech from text.

    30 tools

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    121 tools

    Audio

    AI tools that generate or edit audio — music, sound effects, voice and speech, and podcast production.

    23 tools