    EveryDev.ai
    Sutando

    Voice Assistant
    Featured

    An open-source, self-hosted AI agent for macOS that uses voice, vision, and autonomous action to control your computer, join meetings, make phone calls, and build itself.

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under the MIT License. Requires a Claude Code subscription and a Gemini API key (free tier available).

    Available On

    macOS
    Web
    CLI
    API

    Resources

    Website · Docs · GitHub · llms.txt

    Topics

    Voice Assistant · Autonomous Systems · Agent Frameworks

    Alternatives

    Vocode · Pipecat · OpenAI Agents SDK
    Developer
    sonichi · Seattle, WA · Est. 2024

    Listed May 2026

    About Sutando

    Sutando is an open-source, self-hosted AI personal agent for macOS that combines voice control, screen vision, and autonomous task execution into a single local system. It runs on your existing Claude Code subscription and a free Gemini API key, with no remote control plane or third-party write access. Sutando can see your screen, join your meetings, make phone calls, send messages, and autonomously improve its own capabilities when idle — all from your Mac.

    • Voice control — Connect via browser or phone; say commands like "what's on my screen?" or "fix the typo in that file" and Sutando acts immediately using Gemini Live real-time voice.
    • Screen vision — Sutando captures and analyzes your screen on demand, enabling context-aware assistance without manual copy-paste.
    • Meeting participation — Say "join my 2pm call" and Sutando reads your calendar, joins Zoom or Google Meet with computer audio, researches questions live, and writes a summary when done.
    • Phone calls — Sutando can make and receive calls via Twilio, have conversations on your behalf, and report back while you keep working.
    • Multi-channel messaging — Reach the same agent via voice, Telegram, Discord, web, phone, or email — all sharing the same memory and task queue.
    • Autonomous build loop — When idle, Sutando monitors its own health, detects usage patterns, discovers new skills, and writes missing capabilities — most of its own code was built this way.
    • Notes and memory — Capture ideas by voice; Sutando tags, saves, and searches them as YAML-frontmatter markdown notes and acts on actionable items automatically.
    • Multi-machine scaling — Plug in a second Mac and Sutando migrates services autonomously via Discord, coordinating the handoff between agents without migration scripts.
    • 3-tier access control — Owner, verified, and unverified callers get different capability bands on phone, Discord, and Telegram, with STIR/SHAKEN caller ID verification for inbound calls.
    • Quick start — Clone the repo, add your GEMINI_API_KEY to .env, and run bash src/startup.sh; a menu bar app, dashboard, and voice interface launch automatically.
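
The notes feature above stores entries as YAML-frontmatter markdown. The listing does not document Sutando's actual note schema, so the following is only a minimal sketch of that storage style. The `title` and `tags` fields, the `Note` class, and all function names are assumptions made for illustration, and the parser handles only the flat header this sketch writes, not general YAML.

```python
from dataclasses import dataclass, field


@dataclass
class Note:
    """Hypothetical note record; the field names are illustrative only."""
    title: str
    body: str
    tags: list[str] = field(default_factory=list)


def render_note(note: Note) -> str:
    """Serialize a note as markdown with a flat YAML frontmatter header."""
    tags = ", ".join(note.tags)
    return f"---\ntitle: {note.title}\ntags: [{tags}]\n---\n\n{note.body}\n"


def parse_note(text: str) -> Note:
    """Parse the flat frontmatter written by render_note (not general YAML)."""
    # The text starts with '---\n', so the first split element is empty.
    _, header, body = text.split("---\n", 2)
    fields = {}
    for line in header.splitlines():
        key, _, value = line.partition(":")
        fields[key.strip()] = value.strip()
    raw_tags = fields.get("tags", "[]").strip("[]")
    tags = [t.strip() for t in raw_tags.split(",") if t.strip()]
    return Note(title=fields.get("title", ""), body=body.strip(), tags=tags)


def search_by_tag(notes: list[Note], tag: str) -> list[Note]:
    """Return the notes that carry the given tag."""
    return [n for n in notes if tag in n.tags]
```

A real implementation would more likely rely on a full YAML parser; the hand-rolled version here only keeps the sketch free of third-party dependencies.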

    Pricing

    Open Source (MIT)

    Fully free and open-source under the MIT License. Requires a Claude Code subscription and a Gemini API key (free tier available).

    • Voice control via browser
    • Screen capture and vision
    • Autonomous task execution
    • Notes and memory
    • Multi-channel messaging (Telegram, Discord)
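
Both prerequisites named in this section can be checked before launching. The sketch below assumes the `.env` file uses plain `KEY=VALUE` lines with the `GEMINI_API_KEY` variable from the quick start, and that Claude Code is installed as a `claude` executable on the PATH. The function names are hypothetical and are not part of Sutando.

```python
import shutil


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        env[key.strip()] = value.strip().strip("\"'")
    return env


def missing_prerequisites(env_text: str, cli_name: str = "claude") -> list[str]:
    """Return human-readable problems; an empty list means both checks passed."""
    problems = []
    if not parse_env(env_text).get("GEMINI_API_KEY"):
        problems.append("GEMINI_API_KEY is not set in .env")
    # cli_name is an assumption about how Claude Code is exposed on this machine.
    if shutil.which(cli_name) is None:
        problems.append(f"'{cli_name}' was not found on PATH")
    return problems
```

Calling `missing_prerequisites(open(".env").read())` from the repository root would report either problem before `bash src/startup.sh` is run.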

    Capabilities

    Key Features

    • Voice control via browser or phone
    • Screen capture and vision analysis
    • Autonomous meeting joining (Zoom, Google Meet)
    • Outbound and inbound phone calls via Twilio
    • Multi-channel messaging (Telegram, Discord, email, web)
    • Autonomous build loop and self-improvement
    • Voice-driven note capture and search
    • Multi-machine agent scaling and migration
    • 3-tier access control (owner/verified/unverified)
    • Global keyboard shortcuts via macOS menu bar app
    • Proactive health monitoring and auto-repair
    • Pattern detection and user modeling
    • Gmail read/send/search
    • Calendar and reminders integration
    • Browser automation via MCP tools
    • Cross-node memory and notes sync
    • Info-radar (arXiv, GitHub, HN, news monitoring)
    • System dashboard at localhost:7844
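
The owner/verified/unverified split listed above can be pictured as an ordered set of tiers, each capability requiring a minimum tier. This is a sketch of that idea only, not Sutando's implementation. The capability names, the `REQUIRED_TIER` table, and the `attestation_ok` flag (standing in for the result of a caller ID check such as STIR/SHAKEN) are all assumptions.

```python
from enum import IntEnum


class Tier(IntEnum):
    """Caller tiers, ordered so that a higher value means more trust."""
    UNVERIFIED = 0
    VERIFIED = 1
    OWNER = 2


# Hypothetical capability names and the minimum tier each one requires.
REQUIRED_TIER = {
    "leave_message": Tier.UNVERIFIED,
    "ask_availability": Tier.VERIFIED,
    "send_email": Tier.OWNER,
    "run_shell_command": Tier.OWNER,
}


def classify_caller(
    caller_id: str,
    owner_ids: set[str],
    verified_ids: set[str],
    attestation_ok: bool = True,
) -> Tier:
    """Map a caller to a tier; a failed caller ID check always yields UNVERIFIED."""
    if not attestation_ok:
        return Tier.UNVERIFIED
    if caller_id in owner_ids:
        return Tier.OWNER
    if caller_id in verified_ids:
        return Tier.VERIFIED
    return Tier.UNVERIFIED


def is_allowed(tier: Tier, capability: str) -> bool:
    """Deny unknown capabilities by default; otherwise compare tiers."""
    required = REQUIRED_TIER.get(capability)
    return required is not None and tier >= required
```

Denying capabilities that are missing from the table keeps the default safe when a new skill is added without an explicit tier.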

    Integrations

    Claude Code
    Gemini Live API
    Twilio
    ngrok
    Telegram
    Discord
    Gmail (Google Workspace OAuth)
    Google Calendar
    Zoom
    Google Meet
    macOS Contacts
    macOS Reminders
    Claude for Chrome (browser extension)
    WhatsApp (wacli)
    API Available

    Developer

    sonichi

    sonichi builds Sutando, an open-source self-hosted AI personal agent for macOS that combines voice, vision, and autonomous action. The project runs on Claude Code and Gemini Live, enabling hands-free computer control, meeting participation, and phone calls. Sutando's autonomous build loop writes most of its own code, and the project is developed openly on GitHub under the MIT license.

    Founded 2024
    Seattle, WA
    5 employees

    Used by

    Community of 150+ GitHub stargazers
    AI researchers at Berkeley RDI and…
    1 tool in directory

    Similar Tools

    Vocode

    Open source voice AI framework for building, deploying, and scaling hyperrealistic voice agents.

    Pipecat

    An open-source Python framework for building real-time voice and multimodal conversational AI agents with composable pipelines and ultra-low latency.

    OpenAI Agents SDK

    OpenAI's lightweight, provider-agnostic framework for building multi-agent and voice agent workflows in Python and TypeScript with very few abstractions.


    Related Topics

    Voice Assistant

    AI voice assistants that perform tasks through voice commands.

    36 tools

    Autonomous Systems

    AI agents that can perform complex tasks with minimal human guidance.

    191 tools

    Agent Frameworks

    Tools and platforms for building and deploying custom AI agents.

    276 tools
    With AI, Everyone is a Dev. EveryDev.ai © 2026