MulmoChat

Name: MulmoChat
Availability: OnlineOnly
Author: receptron

A research prototype exploring multimodal AI chat experiences where users converse naturally while experiencing rich visual and interactive content on a canvas.

Visit Website

At a Glance

Pricing

Open Source

Free and open-source research prototype available on GitHub.

Engagement

Available On

Web

API

CLI

receptronreceptron builds open-source AI tooling and research prototy…

Listed Mar 2026

About MulmoChat

MulmoChat is a research prototype that reimagines AI chat interfaces by combining natural language conversation with rich visual and interactive content rendered directly on a canvas. Unlike traditional text-based chat, MulmoChat enables images to materialize, maps to become explorable, and games to turn playable—all within the conversational flow. Built with TypeScript and Vue, it supports multiple LLM providers and local image generation via ComfyUI integration.

Multimodal Canvas Interface — Users engage in voice or text chat while visual content (images, maps, interactive elements) renders simultaneously on a canvas alongside the conversation.
Voice Chat Support — Click "Start Voice Chat" to interact with the AI using your microphone; the system processes speech and responds with both text and visual output.
Provider-Agnostic Text API — A unified /api/text/generate endpoint supports OpenAI, Anthropic, Google Gemini, and Ollama, normalizing responses across vendors.
ComfyUI Local Image Generation — Integrates with ComfyUI Desktop for local image generation using FLUX and Stable Diffusion models, with automatic model-specific parameter optimization.
Plugin Architecture — Developers can extend MulmoChat with new capabilities by implementing TypeScript contracts, Vue views, and configuration as described in TOOLPLUGIN.md.
Multi-Provider LLM Support — Configure API keys for OpenAI, Anthropic, Gemini, and Ollama; each provider can be enabled independently via environment variables.
AI-Powered Search — Optional Exa integration enables AI-powered web search within the conversational interface.
Google Maps Integration — Optional Google Maps API key enables explorable map experiences directly within the chat canvas.
Getting Started — Run yarn install, create a .env file with your API keys, then yarn dev to launch the development server and open the browser.

Community Discussions

Be the first to start a conversation about MulmoChat

Share your experience with MulmoChat, ask questions, or help others learn from your insights.

Pricing

OPEN SOURCE

Open Source

Free and open-source research prototype available on GitHub.

Full source code access
Voice chat interface
Multimodal canvas
Multi-provider LLM support
ComfyUI integration

Capabilities

Key Features

Multimodal canvas chat interface
Voice chat support
Provider-agnostic text generation API
ComfyUI local image generation
Plugin/tool extension architecture
OpenAI, Anthropic, Gemini, and Ollama support
AI-powered search via Exa
Google Maps integration
FLUX and Stable Diffusion model support
Unified text API with normalized responses

Integrations

OpenAI

Anthropic

Google Gemini

Ollama

ComfyUI

Exa

Google Maps

API Available

View Docs

Back to all tools Suggest an edit

About MulmoChat

Multimodal Canvas Interface — Users engage in voice or text chat while visual content (images, maps, interactive elements) renders simultaneously on a canvas alongside the conversation.
Voice Chat Support — Click "Start Voice Chat" to interact with the AI using your microphone; the system processes speech and responds with both text and visual output.
Provider-Agnostic Text API — A unified /api/text/generate endpoint supports OpenAI, Anthropic, Google Gemini, and Ollama, normalizing responses across vendors.
ComfyUI Local Image Generation — Integrates with ComfyUI Desktop for local image generation using FLUX and Stable Diffusion models, with automatic model-specific parameter optimization.
Plugin Architecture — Developers can extend MulmoChat with new capabilities by implementing TypeScript contracts, Vue views, and configuration as described in TOOLPLUGIN.md.
Multi-Provider LLM Support — Configure API keys for OpenAI, Anthropic, Gemini, and Ollama; each provider can be enabled independently via environment variables.
AI-Powered Search — Optional Exa integration enables AI-powered web search within the conversational interface.
Google Maps Integration — Optional Google Maps API key enables explorable map experiences directly within the chat canvas.
Getting Started — Run yarn install, create a .env file with your API keys, then yarn dev to launch the development server and open the browser.

MulmoChat