Sonna Documentation
Sonna is the open-source, local-first AI voice studio — a free alternative to ElevenLabs and WisprFlow in one app. Clone voices, generate speech across 7 TTS engines, dictate into any app with a global hotkey, compose multi-voice projects, and let any MCP-aware agent speak in a voice you own. Everything runs on your hardware.

- Dictation — hold a chord anywhere on your machine, speak, release; the transcript pastes into the focused field
- Captures tab — paired audio + transcript archive, retranscribe / refine / play-as-voice
- Voice personalities — per-profile compose button + persona-rewrite toggle, powered by a local LLM
- Agents speak back — any MCP-aware agent can call Sonna to speak in one of your cloned voices
- 7 TTS engines — Qwen3-TTS, Qwen CustomVoice, LuxTTS, Chatterbox Multilingual, Chatterbox Turbo, HumeAI TADA, Kokoro
- Cloning and preset voices — zero-shot cloning or 50+ curated preset voices
- 23 languages — from English to Arabic, Japanese, Hindi, Swahili
- Post-processing effects — pitch shift, reverb, delay, chorus, compression, filters
- Expressive speech — paralinguistic tags ([laugh], [sigh]) and natural-language delivery control
- Unlimited length — auto-chunking with crossfade for long scripts
- Stories editor — multi-track timeline for conversations, podcasts, narratives
- API-first — REST + WebSocket API, MCP server for agent integrations
- Complete privacy — models, audio, transcripts, LLM output never leave your machine
- Runs everywhere — macOS (MLX/Metal), Windows (CUDA / DirectML), Linux (ROCm / CPU), Intel Arc, Docker
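The "auto-chunking with crossfade" item in the list above can be pictured with a small sketch. This is not Sonna's implementation, only an illustration of the general technique under simple assumptions: text is split on sentence boundaries, each chunk is synthesized separately (not shown here), and the resulting audio is a plain list of float samples. The function names `split_into_chunks`, `crossfade`, and `join_chunks`, and the `max_chars` and `overlap` parameters, are hypothetical.

```python
import re


def split_into_chunks(text, max_chars=200):
    """Greedily pack whole sentences into chunks of roughly max_chars characters.

    A single sentence longer than max_chars is kept whole as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def crossfade(a, b, overlap):
    """Join two sample lists, linearly blending the tail of a into the head of b."""
    overlap = min(overlap, len(a), len(b))
    if overlap == 0:
        return list(a) + list(b)
    blended = []
    for i in range(overlap):
        w = (i + 1) / (overlap + 1)  # weight of b, rising across the overlap
        blended.append(a[len(a) - overlap + i] * (1 - w) + b[i] * w)
    return list(a[:len(a) - overlap]) + blended + list(b[overlap:])


def join_chunks(audio_chunks, overlap):
    """Crossfade a sequence of per-chunk sample lists into one continuous list."""
    out = []
    for chunk in audio_chunks:
        out = crossfade(out, chunk, overlap) if out else list(chunk)
    return out
```

In this sketch, each crossfade shortens the joined audio by `overlap` samples, so three 10-sample chunks joined with an overlap of 4 give 22 samples. A real engine would likely work on NumPy arrays and might use an equal-power fade rather than a linear one.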
Download
| Platform | Download |
|---|---|
| macOS (Apple Silicon) | Download DMG |
| macOS (Intel) | Download DMG |
| Windows | Download MSI |
| Docker | docker compose up |
Get Started
- Installation — download and install Sonna
- Quick Start — get up and running in 5 minutes
- Dictation — start talking to your computer
- Voice Personalities — compose and rewrite in any profile
- API Reference — integrate voice synthesis into your apps