Sonna

Sonna Documentation

Sonna is the open-source, local-first AI voice studio — a free alternative to ElevenLabs and WisprFlow, running entirely on your machine.

Sonna is the open-source, local-first AI voice studio — a free alternative to ElevenLabs and WisprFlow in one app. Clone voices, generate speech across 7 TTS engines, dictate into any app with a global hotkey, compose multi-voice projects, and let any MCP-aware agent speak in a voice you own. Everything runs on your hardware.

Sonna App Screenshot

  • Dictation — hold a chord anywhere on your machine, speak, release; the transcript pastes into the focused field
  • Captures tab — paired audio + transcript archive, retranscribe / refine / play-as-voice
  • Voice personalities — per-profile compose button + persona-rewrite toggle, powered by a local LLM
  • Agents speak back — any MCP-aware agent can call Sonna to speak in one of your cloned voices
  • 7 TTS engines — Qwen3-TTS, Qwen CustomVoice, LuxTTS, Chatterbox Multilingual, Chatterbox Turbo, HumeAI TADA, Kokoro
  • Cloning and preset voices — zero-shot cloning or 50+ curated preset voices
  • 23 languages — from English to Arabic, Japanese, Hindi, Swahili
  • Post-processing effects — pitch shift, reverb, delay, chorus, compression, filters
  • Expressive speech — paralinguistic tags ([laugh], [sigh]) and natural-language delivery control
  • Unlimited length — auto-chunking with crossfade for long scripts
  • Stories editor — multi-track timeline for conversations, podcasts, narratives
  • API-first — REST + WebSocket API, MCP server for agent integrations
  • Complete privacy — models, audio, transcripts, LLM output never leave your machine
  • Runs everywhere — macOS (MLX/Metal), Windows (CUDA / DirectML), Linux (ROCm / CPU), Intel Arc, Docker

Download

PlatformDownload
macOS (Apple Silicon)Download DMG
macOS (Intel)Download DMG
WindowsDownload MSI
Dockerdocker compose up

View all releases

Get Started

On this page