
Open-source local AI voice studio for cloning voices, generating speech, dictation, and giving MCP-aware agents custom voices.
Voicebox is a free, open-source, local-first AI voice studio from the jamiepine/voicebox GitHub project. It is described as an alternative to ElevenLabs and WisprFlow in one app, combining voice output and input workflows. Voicebox can clone voices from a few seconds of audio, generate speech in 23 languages across seven TTS engines, provide global-hotkey dictation into text fields, and let MCP-aware AI agents speak using voices the user owns. The project emphasizes local execution and privacy, with models, voice data, and captures running on the user's machine.
21%
Loading Community Opinions...
Generate setup files, upload your own, or launch from a kit. Chat in the browser first, then attach WhatsApp, Telegram, or Slack when it is useful.
Hosted agent
OpenClaw or Hermes