Open Source Voice Cloning
Cloning a voice now takes only a few minutes of audio, which turns the whole category into a question of control: whose voice, with what consent, and whether the sample ever touches a server you don't run. The open source synthesis and cloning tools here generate speech locally on your own hardware, so the voice you train on and the audio you produce stay on your machine. That keeps both the ethical line and the data in your hands rather than a cloud vendor's.
GPT-SoVITS-WebUI
Few-shot voice cloning and text-to-speech WebUI that can fine-tune from 1 minute of voice data

OpenVoice
Instant voice cloning model with tone color cloning, style control, and cross-lingual speech generation

RVC WebUI
VITS-based voice conversion web UI for training and running voice models from short audio samples

Chatterbox
Open-source text-to-speech models with multilingual voice cloning and built-in watermarking

F5-TTS
Text-to-speech system for fluent, faithful speech generation with flow matching

Coqui TTS
Advanced text-to-speech library with pretrained models in 1100+ languages and voice cloning tools