Clone any voice with AI โ Create personalized voice synthesis from just 10-30 seconds of audio. Powered by Coqui XTTS.
๐ค HuggingFace ๐ Try Live Demo
Step 1: Upload 10s of audio โ "This is my voice"
Step 2: AI clones the voice
Step 3: Generate anything โ "Hello, I am cloned!"
Voice AI is exploding โ from content creation to accessibility, voice cloning is revolutionizing communication.
| Feature | Description |
|---|---|
| ๐ฏ Voice Cloning | Clone from 10-30 seconds of audio |
| ๐ Multi-language | English, Arabic, Spanish, French, German |
| โก Fast | Generate speech in seconds |
| ๐ High Quality | 44.1kHz output |
| ๐๏ธ Gradio UI | Beautiful web interface |
| ๐ณ Docker | One-command deployment |
git clone https://github.com/walidsobhie-code/ai-voice-clone.git
cd ai-voice-clone
pip install -r requirements.txt# Upload 10-30 seconds of clear audio
python clone_voice.py --input my_voice.wav --output my_cloned_voice
# Output:
# ๐ค Loading audio: my_voice.wav
# ๐ Cloning voice...
# โ
Voice cloned successfully!python synthesize.py --model my_cloned_voice.wav \
--text "Hello everyone, this is my cloned voice!" \
--output hello.wavpython gradio_app.py
# Opens: http://localhost:7860โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ ๐๏ธ AI Voice Clone โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ โ
โ [๐ค Clone Voice] [๐ค Generate Speech] โ
โ โ
โ โโโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโโ โ
โ โ Upload Audio: โ โ Text Input: โ โ
โ โ [my_voice.wav] โ โ Hello world! โ โ
โ โ Duration: 15s โ โ Speed: [1.0x] โ โ
โ โโโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโโ โ
โ โ
โ [๐ Clone Voice] [๐ค Generate] โ
โ โ
โ Status: โ
โ โ
Voice cloned successfully! โ
โ ๐ Model: my_cloned_voice โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
from clone_voice import clone_voice
from synthesize import synthesize
# Step 1: Clone voice
result = clone_voice(
input_file="my_voice.wav",
output_name="my_model"
)
print(result)
# {'status': 'success', 'output': 'my_model.wav'}
# Step 2: Generate speech
result = synthesize(
model_path="my_model.wav",
text="Hello, I sound exactly like the original!",
output="output.wav"
)
print(result)
# {'status': 'success', 'output': 'output.wav'}| Industry | Use Case |
|---|---|
| ๐ฌ Content | Create videos with any voice |
| ๐ง Podcast | Clone voices for narration |
| โฟ Accessibility | Read text in any voice |
| ๐ข Brand | Consistent brand voice across content |
| ๐ Education | Localize content in local voices |
| ๐ฎ Gaming | Character voices for games |
Original Audio (10-30s)
โ
XTTS Encoder (extracts voiceprint)
โ
Latent Space Representation
โ
XTTS Decoder (generates new audio)
โ
Your Cloned Voice Saying Anything!
# Build
docker build -t voice-clone .
# Run
docker run -p 7860:7860 voice-cloneai-voice-clone/
โโโ clone_voice.py # Voice cloning
โโโ synthesize.py # Speech synthesis
โโโ gradio_app.py # Web UI
โโโ requirements.txt
โโโ Dockerfile
โโโ examples/
โโโ basic_clone.py
โโโ multi_language.py
This tool should only be used ethically:
- โ With consent of the voice owner
- โ For legitimate purposes (accessibility, entertainment)
- โ NOT for fraud, impersonation, or deception
See CONTRIBUTING.md
If this helped you, please star the repo!
Built with โค๏ธ by walidsobhie-code
- [Planned] Web version / hosted demo
- [Planned] API endpoint for production use
- [Planned] Support for more languages
- [In Progress] Performance optimizations
- [Done] Gradio web interface
- [Done] Docker deployment
Have a project using this? Send a PR to add your company!
- (coming soon โ be the first to list your project!)
We welcome contributions! Please see CONTRIBUTING.md for guidelines.