voice-clone - 技术专题深度解读

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

text-to-speech tts voice-cloning vits voice-clone voice-cloneai

Updated Jun 16, 2026
Python

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speech tts voice-clone zero-shot-tts

Updated Apr 19, 2025
Python

jamiepine / voicebox

The open-source AI voice studio. Clone, dictate, create.

ai cuda whisper mlx voice-ai voice-clone qwen3-tts qwen3-tts-ui

Updated Apr 26, 2026
TypeScript

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

text-to-speech tts cross-lingual voice-clone zero-shot-tts bigvgan indextts

Updated Jun 16, 2026
Python

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

text-to-speech tts gpt transformer-architecture emotional-speech voice-clone vall-e

Updated Feb 11, 2024
Python

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated Jul 30, 2024
Python

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

multilingual realtime tts english chinese streaming-audio multi-modality voice-clone audio-tokenizer

Updated Jun 2, 2026
Python

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Jun 14, 2026
Python

High-Logic / Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

text-to-speech tts voice-cloning vits voice-clone gpt-sovits

Updated Apr 18, 2026
Python

wladradchenko / wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

free face-swap photo-editing video-editing lip-sync public-api remover restyle video-generation deepfake face-animation diffusion-models remove-background voice-clone controlnet segment-anything txt2video wunjo img2video

Updated Feb 3, 2026
JavaScript

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

text-to-speech speech tts speech-synthesis gan language-model diffusion voice-clone flow-matching

Updated Sep 28, 2025
Python

travisvn / chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)

python docker text-to-speech ai cuda speech self-hosted tts openai gpt chatterbox voice-cloning openai-api voice-clone chatgpt elevenlabs local-llm ollama open-webui

Updated Dec 23, 2025
Python

BoltzmannEntropy / MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support

python osx mcp voice tts rvc audiobooks flutter-apps audio-book-converter flutter-examples flutter-ui flutter-app voice-cloning apple-silicon voice-clone qwen xttsv2 qwen3 qwen3-tts

Updated Apr 1, 2026
Dart

lukaszliniewicz / Pandrator

Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Updated Jun 17, 2026
Python

HKoon / ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

tts openvoice voice-cloning voice-clone chattts

Updated Jun 13, 2026
Python

Saganaki22 / ComfyUI-OmniVoice-TTS

Sponsor

OmniVoice TTS nodes for ComfyUI - Zero-shot multilingual text-to-speech with voice cloning, voice design, and multi-speaker dialogue

text-to-speech tts voice-cloning custom-nodes voice-clone comfyui

Updated Jun 11, 2026
Python

LSimon95 / megatts2

Unoffical implementation of Megatts2

tts voice-clone

Updated Mar 23, 2024
Python

netease-youdao / Confucius4-TTS

Confucius4-TTS: a Multilingual and Cross-Lingual Zero-Shot TTS Engine

audio python text-to-speech deep-learning multi-lingual pytorch tts speech-synthesis cross-lingual fine-tuning voice-clone zero-shot-tts

Updated Jun 17, 2026
Python

kk43994 / kkclaw

🦞 一个可爱的桌面龙虾AI助手 - Desktop lobster pet with OpenClaw AI, Edge TTS voice, and emotion animations

electron tts lobster desktop-pet ai-assistant voice-clone openclaw

Updated Apr 14, 2026
JavaScript

Saganaki22 / ComfyUI-VoxCPM2