相关标签
text-to-speechttsvoice-cloningvitsvoice-clonevoice-cloneaizero-shot-ttsaicudawhisper

Here are 85 public repositories matching this topic...

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

  • Updated Jun 2, 2026
  • Python

Turn PDFs and EPUBs into audiobooks; subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

  • Updated Jun 17, 2026
  • Python