Xerophayze/TTS-Storyv2.0updated 13h ago
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
1 check-inNVIDIAAMDApple
Finrandojin/alexandria-audiobookv5.0updated 2d ago
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
@finrandojin5 check-insNVIDIAAMDApple
SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition-Pinokiov3.7updated 1mo ago
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
@sup3rmass1ve7 check-insNVIDIAAMDApple
6Morpheus6/Chatteredv3.7updated 1mo ago
All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech generation with configurable parameters. (Minimum Requirements 4GB VRAM / Recommended Requirements 8GB VRAM)
@morpheus1 check-inNVIDIAAMDApple
6Morpheus6/alltalk-ttsv3.3updated 1mo ago
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
@morpheus5 check-insNVIDIAAMDApple
senigami/audiobook-studio.pinokiov3.7updated 2mo ago
Local-first AI audiobook production with voice cloning and chapter repair tools. This is the easiest way to install locally, including an optional demo voice library so you can start exploring right away. Live demo: senigami.github.io/audiobook-studio
@senigami8 check-insNVIDIAAMDApple
Xeronal81/Qwen3-TTS-Pinokiov5.0updated 3mo ago
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
12 check-insNVIDIAAMDApple
6Morpheus6/xtts.pinokiov3.7updated 4mo ago
clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)
@morpheus4 check-insNVIDIAAMDApple
TheAwaken1/LuxTTS-Studiov2.0updated 4mo ago
Gradio-based web interface for the LuxTTS voice cloning and text-to-speech model, enabling users to generate customized speech from text using uploaded or recorded audio references with adjustable parameters like speed, guidance scale, and inference steps.
@theawakenone2 check-insNVIDIAAMDApple
TheAwaken1/LiquidAI-LFM2.5-Playgroundv2.0updated 4mo ago
Local multimodal app powered by Liquid AI LFM2.5-Audio-1.5B and LFM2.5-VL-1.6B models, delivering real-time voice chat, text-to-speech synthesis, long-form audio transcription, and multi-image vision reasoning.
@theawakenone1 check-inNVIDIAAMDApple
6Morpheus6/barkv3.7updated 6mo ago
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
@morpheus2 check-insNVIDIAAMDApple