Pinokio

Local-first audiobook production with cloned voices, chapter-based editing, segment regeneration, narrator + character casting, and final export in one web app. XTTS runs fully local by default. Voxtral is available as an optional cloud voice path with your own Mistral API key if you want another engine for specific voices. Voice profiles can use different engines in the same project, so you can mix narrator and character workflows without leaving Audiobook Studio. Built for iterative production: preview voices, regenerate only changed sections, queue chapter or segment work, and assemble the finished audiobook when everything is ready. Note: enabling Voxtral sends synthesis text and selected reference audio to Mistral. Learn more: https://senigami.github.io/audiobook-studio/
3 check-ins
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
A multi-voice AI audiobook generator built on Qwen3-TTS: annotate scripts with an LLM, assign unique voices to each character, apply per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects.
4 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Wan2GP (Featured)
Super-optimized Gradio UI for AI video creation on GPU-poor machines (6 GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, and Flux. https://github.com/deepbeepmeep/Wan2GP
81 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Qwen3-TTS (Featured)
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team.
16 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
High-quality text-to-speech with a beautiful web UI and API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural-language voice creation), and Voice Cloning, plus enhanced support for saving custom voices and long-form/endless TTS streaming.
50 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Kokoro, KittenTTS, Higgs Audio, Chatterbox/Multi, Fish-Speech, F5, index-tts, indextts2, VoxCPM, and VibeVoice in one app.
19 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Voicebox (Featured)
Local-first voice synthesis studio powered by Qwen3-TTS.
19 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
ComfyUI (Featured)
The most powerful and modular diffusion-model GUI, API, and backend, with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
38 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
e2-f5-tts (Featured)
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
[NVIDIA only] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS, and RVC, based on CoquiTTS, and includes a fine-tuning mode.
Kokoro, KittenTTS, Higgs Audio, Chatterbox/Multi, Fish-Speech, F5, index-tts, indextts2, VoxCPM, and VibeVoice in one app.
7 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Orpheus TTS is an open-source text-to-speech system built on a Llama-3B backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis. https://github.com/canopyai/Orpheus-TTS
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team.
9 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
Clone voices into different languages using just a quick 3-second audio clip (a local version of https://huggingface.co/spaces/coqui/xtts).
3 check-ins
Platforms: GPU (NVIDIA, AMD, Apple)
A Gradio-based web interface for the LuxTTS voice-cloning and text-to-speech model. Generate customized speech from text using uploaded or recorded audio references, with adjustable parameters such as speed, guidance scale, and inference steps.
Real-time streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B.
3 check-ins
OpenAudio (Featured)
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
Local multimodal app powered by Liquid AI LFM2.5-Audio-1.5B and LFM2.5-VL-1.6B models, delivering real-time voice chat, text-to-speech synthesis, long-form audio transcription, and multi-image vision reasoning.
All-in-one Gradio interface for Chatterbox: voice cloning from uploaded audio samples, automatic text processing for long content, and real-time speech generation with configurable parameters. (Minimum 4 GB VRAM; 8 GB recommended.)