Pinokio

Type:All

Platform:All

GPU:All

Tag:#ttsx

Latest Check-ins Name

Sort:Latest

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 13h ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studiov3.7updated 6d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio

30 check-insNVIDIAAMDApple

Whisper-WebUI

pinokiofactory/whisper-webuiv3.7updated 9d ago

A Web UI for easy subtitle using whisper model.

#ai #gradio #tts #whisper

2 check-insNVIDIAAMDApple

e2-f5-tts

pinokiofactory/e2-f5-ttsv3.7updated 9d ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #voice-clone #ai

13 check-insNVIDIAAMDApple

Wan2GP

pinokiofactory/wanv3.7updated 13d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video #video-generation #wan #image #ai #1 #image-generation #gradio

187 check-insNVIDIAAMDApple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 16d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

61 check-insNVIDIAAMDApple

Voicebox

cocktailpeanut/voicebox.pinokiov5.0updated 18d ago

Local-first voice synthesis studio powered by Qwen3-TTS.

#tts #voice-clone

@cocktailpeanut

30 check-insNVIDIAAMDApple

Comfyui

pinokiofactory/comfyv3.7updated 18d ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #video #image #ai #audio #image-generation #node-interface

72 check-insNVIDIAAMDApple

VoxCPM

IAnMove/voxcpm2-pinokio-launcherv7.0updated 28d ago

Tokenizer-free multilingual TTS and voice cloning with low-VRAM and VoxCPM2 Web UI/API launch modes.

#ai #tts

@theinaog

2 check-insNVIDIAAMDApple

VibeVoice Realtime

pinokiofactory/vibevoice-realtimev5.0updated 1mo ago

Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B

#ai #tts

5 check-insNVIDIAAMDApple

OpenAudio

pinokiofactory/openaudiov3.7updated 1mo ago

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech

#openaudio #ai #audio #gradio #tts

12 check-insNVIDIAAMDApple

Orpheus-TTS-FastAPI

pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 1mo ago

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS

#ai #tts

0 check-insNVIDIAAMDApple

Qwen3-TTS

SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1mo ago

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team

#tts #voice #qwen3-tts #ai

@sup3rmass1ve

24 check-insNVIDIAAMDApple

XTTS

cocktailpeanut/xtts.pinokiov3.0updated 1mo ago

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

#ai #tts

@cocktailpeanut1 check-inNVIDIAAMDApple

OpenVoice

cocktailpeanutlabs/openvoicev1updated 4mo ago

Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice

#tts #ai

4 check-insNVIDIAAMDApple

StyleTTS2 Studio

pinokiofactory/StyleTTS2_Studiov3.7updated 4mo ago

Build your own voice for StyleTTS2

#ai #tts

2 check-insNVIDIAAMDApple

Openvoice2

cocktailpeanutlabs/openvoice2v3.0updated 5mo ago

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

#ai #tts

1 check-inNVIDIAAMDApple

Dia

pinokiofactory/diav3.7updated 5mo ago

Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia

#ai #tts

0 check-insNVIDIAAMDApple

zonos

pinokiofactory/zonosv3.7updated 5mo ago

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #tts

3 check-insNVIDIAAMDApple

MeloTTS

cocktailpeanutlabs/melottsv1.2updated 9mo ago

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS

#ai #tts

2 check-insNVIDIAAMDApple

Store