Pinokio
Tag#tts
Install Pinokio
Log inRegister
Log inRegister

Store

Type:api
apipluginAll
Platform:All
AllmacOSWindowsLinux
GPU:All
AllNVIDIAAMDApple
Tag:#ttsx
Sort by
LatestCheck-insName
Sort:Latest
LatestCheck-insName
#tts20#ai19#voice-clone6#gradio4#voice4#13#audio3##ai-#tts2#fubar2#image2#image-edit2#image-generation2#lipsync2#qwen2#qwen3-tts2#song2#video2#mac1#mlx1#node-interface1#whisper1
DramaBoxFeatured
PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 7h ago
Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI
#ai#tts#voice-clone
@pierrunoyt5 check-insNVIDIAAMDApple
Ultimate-TTS-StudioFeatured
pinokiofactory/Ultimate-TTS-Studiov3.7updated 5d ago
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
#tts#ai#gradio
30 check-insNVIDIAAMDApple
Whisper-WebUIFeatured
pinokiofactory/whisper-webuiv3.7updated 8d ago
A Web UI for easy subtitle using whisper model.
#ai#gradio#tts#whisper
2 check-insNVIDIAAMDApple
e2-f5-ttsFeatured
pinokiofactory/e2-f5-ttsv3.7updated 8d ago
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
#tts#voice-clone#ai
13 check-insNVIDIAAMDApple
Wan2GPFeatured
pinokiofactory/wanv3.7updated 12d ago
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
#video#video-generation#wan#image#ai#1#image-generation#gradio
185 check-insNVIDIAAMDApple
Qwen3-TTS MLX WebUI EnhancedFeatured
Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 15d ago
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
#mlx#qwen#tts#ai#mac
@blizaine61 check-insNVIDIAAMDApple
VoiceboxFeatured
cocktailpeanut/voicebox.pinokiov5.0updated 17d ago
Local-first voice synthesis studio powered by Qwen3-TTS.
#tts#voice-clone
@cocktailpeanut30 check-insNVIDIAAMDApple
ComfyuiFeatured
pinokiofactory/comfyv3.7updated 18d ago
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
#comfyui#video#image#ai#audio#image-generation#node-interface
72 check-insNVIDIAAMDApple
VoxCPMFeatured
IAnMove/voxcpm2-pinokio-launcherv7.0updated 28d ago
Tokenizer-free multilingual TTS and voice cloning with low-VRAM and VoxCPM2 Web UI/API launch modes.
#ai#tts
@theinaog2 check-insNVIDIAAMDApple
VibeVoice RealtimeFeatured
pinokiofactory/vibevoice-realtimev5.0updated 1mo ago
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
#ai#tts
5 check-insNVIDIAAMDApple
OpenAudioFeatured
pinokiofactory/openaudiov3.7updated 1mo ago
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
#openaudio#ai#audio#gradio#tts
12 check-insNVIDIAAMDApple
Orpheus-TTS-FastAPIFeatured
pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 1mo ago
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
#ai#tts
0 check-insNVIDIAAMDApple
Qwen3-TTSFeatured
SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1mo ago
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
#tts#voice#qwen3-tts#ai
@sup3rmass1ve23 check-insNVIDIAAMDApple
XTTSFeatured
cocktailpeanut/xtts.pinokiov3.0updated 1mo ago
clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)
#ai#tts
@cocktailpeanut1 check-inNVIDIAAMDApple
OpenVoiceFeatured
cocktailpeanutlabs/openvoicev1updated 4mo ago
Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice
#tts#ai
4 check-insNVIDIAAMDApple
StyleTTS2 StudioFeatured
pinokiofactory/StyleTTS2_Studiov3.7updated 4mo ago
Build your own voice for StyleTTS2
#ai#tts
2 check-insNVIDIAAMDApple
Openvoice2Featured
cocktailpeanutlabs/openvoice2v3.0updated 5mo ago
Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793
#ai#tts
1 check-inNVIDIAAMDApple
DiaFeatured
pinokiofactory/diav3.7updated 5mo ago
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
#ai#tts
0 check-insNVIDIAAMDApple
zonosFeatured
pinokiofactory/zonosv3.7updated 5mo ago
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
#ai#tts
3 check-insNVIDIAAMDApple
MeloTTSFeatured
cocktailpeanutlabs/melottsv1.2updated 9mo ago
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS
#ai#tts
2 check-insNVIDIAAMDApple
Previous
12
NextPage 1 of 2
Pinokio
PrivacyTerms