Pinokio

Type:All

Platform:All

GPU:All

Tag:#ttsx

Latest Check-ins Name

Sort:Check-ins

Wan2GP

pinokiofactory/wanv3.7updated 5d ago

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#video-generation #wan #wan2gp #video #image #ai #ai-video-generator #1 #image-generation #gradio

283 check-insNVIDIAAMDApple

Comfyui

pinokiofactory/comfyv3.7updated 21d ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI

#comfyui #ai #video #comfy #image #image-generation #audio #video-generation #node-interface

98 check-insNVIDIAAMDApple

Qwen3-TTS MLX WebUI Enhanced

Blizaine/Qwen3-TTS-MLX-WebUI-Enhancedv5.0updated 23d ago

High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.

#mlx #qwen #tts #ai #mac

@blizaine

63 check-insNVIDIAAMDApple

Maestro

Blizaine/Maestrov8.0updated 2h ago

An all-in-one, 100% local AI video, image & music studio. Its Director mode turns a single prompt into a full music video or short film — LLM-planned, shot by shot. Built on the WanGP pipeline (Wan 2.1/2.2, LTX-2.3, Qwen, Hunyuan Video, Flux). Requires an NVIDIA GPU (6GB+ VRAM).

#ai

@blizaine

49 check-insNVIDIAAMDApple

Ultimate-TTS-Studio

pinokiofactory/Ultimate-TTS-Studiov3.7updated 18d ago

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

#tts #ai #gradio #voice

42 check-insNVIDIAAMDApple

Voicebox

cocktailpeanut/voicebox.pinokiov5.0updated 1mo ago

Local-first voice synthesis studio powered by Qwen3-TTS.

#tts #voice-clone

@cocktailpeanut

34 check-insNVIDIAAMDApple

Qwen3-TTS

SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 10d ago

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team

#tts #voice #qwen3-tts #ai

@sup3rmass1ve

30 check-insNVIDIAAMDApple

e2-f5-tts

pinokiofactory/e2-f5-ttsv3.7updated 1mo ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #voice-clone #ai

16 check-insNVIDIAAMDApple

OpenAudio

pinokiofactory/openaudiov3.7updated 2mo ago

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech

#openaudio #ai #audio #gradio #tts

14 check-insNVIDIAAMDApple

zonos

pinokiofactory/zonosv3.7updated 1mo ago

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #tts

6 check-insNVIDIAAMDApple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokiov5.0updated 15d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-insNVIDIAAMDApple

Whisper-WebUI

pinokiofactory/whisper-webuiv3.7updated 28d ago

A Web UI for easy subtitle using whisper model.

#whisper #ai #gradio #tts

5 check-insNVIDIAAMDApple

OpenVoice

cocktailpeanutlabs/openvoicev1updated 6mo ago

Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice

#tts #ai

5 check-insNVIDIAAMDApple

VibeVoice Realtime

pinokiofactory/vibevoice-realtimev5.0updated 2mo ago

Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B

#ai #tts

5 check-insNVIDIAAMDApple

Uncensored Local Studio

cocktailpeanut/uncensored-local-studio.pinokiov8.0updated 1d ago

Run image generation, GGUF language models, Whisper speech recognition, and Kokoro speech synthesis locally from one offline studio.

#ai #gguf #image-generation #llm #speech-to-text #text-generation #text-to-speech #transcription #tts

@cocktailpeanut 4 check-insNVIDIAAMDApple

Openvoice2

cocktailpeanutlabs/openvoice2v3.0updated 6mo ago

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

#ai #tts

3 check-insNVIDIAAMDApple

VoxCPM

IAnMove/voxcpm2-pinokio-launcherv7.0updated 2mo ago

Tokenizer-free multilingual TTS and voice cloning with low-VRAM and VoxCPM2 Web UI/API launch modes.

#ai #tts

@theinaog

2 check-insNVIDIAAMDApple

StyleTTS2 Studio

pinokiofactory/StyleTTS2_Studiov3.7updated 6mo ago

Build your own voice for StyleTTS2

#ai #tts

2 check-insNVIDIAAMDApple

MeloTTS

cocktailpeanutlabs/melottsv1.2updated 11mo ago

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS

#ai #tts

2 check-insNVIDIAAMDApple

XTTS

cocktailpeanut/xtts.pinokiov3.0updated 3mo ago

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

#ai #tts

@cocktailpeanut1 check-inNVIDIAAMDApple

Store