Pinokio

@misiek

0 posts0 checkpointsJoined 1/27/2026, 10:08:53 AM
Apps @misiek follows
17 total
AllTalk-TTS v21/28/2026, 11:54:03 AM
https://github.com/6Morpheus6/alltalk-ttsv3.3updated 1/5/2026, 6:52:06 PM
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
Ultimate-TTS-Studio1/28/2026, 11:53:47 AM
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
LiquidAI-LFM2.5 Playground1/28/2026, 11:53:44 AM
Local multimodal app powered by Liquid AI LFM2.5-Audio-1.5B and LFM2.5-VL-1.6B models, delivering real-time voice chat, text-to-speech synthesis, long-form a...
e2-f5-tts1/28/2026, 11:53:42 AM
https://github.com/pinokiofactory/e2-f5-ttsv3.7updated 1/23/2026, 9:14:27 PM
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
Chattered1/28/2026, 11:53:41 AM
https://github.com/6Morpheus6/Chatteredv3.7updated 1/22/2026, 2:40:08 AM
All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech genera...
Z-Fusion1/27/2026, 10:10:46 AM
https://github.com/ai-anchorite/Z-Fusionv3.7updated 1/28/2026, 4:51:33 AM
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]
PersonaPlex1/27/2026, 10:10:43 AM
https://github.com/PierrunoYT/PersonaPlex-Pinokiov5.0updated 1/28/2026, 5:34:58 PM
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices. Requi...
SongGeneration Studio1/27/2026, 10:10:30 AM
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo...
Wan2GP1/27/2026, 10:10:27 AM
https://github.com/6Morpheus6/wan2gpv3.7updated 1/25/2026, 6:29:25 PM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...
Wan2GP1/27/2026, 10:10:25 AM
https://github.com/pinokiofactory/wanv3.7updated 1/28/2026, 9:41:31 AM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://git...
aura-sr-upscaler1/27/2026, 10:10:23 AM
AuraSR-v2 - An open reproduction of the GigaGAN Upscaler from fal.ai https://huggingface.co/spaces/gokaygokay/AuraSR-v2
MagicQuill1/27/2026, 10:10:16 AM
https://github.com/pinokiofactory/MagicQuillv3.7updated 1/11/2026, 8:07:39 PM
An intelligent, interactive Image Editing System. Easily erase and add objects on a user-friendly interface.
Whisper-WebUI1/27/2026, 10:10:14 AM
https://github.com/pinokiofactory/whisper-webuiv3.7updated 1/20/2026, 11:36:49 PM
A Web UI for easy subtitle using whisper model.
VibeVoice Realtime1/27/2026, 10:10:08 AM
https://github.com/pinokiofactory/vibevoice-realtimev5.0updated 12/22/2025, 10:00:08 PM
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
Qwen3-TTS1/27/2026, 10:10:01 AM
https://github.com/SUP3RMASS1VE/Qwen3-TTS-Pinokiov5.0updated 1/27/2026, 5:41:21 PM
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
Orpheus-TTS-FastAPI1/27/2026, 10:09:57 AM
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech s...
Moltbot (F.K.A. ClawdBot)1/27/2026, 10:09:50 AM
The AI that actually does things https://www.molt.bot