Pinokio
VyvoTTS LFM2
https://github.com/PierrunoYT/VyvoTTS-LFM2-Pinokio v1.0.0 updated 12/25/2025, 7:04:31 AM indexed 1/23/2026, 7:45:47 PM
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
Fara-7B Computer Use Agent
https://github.com/neviah/Fara-Pinokio v3.7 updated 12/24/2025, 12:31:42 PM indexed 1/23/2026, 7:47:03 PM
Microsoft's 7B parameter computer use agent with Gradio interface
Miratts Pinokio
https://github.com/SUP3RMASS1VE/MiraTTS-Pinokio v4.0 updated 12/24/2025, 1:16:33 AM indexed 1/23/2026, 7:46:33 PM
Moondream3 Gradio UI
https://github.com/PierrunoYT/moondream-3-pinokio v1.0.0 updated 12/24/2025, 12:48:21 AM indexed 1/23/2026, 7:44:53 PM
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
chatterbox
https://github.com/Paxurux/chatterbox-old-supermasive-vr v3.7 updated 12/24/2025, 12:45:51 AM indexed 1/23/2026, 7:45:39 PM
SoTA open-source TTS
Puter Model Emulator
https://github.com/amondeuz/puter-model-emulator v4.0 updated 12/24/2025, 12:45:36 AM indexed 1/23/2026, 7:46:10 PM
VibeVoice Realtime
https://github.com/pinokiofactory/vibevoice-realtime v5.0 updated 12/22/2025, 10:00:08 PM indexed 1/20/2026, 9:13:58 AM
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
AudioGradio
https://github.com/cocktailpeanut/audiogradio.pinokio updated 12/22/2025, 8:18:23 PM indexed 1/23/2026, 7:46:05 PM
One click installer for AudioCraft MusicGen and AudioGen Gradio UI (Requires at least Pinokio v0.0.56)
IndexTTS-2
https://github.com/6Morpheus6/IndexTTS2 v3.7 updated 12/22/2025, 2:17:20 AM indexed 1/23/2026, 7:44:41 PM
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
ComfyUI Image to 3D
https://github.com/V-Sekai-fire/pinokio-image-to-3d v1.0.0 updated 12/21/2025, 1:44:36 AM indexed 1/23/2026, 7:47:27 PM
ComfyUI with TRELLIS2, GeometryPack, and UniRig custom nodes for image-to-3D generation
PhotoMaker2
https://github.com/6Morpheus6/photomaker2 v3.7 updated 12/20/2025, 4:54:45 AM indexed 1/28/2026, 6:33:46 AM
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
Applio
https://github.com/pinokiofactory/applio v3.7 updated 12/19/2025, 4:34:35 AM indexed 1/23/2026, 7:46:17 PM
A simple, high-quality voice conversion tool focused on ease of use and performance.
Whisper-WebUI
https://github.com/6Morpheus6/whisper-webui v3.7 updated 12/18/2025, 9:09:52 PM indexed 1/23/2026, 7:47:29 PM
A web UI for easy subtitle generation using the Whisper model.
FramePack
https://github.com/pinokiofactory/Frame-Pack v3.7 updated 12/18/2025, 10:04:25 AM indexed 1/23/2026, 7:46:30 PM
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
Audio Flamingo 3
https://github.com/PierrunoYT/Audio-Flamingo-3-Pinokio v1.0.0 updated 12/15/2025, 4:41:12 PM indexed 1/23/2026, 7:45:54 PM
NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
ClearerVoice-Studio
https://github.com/gotoolkits/ClearerVoice-Studio v2.0 updated 12/15/2025, 2:13:11 PM indexed 1/23/2026, 7:48:24 PM
Umo
https://github.com/linus74rn/UmoPinokio v1.0 updated 12/15/2025, 7:41:31 AM indexed 1/23/2026, 7:47:30 PM
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
SillyTavern Character Generator
https://github.com/drago87/SillyTavern-Character-Generator v4.0 updated 12/14/2025, 4:03:35 PM indexed 1/23/2026, 7:47:36 PM
A Pinokio script for https://github.com/Tremontaine/character-card-generator. When used with KoboldCPP, use http://localhost:5001/v1, where 5001 is the port reported by KoboldCPP at startup. The Text API Key field must be filled with any value; leaving it empty causes an error.
Resemble Enhance
https://github.com/sealad886/pinokio-resemble-enhance v2.0 updated 12/13/2025, 11:46:10 PM indexed 1/23/2026, 7:45:44 PM
AI-powered speech denoising + enhancement (Gradio web demo + CLI).
GLM-TTS
https://github.com/PierrunoYT/GLM-TTS-Pinokio v1.0.0 updated 12/13/2025, 8:56:58 AM indexed 1/23/2026, 7:44:58 PM
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.