Pinokio
Explore tags
Fara-7B Computer Use Agent
https://github.com/neviah/Fara-Pinokiov3.7updated 12/24/2025, 12:31:42 PMindexed 1/23/2026, 7:47:03 PM
Microsoft's 7B parameter computer use agent with Gradio interface
Miratts Pinokio
https://github.com/SUP3RMASS1VE/MiraTTS-Pinokiov4.0updated 12/24/2025, 1:16:33 AMindexed 1/23/2026, 7:46:33 PM
Moondream3 Gradio UI
https://github.com/PierrunoYT/moondream-3-pinokiov1.0.0updated 12/24/2025, 12:48:21 AMindexed 1/23/2026, 7:44:53 PM
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
chatterbox
https://github.com/Paxurux/chatterbox-old-supermasive-vrv3.7updated 12/24/2025, 12:45:51 AMindexed 1/23/2026, 7:45:39 PM
SoTA open-source TTS
Puter Model Emulator
https://github.com/amondeuz/puter-model-emulatorv4.0updated 12/24/2025, 12:45:36 AMindexed 1/23/2026, 7:46:10 PM
chatterbox-tts-api
https://github.com/travisvn/chatterbox-tts-apiupdated 12/23/2025, 1:05:46 AMindexed 1/28/2026, 8:25:16 AM
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
VibeVoice Realtime
https://github.com/pinokiofactory/vibevoice-realtimev5.0updated 12/22/2025, 10:00:08 PMindexed 1/20/2026, 9:13:58 AM
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
AudioGradio
https://github.com/cocktailpeanut/audiogradio.pinokioupdated 12/22/2025, 8:18:23 PMindexed 1/23/2026, 7:46:05 PM
One click installer for AudioCraft MusicGen and AudioGen Gradio UI (Requires at least Pinokio v0.0.56)
IndexTTS-2
https://github.com/6Morpheus6/IndexTTS2v3.7updated 12/22/2025, 2:17:20 AMindexed 1/23/2026, 7:44:41 PM
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
Checked in recently
ComfyUI Image to 3D
https://github.com/V-Sekai-fire/pinokio-image-to-3dv1.0.0updated 12/21/2025, 1:44:36 AMindexed 1/23/2026, 7:47:27 PM
ComfyUI with TRELLIS2, GeometryPack, and UniRig custom nodes for image-to-3D generation
Chatterbox-TTS-Server
https://github.com/devnen/Chatterbox-TTS-Serverupdated 12/20/2025, 8:18:54 AMindexed 1/28/2026, 9:48:45 PM
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
PhotoMaker2
https://github.com/6Morpheus6/photomaker2v3.7updated 12/20/2025, 4:54:45 AMindexed 1/28/2026, 6:33:46 AM
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
Checked in recently
Whisper-WebUI
https://github.com/6Morpheus6/whisper-webuiv3.7updated 12/18/2025, 9:09:52 PMindexed 1/23/2026, 7:47:29 PM
A Web UI for easy subtitle using whisper model.
FramePack
https://github.com/pinokiofactory/Frame-Packv3.7updated 12/18/2025, 10:04:25 AMindexed 1/23/2026, 7:46:30 PM
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
facefusion
https://github.com/facefusion/facefusionupdated 12/17/2025, 5:37:51 PMindexed 1/27/2026, 8:55:41 PM
Industry leading face manipulation platform
manga-image-translator
https://github.com/zyddnys/manga-image-translatorupdated 12/17/2025, 1:56:05 AMindexed 1/27/2026, 9:34:02 PM
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
Deep-Live-Cam
https://github.com/hacksider/Deep-Live-Camupdated 12/15/2025, 7:50:08 PMindexed 1/29/2026, 10:19:48 AM
real time face swap and one-click video deepfake with only a single image
Audio Flamingo 3
https://github.com/PierrunoYT/Audio-Flamingo-3-Pinokiov1.0.0updated 12/15/2025, 4:41:12 PMindexed 1/23/2026, 7:45:54 PM
NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
ClearerVoice-Studio
https://github.com/gotoolkits/ClearerVoice-Studiov2.0updated 12/15/2025, 2:13:11 PMindexed 1/23/2026, 7:48:24 PM
Umo
https://github.com/linus74rn/UmoPinokiov1.0updated 12/15/2025, 7:41:31 AMindexed 1/23/2026, 7:47:30 PM
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
PreviousPage 12 / 36Next