MFLUX-WEBUI
https://github.com/pinokiofactory/MFLUX-WEBUIv2.1updated 12/15/2025, 2:06:08 AMindexed 1/6/2026, 6:16:42 AM
[MAC ONLY] A powerful and user-friendly web interface for FLUX, powered by MLX and Gradio via MFLUX
Puter Model Emulator
https://github.com/amondeuz/puter-model-emulatorv4.0updated 12/14/2025, 8:14:49 PMindexed 1/6/2026, 6:17:12 AM
Resemble Enhance
https://github.com/sealad886/pinokio-resemble-enhancev2.0updated 12/13/2025, 11:46:00 PMindexed 1/6/2026, 6:16:40 AM
AI-powered speech denoising + enhancement (Gradio web demo + CLI).
GLM-TTS
https://github.com/PierrunoYT/GLM-TTS-Pinokiov1.0.0updated 12/13/2025, 8:56:50 AMindexed 1/6/2026, 6:17:38 AM
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
Forge Neo
https://github.com/6Morpheus6/forge-neov2.0updated 12/12/2025, 10:30:40 PMindexed 1/6/2026, 6:17:14 AM
[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
MuseTalk
https://github.com/manat0912/TalkingMusev3.7updated 12/12/2025, 9:24:47 AMindexed 1/6/2026, 6:18:55 AM
Ultimate-TTS-Studio-SUP3R-Edition
https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition-Pinokiov3.7updated 12/10/2025, 10:35:56 PMindexed 1/6/2026, 6:16:51 AM
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Chattered
https://github.com/6Morpheus6/Chatteredv3.7updated 12/9/2025, 5:50:02 AMindexed 1/6/2026, 6:16:15 AM
All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech generation with configurable parameters. (Minimum Requirements 4GB VRAM / Recommended Requirements 8GB VRAM)
Dia
https://github.com/pinokiofactory/diav3.7updated 12/7/2025, 7:54:59 PMindexed 1/6/2026, 6:16:57 AM
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
Ollama Web Interface
https://github.com/JL-Bones/Ollama_Webupdated 12/6/2025, 10:59:27 PMindexed 1/6/2026, 6:19:40 AM
A web interface for managing and interacting with Ollama models
zonos
https://github.com/pinokiofactory/zonosv3.7updated 12/6/2025, 10:44:22 PMindexed 1/6/2026, 6:14:46 AM
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
bolt.diy
https://github.com/pinokiofactory/boltv3.4.0updated 12/6/2025, 9:59:32 PMindexed 1/6/2026, 6:17:33 AM
Prompt, run, edit, and deploy full-stack web apps. https://github.com/stackblitz-labs/bolt.diy
ACE-Step
https://github.com/pinokiofactory/ACE-Stepv3.7updated 12/6/2025, 11:34:57 AMindexed 1/6/2026, 6:16:56 AM
A Step Towards Music Generation Foundation Model
echomimic2
https://github.com/pinokiofactory/echomimic2v3.7updated 12/6/2025, 5:47:56 AMindexed 1/6/2026, 6:19:17 AM
[NVIDIA ONLY] Make virtual avatars talk whatever you want with an image and an audio clip https://github.com/antgroup/echomimic_v2
DiffRhythm
https://github.com/pinokiofactory/diffrhythmv3.7updated 12/5/2025, 1:50:16 AMindexed 1/6/2026, 6:16:19 AM
Generate songs with AI (up to 4 min 45 sec). Both with lyrics or instrumental https://github.com/ASLP-lab/DiffRhythm
Wan2GP
https://github.com/6Morpheus6/wan2gpv3.7updated 12/4/2025, 8:06:23 PMindexed 1/6/2026, 6:20:04 AM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
pyramidflow
https://github.com/pinokiofactory/pyramidflowv3.7updated 12/4/2025, 6:27:40 PMindexed 1/6/2026, 6:16:35 AM
Pyramd Flow Video Generation AI (text-to-video & image-to-video) https://github.com/jy0205/Pyramid-Flow
Wan2GP
https://github.com/pinokiofactory/wanv3.7updated 12/4/2025, 5:35:10 PMindexed 1/6/2026, 6:19:26 AM
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Invoke
https://github.com/pinokiofactory/invokev3.7updated 12/4/2025, 5:43:02 AMindexed 1/6/2026, 6:20:03 AM
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI
Allegro-txt2vid
https://github.com/pinokiofactory/Allegro-txt2vid-installv3.7updated 12/4/2025, 5:36:20 AMindexed 1/6/2026, 6:19:15 AM
[NVIDIA ONLY] Generate videos with Allegro txt2vid model https://github.com/rhymes-ai/Allegro
PreviousPage 3 / 18Next