Store

https://github.com/6Morpheus6/wan2gp-amd | v3.7 | updated 12/17/2025, 7:45:09 PM | indexed 1/6/2026, 6:15:24 AM

[AMD ONLY] Super-optimized Gradio UI for AI video creation on GPU-poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows, supported on 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo.)

Moondream3 Gradio UI

https://github.com/PierrunoYT/moondream-3-pinokio | v1.0.0 | updated 12/17/2025, 5:24:23 PM | indexed 1/6/2026, 6:15:41 AM

A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.

MagicQuill

https://github.com/pinokiofactory/MagicQuill | v3.7 | updated 12/17/2025, 5:51:58 AM | indexed 1/6/2026, 6:18:31 AM

An intelligent, interactive Image Editing System. Easily erase and add objects on a user-friendly interface.

#ai #image

chatterbox

https://github.com/Blizaine/chatterbox-Turbo | v3.7 | updated 12/15/2025, 8:42:23 PM | indexed 1/6/2026, 6:15:34 AM

Audio Flamingo 3

https://github.com/PierrunoYT/Audio-Flamingo-3-Pinokio | v1.0.0 | updated 12/15/2025, 4:41:03 PM | indexed 1/6/2026, 6:16:53 AM

NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface

MFLUX-WEBUI

https://github.com/pinokiofactory/MFLUX-WEBUI | v2.1 | updated 12/15/2025, 2:06:08 AM | indexed 1/6/2026, 6:16:42 AM

[MAC ONLY] A powerful and user-friendly web interface for FLUX, powered by MLX and Gradio via MFLUX

#ai #flux

Puter Model Emulator

https://github.com/amondeuz/puter-model-emulator | v4.0 | updated 12/14/2025, 8:14:49 PM | indexed 1/6/2026, 6:17:12 AM

Resemble Enhance

https://github.com/sealad886/pinokio-resemble-enhance | v2.0 | updated 12/13/2025, 11:46:00 PM | indexed 1/6/2026, 6:16:40 AM

AI-powered speech denoising + enhancement (Gradio web demo + CLI).

GLM-TTS

https://github.com/PierrunoYT/GLM-TTS-Pinokio | v1.0.0 | updated 12/13/2025, 8:56:50 AM | indexed 1/6/2026, 6:17:38 AM

🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.

Forge Neo

https://github.com/6Morpheus6/forge-neo | v2.0 | updated 12/12/2025, 10:30:40 PM | indexed 1/6/2026, 6:17:14 AM

[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, Wan, Nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo

MuseTalk

https://github.com/manat0912/TalkingMuse | v3.7 | updated 12/12/2025, 9:24:47 AM | indexed 1/6/2026, 6:18:55 AM

Ultimate-TTS-Studio-SUP3R-Edition

https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition-Pinokio | v3.7 | updated 12/10/2025, 10:35:56 PM | indexed 1/6/2026, 6:16:51 AM

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

Chattered

https://github.com/6Morpheus6/Chattered | v3.7 | updated 12/9/2025, 5:50:02 AM | indexed 1/6/2026, 6:16:15 AM

All-in-one Gradio interface for Chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content, and real-time speech generation with configurable parameters. (Minimum requirements: 4GB VRAM / Recommended requirements: 8GB VRAM)

Dia

https://github.com/pinokiofactory/dia | v3.7 | updated 12/7/2025, 7:54:59 PM | indexed 1/6/2026, 6:16:57 AM

Dia is a 1.6B-parameter text-to-speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal sounds such as laughter, coughing, and throat clearing. https://github.com/nari-labs/dia

#ai #audio

Ollama Web Interface

https://github.com/JL-Bones/Ollama_Web | updated 12/6/2025, 10:59:27 PM | indexed 1/6/2026, 6:19:40 AM

A web interface for managing and interacting with Ollama models

zonos

https://github.com/pinokiofactory/zonos | v3.7 | updated 12/6/2025, 10:44:22 PM | indexed 1/6/2026, 6:14:46 AM

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with, or even surpassing, top TTS providers. https://github.com/Zyphra/Zonos

#ai #audio

bolt.diy

https://github.com/pinokiofactory/bolt | v3.4.0 | updated 12/6/2025, 9:59:32 PM | indexed 1/6/2026, 6:17:33 AM

Prompt, run, edit, and deploy full-stack web apps. https://github.com/stackblitz-labs/bolt.diy

#ai #coding

ACE-Step

https://github.com/pinokiofactory/ACE-Step | v3.7 | updated 12/6/2025, 11:34:57 AM | indexed 1/6/2026, 6:16:56 AM

A Step Towards Music Generation Foundation Model

echomimic2

https://github.com/pinokiofactory/echomimic2 | v3.7 | updated 12/6/2025, 5:47:56 AM | indexed 1/6/2026, 6:19:17 AM

[NVIDIA ONLY] Make virtual avatars say whatever you want using an image and an audio clip. https://github.com/antgroup/echomimic_v2

DiffRhythm

https://github.com/pinokiofactory/diffrhythm | v3.7 | updated 12/5/2025, 1:50:16 AM | indexed 1/6/2026, 6:16:19 AM

Generate songs with AI (up to 4 min 45 sec), either with lyrics or as instrumentals. https://github.com/ASLP-lab/DiffRhythm

#ai #music