Store

MFLUX-WEBUI

https://github.com/pinokiofactory/MFLUX-WEBUIv2.1updated 12/15/2025, 2:06:08 AMindexed 1/6/2026, 6:16:42 AM

[MAC ONLY] A powerful and user-friendly web interface for FLUX, powered by MLX and Gradio via MFLUX

#ai #flux

Puter Model Emulator

https://github.com/amondeuz/puter-model-emulatorv4.0updated 12/14/2025, 8:14:49 PMindexed 1/6/2026, 6:17:12 AM

Resemble Enhance

https://github.com/sealad886/pinokio-resemble-enhancev2.0updated 12/13/2025, 11:46:00 PMindexed 1/6/2026, 6:16:40 AM

AI-powered speech denoising + enhancement (Gradio web demo + CLI).

GLM-TTS

https://github.com/PierrunoYT/GLM-TTS-Pinokiov1.0.0updated 12/13/2025, 8:56:50 AMindexed 1/6/2026, 6:17:38 AM

🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.

Forge Neo

https://github.com/6Morpheus6/forge-neov2.0updated 12/12/2025, 10:30:40 PMindexed 1/6/2026, 6:17:14 AM

[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo

MuseTalk

https://github.com/manat0912/TalkingMusev3.7updated 12/12/2025, 9:24:47 AMindexed 1/6/2026, 6:18:55 AM

Ultimate-TTS-Studio-SUP3R-Edition

https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition-Pinokiov3.7updated 12/10/2025, 10:35:56 PMindexed 1/6/2026, 6:16:51 AM

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

Chattered

https://github.com/6Morpheus6/Chatteredv3.7updated 12/9/2025, 5:50:02 AMindexed 1/6/2026, 6:16:15 AM

All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech generation with configurable parameters. (Minimum Requirements 4GB VRAM / Recommended Requirements 8GB VRAM)

Dia

https://github.com/pinokiofactory/diav3.7updated 12/7/2025, 7:54:59 PMindexed 1/6/2026, 6:16:57 AM

Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia

#ai #audio

Ollama Web Interface

https://github.com/JL-Bones/Ollama_Webupdated 12/6/2025, 10:59:27 PMindexed 1/6/2026, 6:19:40 AM

A web interface for managing and interacting with Ollama models

zonos

https://github.com/pinokiofactory/zonosv3.7updated 12/6/2025, 10:44:22 PMindexed 1/6/2026, 6:14:46 AM

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #audio

bolt.diy

https://github.com/pinokiofactory/boltv3.4.0updated 12/6/2025, 9:59:32 PMindexed 1/6/2026, 6:17:33 AM

Prompt, run, edit, and deploy full-stack web apps. https://github.com/stackblitz-labs/bolt.diy

#ai #coding

ACE-Step

https://github.com/pinokiofactory/ACE-Stepv3.7updated 12/6/2025, 11:34:57 AMindexed 1/6/2026, 6:16:56 AM

A Step Towards Music Generation Foundation Model

echomimic2

https://github.com/pinokiofactory/echomimic2v3.7updated 12/6/2025, 5:47:56 AMindexed 1/6/2026, 6:19:17 AM

[NVIDIA ONLY] Make virtual avatars talk whatever you want with an image and an audio clip https://github.com/antgroup/echomimic_v2

DiffRhythm

https://github.com/pinokiofactory/diffrhythmv3.7updated 12/5/2025, 1:50:16 AMindexed 1/6/2026, 6:16:19 AM

Generate songs with AI (up to 4 min 45 sec). Both with lyrics or instrumental https://github.com/ASLP-lab/DiffRhythm

#ai #music

Wan2GP

https://github.com/6Morpheus6/wan2gpv3.7updated 12/4/2025, 8:06:23 PMindexed 1/6/2026, 6:20:04 AM

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

pyramidflow

https://github.com/pinokiofactory/pyramidflowv3.7updated 12/4/2025, 6:27:40 PMindexed 1/6/2026, 6:16:35 AM

Pyramd Flow Video Generation AI (text-to-video & image-to-video) https://github.com/jy0205/Pyramid-Flow

#ai #video

Wan2GP

https://github.com/pinokiofactory/wanv3.7updated 12/4/2025, 5:35:10 PMindexed 1/6/2026, 6:19:26 AM

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#ai #video

Invoke

https://github.com/pinokiofactory/invokev3.7updated 12/4/2025, 5:43:02 AMindexed 1/6/2026, 6:20:03 AM

The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI

#ai #image

Allegro-txt2vid

https://github.com/pinokiofactory/Allegro-txt2vid-installv3.7updated 12/4/2025, 5:36:20 AMindexed 1/6/2026, 6:19:15 AM

[NVIDIA ONLY] Generate videos with Allegro txt2vid model https://github.com/rhymes-ai/Allegro

#ai #video