Pinokio Registry

https://github.com/cocktailpeanut/hallucinatorv2.0updated 11/7/2024, 6:18:48 PMindexed 1/6/2026, 6:15:37 AM

[NVIDIA ONLY] Autocomplete any voice(s), powered by Hertz AI (Standard Intelligence)

https://github.com/SUP3RMASS1VE/Diffusers-Image-Outpaintingv3.6updated 3/30/2025, 8:00:10 PMindexed 1/6/2026, 6:15:39 AM

Moondream3 Gradio UI

https://github.com/PierrunoYT/moondream-3-pinokiov1.0.0updated 12/17/2025, 5:24:23 PMindexed 1/6/2026, 6:15:41 AM

A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.

StyleAligned

https://github.com/cocktailpeanut/StyleAligned.pinokiov3.0updated 3/23/2025, 2:37:06 AMindexed 1/6/2026, 6:15:44 AM

Style Aligned Image Generation via Shared Attention https://style-aligned-gen.github.io/

AudioLDM 2

https://github.com/cocktailpeanut/AudioLDM2.pinokioupdated 11/3/2023, 8:52:23 PMindexed 1/6/2026, 6:16:05 AM

[Nvidia GPU only] One click installer for AudioLDM 2 Gradio UI

DenseDiffusion

https://github.com/cocktailpeanut/densediffusion.pinokioupdated 8/6/2024, 3:54:11 AMindexed 1/6/2026, 6:16:07 AM

Dense Text-to-Image Generation with Attention Modulation

LocalAIVtuber

https://github.com/Feedjer/LocalAIVtuberv2.0updated 8/30/2024, 3:14:41 PMindexed 1/6/2026, 6:15:14 AM

A tool for hosting AI vtubers that runs fully locally and offline: https://github.com/0Xiaohei0/LocalAIVtuber

PuLID Gradio Demo

https://github.com/mr-szgz/pulidv3.7updated 7/19/2025, 7:21:57 PMindexed 1/6/2026, 6:15:20 AM

Fish-Speech

https://github.com/SUP3RMASS1VE/Fish-Speechv3.7updated 6/5/2025, 5:35:39 PMindexed 1/6/2026, 6:15:22 AM

florence-sam

https://github.com/pinokiofactory/florence-samv2.0updated 8/1/2024, 6:30:38 PMindexed 1/6/2026, 6:15:26 AM

Integrates Florence2 and SAM2 models for detailed image captioning and object detection. Florence2 generates detailed captions that are then used to perform phrase grounding. The Segment Anything Model 2 (SAM2) converts these phrase-grounded boxes into masks. https://huggingface.co/spaces/SkalskiP/florence-sam

XTTS

https://github.com/cocktailpeanut/xtts.pinokiov3.0updated 4/23/2025, 9:33:19 PMindexed 1/6/2026, 6:15:31 AM

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

BRIA-RMBG-2.0

https://github.com/ai-anchorite/BRIA-RMBG-2.0v2.0updated 11/13/2024, 4:37:14 PMindexed 1/6/2026, 6:15:32 AM

Companion App.Pinokio

https://github.com/cocktailpeanut/companion-app.pinokioupdated 7/14/2023, 5:19:11 PMindexed 1/6/2026, 6:16:09 AM

GLM-4-Voice

https://github.com/appotry/GLM4Voicev1.0updated 11/10/2024, 7:00:31 PMindexed 1/6/2026, 6:16:10 AM

GLM-4-Voice | 端到端中英语音对话模型

Bolt.new

https://github.com/gotoolkits/bolt.newv2.0updated 12/24/2024, 9:57:58 AMindexed 1/6/2026, 6:16:11 AM

AudioSep

https://github.com/cocktailpeanut/AudioSep.pinokiov2.0updated 2/26/2025, 11:48:02 PMindexed 1/6/2026, 6:16:11 AM

Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)

Bark Voice Cloning

https://github.com/cocktailpeanutlabs/barkv1.1updated 3/20/2025, 7:19:45 PMindexed 1/6/2026, 6:16:12 AM

Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning

fluxtrain

https://github.com/cocktailpeanutlabs/fluxtrainv2.0updated 9/4/2024, 9:57:47 AMindexed 1/6/2026, 6:16:14 AM

fluxgym

https://github.com/huqianghui/fluxgymv3.2updated 9/1/2025, 2:38:20 PMindexed 1/6/2026, 6:16:17 AM

[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)

Prototype

https://github.com/pinokiocomputer/prototypev4.0updated 7/1/2025, 3:29:55 PMindexed 1/6/2026, 6:16:17 AM