Pinokio Registry

LCM Lora

https://github.com/cocktailpeanut/lcmlora | updated 11/16/2023, 11:12:34 PM | indexed 1/6/2026, 6:19:06 AM

moshi

https://github.com/pinokiofactory/moshi | v2.0 | updated 3/16/2025, 11:50:27 PM | indexed 1/6/2026, 6:19:07 AM

[Mac only] A speech-text foundation model for real-time dialogue https://github.com/kyutai-labs/moshi

SD-Next

https://github.com/SUP3RMASS1VE/SD-Next | v3.7 | updated 5/20/2025, 7:55:10 PM | indexed 1/6/2026, 6:19:09 AM

SD.Next: All-in-one WebUI for AI generative image and video creation

ZETA

https://github.com/cocktailpeanutlabs/zeta | v1.2 | updated 3/17/2025, 2:31:43 AM | indexed 1/6/2026, 6:19:10 AM

Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing

StepmediaHRM

https://github.com/tmone/sm-hrm | v3.7 | updated 5/14/2025, 7:27:44 PM | indexed 1/6/2026, 6:19:11 AM

VoxCPM

https://github.com/Paxurux/Voxcpm | v3.7 | updated 9/22/2025, 5:01:32 AM | indexed 1/6/2026, 6:19:12 AM

Voice Synthesis Platform with Smart Chunking, Batch Processing, and Voice Cloning capabilities.

Nemoml

https://github.com/6Morpheus6/Nemoml | v3.7 | updated 6/12/2025, 7:56:40 PM | indexed 1/6/2026, 6:19:13 AM

[NVIDIA ONLY] A minimal Gradio interface for Automatic Speech Recognition. Transcribes audio in the Malayalam language.

video-background-removal

https://github.com/pinokiofactory/video-background-removal | v2.0 | updated 10/11/2024, 5:35:51 PM | indexed 1/6/2026, 6:19:05 AM

Remove or change any video background https://huggingface.co/spaces/innova-ai/video-background-removal

AutoGen Studio

https://github.com/GivEN29/autogen-studio-pinokio | updated 3/10/2024, 7:50:03 AM | indexed 1/6/2026, 6:19:06 AM

Declaratively define and modify agents and multi-agent workflows through a point and click, drag and drop interface (e.g., you can select the parameters of two agents that will communicate to solve your task).

Kokoro-TTS-Local v0.19

https://github.com/SUP3RMASS1VE/Kokoro-TTS | v3.2 | updated 2/3/2025, 11:35:00 PM | indexed 1/6/2026, 6:19:14 AM

A local implementation of the Kokoro Text-to-Speech model

OminiControl

https://github.com/pinokiofactory/ominicontrol | v2.0 | updated 11/27/2024, 9:17:29 AM | indexed 1/6/2026, 6:19:15 AM

A minimal and universal controller for FLUX.1 https://github.com/Yuanshi9815/OminiControl

Bagel

https://github.com/6Morpheus6/bagel | v3.7 | updated 11/5/2025, 8:32:00 AM | indexed 1/6/2026, 6:19:16 AM

[NVIDIA ONLY] [RTX 50 Support] Image generation, image editing and free-form manipulation with a VLM (Minimum requirements: 12GB VRAM / 32GB RAM. Recommended requirements: 24GB VRAM / 48GB RAM.)

FBCNN

https://github.com/KenjieDec/FBCNN-Pinokio | v1.0 | updated 12/25/2025, 8:30:59 PM | indexed 1/6/2026, 6:19:07 AM

Remove JPEG compression artifacts from images using the FBCNN model

lavie

https://github.com/cocktailpeanut/lavie.pinokio | updated 8/6/2024, 3:31:57 AM | indexed 1/6/2026, 6:19:07 AM

Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie

LlamaFactory

https://github.com/pinokiofactory/llamafactory | v1.5 | updated 3/17/2025, 12:35:47 AM | indexed 1/6/2026, 6:19:10 AM

Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory

macOS-use

https://github.com/pinokiofactory/macOS-use | v3.6 | updated 4/1/2025, 4:14:57 AM | indexed 1/6/2026, 6:19:10 AM

[Mac Only] We make AI agents that control Mac apps: https://github.com/browser-use/macOS-use

TRELLIS

https://github.com/pinokiofactory/TRELLIS | v3.2 | updated 5/7/2025, 11:10:25 PM | indexed 1/6/2026, 6:19:11 AM

DatasetHelpers

https://github.com/Feedjer/DatasetHelpers | v2.0 | updated 8/30/2024, 9:53:59 AM | indexed 1/6/2026, 6:19:12 AM

Clarity Refiners UI

https://github.com/pinokiofactory/clarity-refiners-ui | v3.7 | updated 12/2/2025, 10:18:27 PM | indexed 1/6/2026, 6:19:13 AM

An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)

PRX-1024 Text-to-Image

https://github.com/PierrunoYT/Photoroom-PRX-Pinokio | v1.0.0 | updated 11/17/2025, 3:29:31 PM | indexed 1/6/2026, 6:19:16 AM

Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model