OmniSVG
[NVIDIA ONLY] End-to-end multimodal SVG generator capable of producing complex, detailed SVGs, from simple icons to intricate anime characters. (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 24GB RAM)
Open WebUI
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui
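To illustrate what "OpenAI-compatible API" means for a local runner such as Ollama, here is a minimal Python sketch using the official openai client pointed at a local endpoint; the base URL, API key placeholder, and model name are assumptions to adjust for your own setup.

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint (assumed default port)
    api_key="not-needed-locally",          # local runners typically ignore the key
)

response = client.chat.completions.create(
    model="llama3",  # any model pulled locally (assumption)
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```

The same client code works unchanged against OpenAI's hosted API or any other compatible server; only base_url and model change.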
Orpheus-TTS-FastAPI
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
Z-Image-Turbo
⚡️ Efficient 6B parameter image generation model with sub-second inference. Generate high-quality, photorealistic images with only 8 inference steps. Features bilingual text rendering (Chinese & English) and Single-Stream Diffusion Transformer architecture.
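A hedged sketch of what 8-step generation looks like with the Hugging Face diffusers library; the model id and automatic pipeline resolution are assumptions, so check the model card for the exact loading code.

```python
import torch
from diffusers import DiffusionPipeline

# Assumed model id; the actual Hugging Face repo name may differ.
pipe = DiffusionPipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",
    torch_dtype=torch.bfloat16,
).to("cuda")

# The entry above advertises high-quality results in only 8 inference steps.
image = pipe(
    prompt="a photorealistic street scene at dusk, neon signs in Chinese and English",
    num_inference_steps=8,
).images[0]
image.save("z_image_turbo.png")
```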
RVC
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
kohya_ss
1 Click Installer for kohya_ss, a Stable Diffusion LoRA & DreamBooth training WebUI (https://github.com/bmaltais/kohya_ss)
MagicAnimate
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
Realtime BakLLaVA
llama.cpp with the BakLLaVA model describes what it sees in real time (https://github.com/Fuzzy-Search/realtime-bakllava)
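The linked repo wires llama.cpp to a webcam feed; as a rough illustration of the same idea, here is a sketch using the llama-cpp-python bindings with a LLaVA-1.5-style chat handler to describe a single captured frame. The GGUF filenames and the frame path are assumptions.

```python
import base64
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

def to_data_uri(path: str) -> str:
    # Encode a local image so it can be passed as an image_url message part.
    with open(path, "rb") as f:
        return "data:image/jpeg;base64," + base64.b64encode(f.read()).decode()

# Filenames are assumptions: a BakLLaVA GGUF plus its CLIP projector file.
chat_handler = Llava15ChatHandler(clip_model_path="mmproj-bakllava-f16.gguf")
llm = Llama(model_path="bakllava-1.Q4_K_M.gguf", chat_handler=chat_handler, n_ctx=2048)

response = llm.create_chat_completion(
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": to_data_uri("frame.jpg")}},
            {"type": "text", "text": "Describe what you see in this frame."},
        ],
    }]
)
print(response["choices"][0]["message"]["content"])
```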
MLX-Video-Transcription
[Mac Only] Super Fast MLX Powered Video Transcription https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator/ by https://x.com/RayFernando1337
gligen
An intuitive GUI for GLIGEN that uses ComfyUI as its backend https://github.com/mut-ex/gligen-gui
Re-Forge
Stable Diffusion WebUI reForge is a platform built on top of Stable Diffusion WebUI (based on Gradio) that makes development easier, optimizes resource management, and speeds up inference. https://github.com/Panchovix/stable-diffusion-webui-reForge
UVR5-WebUI
Ultimate Vocal Remover, a free and open-source application for removing vocals and separating audio stems.
omnigen
A unified image generation model that can perform a variety of tasks, including text-to-image generation, subject-driven generation, identity-preserving generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen
ModelScope Video2Video (Nvidia GPU only)
Enhances the resolution and spatiotemporal continuity of videos generated from text or images
Moondream1
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
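A short usage sketch following the moondream1 model card's remote-code interface (encode_image / answer_question); treat the exact method names and the tokenizer class as assumptions and defer to the model card.

```python
from PIL import Image
from transformers import AutoModelForCausalLM, CodeGenTokenizerFast

model_id = "vikhyatk/moondream1"
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
tokenizer = CodeGenTokenizerFast.from_pretrained(model_id)

image = Image.open("example.jpg")
enc_image = model.encode_image(image)   # SigLIP vision tower encodes the image
answer = model.answer_question(enc_image, "What is in this picture?", tokenizer)
print(answer)
```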
HY-MT1.5
🌐 Hunyuan Translation Model Version 1.5 - Supporting mutual translation across 33 languages. Features HY-MT1.5-1.8B (fast) and HY-MT1.5-7B (accurate) models with terminology intervention, contextual translation, and formatted translation support.
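A hedged transformers sketch of prompting the smaller translation checkpoint; the Hugging Face model id and the plain instruction-style prompt are assumptions, so follow the official model card for the exact format (including how terminology intervention and contextual translation are specified).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tencent/HY-MT1.5-1.8B"  # assumed repo id for the fast 1.8B variant
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True, device_map="auto")

messages = [{"role": "user", "content": "Translate the following Chinese text into English:\n\n你好，世界！"}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```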
