Store

https://github.com/cocktailpeanutlabs/sprightv1.5updated 3/17/2025, 2:20:18 AMindexed 1/6/2026, 6:17:30 AM

Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I

https://github.com/cocktailpeanutlabs/customnetv1.5updated 3/17/2025, 2:19:48 AMindexed 1/6/2026, 6:17:34 AM

A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet

Stable Cascade

https://github.com/cocktailpeanutlabs/stablecascadev3.0updated 3/17/2025, 2:15:31 AMindexed 1/6/2026, 6:17:06 AM

Stable Cascade from StabilityAI

gligen

https://github.com/cocktailpeanutlabs/gligenv1.2updated 3/17/2025, 2:10:56 AMindexed 1/6/2026, 6:19:47 AM

An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui

CosXL

https://github.com/cocktailpeanutlabs/cosxlv1.5updated 3/17/2025, 1:51:07 AMindexed 1/6/2026, 6:16:11 AM

Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI, https://huggingface.co/spaces/multimodalart/cosxl

face-to-all

https://github.com/cocktailpeanutlabs/face-to-allv1.5updated 3/17/2025, 1:43:17 AMindexed 1/6/2026, 6:15:34 AM

diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of https://huggingface.co/spaces/multimodalart/face-to-all

instantstyle

https://github.com/cocktailpeanutlabs/instantstylev1.5updated 3/17/2025, 1:34:48 AMindexed 1/6/2026, 6:19:02 AM

Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/InstantStyle

parler-tts

https://github.com/cocktailpeanutlabs/parler-ttsv1.5updated 3/17/2025, 1:07:45 AMindexed 1/6/2026, 6:19:03 AM

a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini

Openvoice2

https://github.com/cocktailpeanutlabs/openvoice2v3.0updated 3/17/2025, 12:46:07 AMindexed 1/6/2026, 6:16:43 AM

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

ZeST

https://github.com/cocktailpeanutlabs/zestv1.5updated 3/17/2025, 12:39:18 AMindexed 1/6/2026, 6:17:08 AM

ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)

LlamaFactory

https://github.com/pinokiofactory/llamafactoryv1.5updated 3/17/2025, 12:35:47 AMindexed 1/6/2026, 6:19:10 AM

Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory

StableAudio

https://github.com/pinokiofactory/stableaudiov1.5updated 3/17/2025, 12:31:08 AMindexed 1/6/2026, 6:17:03 AM

An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools

flashdiffusion

https://github.com/pinokiofactory/flashdiffusionv1.5updated 3/17/2025, 12:27:30 AMindexed 1/6/2026, 6:16:44 AM

Accelerating any conditional diffusion model for few steps image generation https://gojasper.github.io/flash-diffusion-project/

RC Stable Audio Tools

https://github.com/pinokiofactory/rc-stableaudiov2.0updated 3/17/2025, 12:05:09 AMindexed 1/6/2026, 6:16:30 AM

Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools

audiocraft_plus

https://github.com/pinokiofactory/audiocraft_plusv2.0updated 3/17/2025, 12:02:34 AMindexed 1/6/2026, 6:17:55 AM

AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top https://github.com/GrandaddyShmax/audiocraft_plus

flux-webui

https://github.com/pinokiofactory/flux-webuiv2.0updated 3/17/2025, 12:00:21 AMindexed 1/6/2026, 6:17:36 AM

Minimal Flux Web UI powered by Gradio & Diffusers (Flux Schnell + Flux Merged)

moshi

https://github.com/pinokiofactory/moshiv2.0updated 3/16/2025, 11:50:27 PMindexed 1/6/2026, 6:19:07 AM

[Mac only] a speech-text foundation model for real time dialogue https://github.com/kyutai-labs/moshi

devika

https://github.com/cocktailpeanutlabs/devikav3.0updated 3/8/2025, 7:33:17 PMindexed 1/6/2026, 6:17:08 AM

Agentic AI Software Engineer https://github.com/stitionai/devika

MagicAnimate

https://github.com/cocktailpeanut/MagicAnimate.pinokiov3.0updated 3/7/2025, 8:33:05 PMindexed 1/6/2026, 6:19:46 AM

[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/

Leffa

https://github.com/ai-anchorite/Leffav3.6updated 3/5/2025, 6:59:19 AMindexed 1/6/2026, 6:16:09 AM