Pinokio
Explore tags
dust3r
https://github.com/cocktailpeanutlabs/dust3rv1.3updated 3/17/2025, 2:33:40 AMindexed 1/20/2026, 9:13:14 AM
Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/
differential-diffusion-ui
https://github.com/cocktailpeanutlabs/differential-diffusion-uiv1.2updated 3/17/2025, 2:32:35 AMindexed 1/20/2026, 9:15:20 AM
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
ZETA
https://github.com/cocktailpeanutlabs/zetav1.2updated 3/17/2025, 2:31:43 AMindexed 1/20/2026, 9:14:16 AM
Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing
Arc2Face
https://github.com/cocktailpeanutlabs/arc2facev1.5updated 3/17/2025, 2:24:34 AMindexed 1/20/2026, 9:11:01 AM
A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face
spright
https://github.com/cocktailpeanutlabs/sprightv1.5updated 3/17/2025, 2:20:18 AMindexed 1/20/2026, 9:13:34 AM
Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I
CustomNet
https://github.com/cocktailpeanutlabs/customnetv1.5updated 3/17/2025, 2:19:48 AMindexed 1/20/2026, 9:12:56 AM
A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet
Stable Cascade
https://github.com/cocktailpeanutlabs/stablecascadev3.0updated 3/17/2025, 2:15:31 AMindexed 1/20/2026, 9:12:45 AM
Stable Cascade from StabilityAI
gligen
https://github.com/cocktailpeanutlabs/gligenv1.2updated 3/17/2025, 2:10:56 AMindexed 1/20/2026, 9:11:18 AM
An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui
CosXL
https://github.com/cocktailpeanutlabs/cosxlv1.5updated 3/17/2025, 1:51:07 AMindexed 1/20/2026, 9:12:53 AM
Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI, https://huggingface.co/spaces/multimodalart/cosxl
face-to-all
https://github.com/cocktailpeanutlabs/face-to-allv1.5updated 3/17/2025, 1:43:17 AMindexed 1/20/2026, 9:14:32 AM
diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of https://huggingface.co/spaces/multimodalart/face-to-all
instantstyle
https://github.com/cocktailpeanutlabs/instantstylev1.5updated 3/17/2025, 1:34:48 AMindexed 1/20/2026, 9:10:33 AM
Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/InstantStyle
parler-tts
https://github.com/cocktailpeanutlabs/parler-ttsv1.5updated 3/17/2025, 1:07:51 AMindexed 1/30/2026, 12:53:16 PM
a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini
ZeST
https://github.com/cocktailpeanutlabs/zestv1.5updated 3/17/2025, 12:39:18 AMindexed 1/20/2026, 9:12:44 AM
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)
LlamaFactory
https://github.com/pinokiofactory/llamafactoryv1.5updated 3/17/2025, 12:35:47 AMindexed 1/20/2026, 9:14:08 AM
Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory
StableAudio
https://github.com/pinokiofactory/stableaudiov1.5updated 3/17/2025, 12:31:08 AMindexed 1/20/2026, 9:12:13 AM
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
flashdiffusion
https://github.com/pinokiofactory/flashdiffusionv1.5updated 3/17/2025, 12:27:30 AMindexed 1/20/2026, 9:12:00 AM
Accelerating any conditional diffusion model for few steps image generation https://gojasper.github.io/flash-diffusion-project/
RC Stable Audio Tools
https://github.com/pinokiofactory/rc-stableaudiov2.0updated 3/17/2025, 12:05:09 AMindexed 1/20/2026, 9:11:55 AM
Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools
audiocraft_plus
https://github.com/pinokiofactory/audiocraft_plusv2.0updated 3/17/2025, 12:02:34 AMindexed 1/20/2026, 9:13:22 AM
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top https://github.com/GrandaddyShmax/audiocraft_plus
moshi
https://github.com/pinokiofactory/moshiv2.0updated 3/16/2025, 11:50:27 PMindexed 1/20/2026, 9:14:32 AM
[Mac only] a speech-text foundation model for real time dialogue https://github.com/kyutai-labs/moshi
GitHub - voipnuggets/flux-generator: Local image and music generation for Apple Silicon
https://github.com/voipnuggets/flux-generatorupdated 3/10/2025, 2:18:48 PMindexed 1/27/2026, 4:52:22 AM
Local image and music generation for Apple Silicon - GitHub - voipnuggets/flux-generator: Local image and music generation for Apple Silicon
PreviousPage 30 / 39Next