Pinokio
Bark Voice Cloning
https://github.com/cocktailpeanutlabs/bark | v1.1 | updated 3/20/2025, 7:19:45 PM | indexed 1/20/2026, 9:11:47 AM
Upload a clean 20-second WAV file of the voice you want to mimic, type your text-to-speech prompt, and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
Curated by @pinokio
gligen
https://github.com/cocktailpeanutlabs/gligen | v1.2 | updated 3/17/2025, 2:10:56 AM | indexed 1/20/2026, 9:11:18 AM
An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui
Curated by @pinokio
MeloTTS
https://github.com/cocktailpeanutlabs/melotts | v1.2 | updated 3/17/2025, 2:35:10 AM | indexed 1/20/2026, 9:13:58 AM
High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean. https://github.com/myshell-ai/MeloTTS
Curated by @pinokio
remove-video-bg
https://github.com/cocktailpeanutlabs/remove-video-bg | v1.2 | updated 3/17/2025, 2:34:24 AM | indexed 1/20/2026, 9:12:44 AM
Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background
Curated by @pinokio
Chatbot-Ollama
https://github.com/cocktailpeanutlabs/chatbot-ollama | v1.2 | updated 8/5/2024, 11:37:13 PM | indexed 1/20/2026, 9:12:40 AM
Open-source chat UI for Ollama https://github.com/ivanfioravanti/chatbot-ollama
Curated by @pinokio
dust3r
https://github.com/cocktailpeanutlabs/dust3r | v1.3 | updated 3/17/2025, 2:33:40 AM | indexed 1/20/2026, 9:13:14 AM
Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/
Curated by @pinokio
differential-diffusion-ui
https://github.com/cocktailpeanutlabs/differential-diffusion-ui | v1.2 | updated 3/17/2025, 2:32:35 AM | indexed 1/20/2026, 9:15:20 AM
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
Curated by @pinokio
ZETA
https://github.com/cocktailpeanutlabs/zeta | v1.2 | updated 3/17/2025, 2:31:43 AM | indexed 1/20/2026, 9:14:16 AM
Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing
Curated by @pinokio
moondream2
https://github.com/cocktailpeanutlabs/moondream2 | v3.0 | updated 3/31/2025, 7:49:18 PM | indexed 1/20/2026, 9:12:38 AM
a tiny vision language model that kicks ass and runs anywhere https://github.com/vikhyat/moondream
Curated by @pinokio
supir
https://github.com/cocktailpeanutlabs/supir | v1.2 | updated 3/30/2025, 6:02:54 AM | indexed 1/20/2026, 9:13:46 AM
[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life https://supir.xpixel.group
Curated by @pinokio
Arc2Face
https://github.com/cocktailpeanutlabs/arc2face | v1.5 | updated 3/17/2025, 2:24:34 AM | indexed 1/20/2026, 9:11:01 AM
A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face
Curated by @pinokio
brushnet
https://github.com/cocktailpeanutlabs/brushnet | v3.0 | updated 4/21/2025, 4:12:20 PM | indexed 1/20/2026, 9:14:06 AM
A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion https://huggingface.co/spaces/TencentARC/BrushNet
Curated by @pinokio
spright
https://github.com/cocktailpeanutlabs/spright | v1.5 | updated 3/17/2025, 2:20:18 AM | indexed 1/20/2026, 9:13:34 AM
Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I
Curated by @pinokio
CustomNet
https://github.com/cocktailpeanutlabs/customnet | v1.5 | updated 3/17/2025, 2:19:48 AM | indexed 1/20/2026, 9:12:56 AM
A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet
Curated by @pinokio
face-to-all
https://github.com/cocktailpeanutlabs/face-to-all | v1.5 | updated 3/17/2025, 1:43:17 AM | indexed 1/20/2026, 9:14:32 AM
diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI). A local version of https://huggingface.co/spaces/multimodalart/face-to-all
Curated by @pinokio
instantstyle
https://github.com/cocktailpeanutlabs/instantstyle | v1.5 | updated 3/17/2025, 1:34:48 AM | indexed 1/20/2026, 9:10:33 AM
Upload an image and generate new images in its style. Instant generation with no LoRA required. https://huggingface.co/spaces/InstantX/InstantStyle
Curated by @pinokio
parler-tts
https://github.com/cocktailpeanutlabs/parler-tts | v1.5 | updated 3/17/2025, 1:07:45 AM | indexed 1/20/2026, 9:13:28 AM
A lightweight text-to-speech (TTS) model that generates high-quality speech whose features (e.g. gender, background noise, speaking rate, pitch, and reverberation) can be controlled with a simple text prompt. https://huggingface.co/spaces/parler-tts/parler_tts_mini
Curated by @pinokio
Lobe Chat
https://github.com/cocktailpeanutlabs/lobe | v2.0 | updated 3/31/2025, 4:37:26 PM | indexed 1/20/2026, 9:13:43 AM
An open-source, modern-design ChatGPT/LLM UI and framework. Supports speech synthesis, multimodal input, and an extensible (function call) plugin system. https://github.com/lobehub/lobe-chat
Curated by @pinokio
Openvoice2
https://github.com/cocktailpeanutlabs/openvoice2 | v3.0 | updated 3/17/2025, 12:46:07 AM | indexed 1/20/2026, 9:12:07 AM
Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793
Curated by @pinokio
ZeST
https://github.com/cocktailpeanutlabs/zest | v1.5 | updated 3/17/2025, 12:39:18 AM | indexed 1/20/2026, 9:12:44 AM
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)
Curated by @pinokio