Store

Tag:#aix

vid2pose

https://github.com/cocktailpeanutlabs/vid2posev1updated 3/20/2025, 3:59:35 AMindexed 1/20/2026, 9:11:13 AM

Video to Openpose & DWPose (All OS supported) https://github.com/sdbds/vid2pose

#ai #utility

MAGNeT

https://github.com/cocktailpeanutlabs/magnetv3.0updated 3/20/2025, 3:10:23 AMindexed 1/20/2026, 9:13:14 AM

MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md

#ai #audio-generation #music #musicgen #song #song-generation

InstantID

https://github.com/cocktailpeanutlabs/instantidv3.0updated 3/20/2025, 3:06:48 AMindexed 1/20/2026, 9:12:10 AM

state-of-the-art tuning-free method to achieve ID-Preserving generation with only single image, supporting various downstream tasks. https://instantid.github.io/

#image-generation #ai

PCM

https://github.com/pinokiofactory/pcmv3.0updated 3/20/2025, 3:06:05 AMindexed 1/20/2026, 9:12:35 AM

Phased Consistency Model - generate high quality images with 2 steps https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM

#ai #image-generation

BRIA RMBG

https://github.com/cocktailpeanutlabs/bria-rmbgv1.1updated 3/20/2025, 2:56:52 AMindexed 1/20/2026, 9:14:58 AM

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4

#ai #image-edit

[NVIDIA GPU ONLY] LGM

https://github.com/cocktailpeanutlabs/lgmv3.0updated 3/17/2025, 10:41:16 PMindexed 1/20/2026, 9:10:47 AM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation https://huggingface.co/spaces/ashawkey/LGM

#3d #3dgen #ai

MeloTTS

https://github.com/cocktailpeanutlabs/melottsv1.2updated 3/17/2025, 2:35:10 AMindexed 1/20/2026, 9:13:58 AM

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS

#ai #tts

remove-video-bg

https://github.com/cocktailpeanutlabs/remove-video-bgv1.2updated 3/17/2025, 2:34:24 AMindexed 1/20/2026, 9:12:44 AM

Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background

#ai #video-edit

dust3r

https://github.com/cocktailpeanutlabs/dust3rv1.3updated 3/17/2025, 2:33:40 AMindexed 1/20/2026, 9:13:14 AM

Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/

#3dgen #ai

differential-diffusion-ui

https://github.com/cocktailpeanutlabs/differential-diffusion-uiv1.2updated 3/17/2025, 2:32:35 AMindexed 1/20/2026, 9:15:20 AM

Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/

#ai #image-edit #image-generation

ZETA

https://github.com/cocktailpeanutlabs/zetav1.2updated 3/17/2025, 2:31:43 AMindexed 1/20/2026, 9:14:16 AM

Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing

#ai #audio-edit #audio-generation

Arc2Face

https://github.com/cocktailpeanutlabs/arc2facev1.5updated 3/17/2025, 2:24:34 AMindexed 1/20/2026, 9:11:01 AM

A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face

#ai #face

spright

https://github.com/cocktailpeanutlabs/sprightv1.5updated 3/17/2025, 2:20:18 AMindexed 1/20/2026, 9:13:34 AM

Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I

#ai #image-generation

CustomNet

https://github.com/cocktailpeanutlabs/customnetv1.5updated 3/17/2025, 2:19:48 AMindexed 1/20/2026, 9:12:56 AM

A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet

#ai

gligen

https://github.com/cocktailpeanutlabs/gligenv1.2updated 3/17/2025, 2:10:56 AMindexed 1/20/2026, 9:11:18 AM

An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui

#ai

face-to-all

https://github.com/cocktailpeanutlabs/face-to-allv1.5updated 3/17/2025, 1:43:17 AMindexed 1/20/2026, 9:14:32 AM

diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of https://huggingface.co/spaces/multimodalart/face-to-all

#ai

instantstyle

https://github.com/cocktailpeanutlabs/instantstylev1.5updated 3/17/2025, 1:34:48 AMindexed 1/20/2026, 9:10:33 AM

Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/InstantStyle

#ai #image-generation

parler-tts

https://github.com/cocktailpeanutlabs/parler-ttsv1.5updated 3/17/2025, 1:07:45 AMindexed 1/20/2026, 9:13:28 AM

a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini

#ai #tts

Openvoice2

https://github.com/cocktailpeanutlabs/openvoice2v3.0updated 3/17/2025, 12:46:07 AMindexed 1/20/2026, 9:12:07 AM

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS https://x.com/myshell_ai/status/1783161876052066793

#ai #tts

ZeST

https://github.com/cocktailpeanutlabs/zestv1.5updated 3/17/2025, 12:39:18 AMindexed 1/20/2026, 9:12:44 AM

ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)

#ai #image-edit