dreamtalk
https://github.com/cocktailpeanutlabs/dreamtalk v3.0 updated 3/23/2025, 12:46:56 AM indexed 1/6/2026, 6:18:27 AM
When Expressive Talking Head Generation Meets Diffusion Probabilistic Models (https://github.com/ali-vilab/dreamtalk)
StreamDiffusion
https://github.com/cocktailpeanutlabs/streamdiffusion v3.0 updated 3/23/2025, 12:17:49 AM indexed 1/6/2026, 6:14:49 AM
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation https://github.com/cumulo-autumn/StreamDiffusion
IP-Adapter-FaceID
https://github.com/cocktailpeanutlabs/faceid v3.0 updated 3/22/2025, 11:02:11 PM indexed 1/6/2026, 6:19:41 AM
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID
Moore-AnimateAnyone
https://github.com/cocktailpeanutlabs/moore-animateanyone v3.0 updated 3/22/2025, 10:48:57 PM indexed 1/6/2026, 6:18:16 AM
[NVIDIA GPU ONLY] Unofficial Implementation of Animate Anyone https://github.com/MooreThreads/Moore-AnimateAnyone
OpenVoice
https://github.com/cocktailpeanutlabs/openvoice v1 updated 3/22/2025, 10:34:34 PM indexed 1/6/2026, 6:17:31 AM
Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice
Moore-AnimateAnyone-Mini
https://github.com/cocktailpeanutlabs/moore-animateanyone-mini v3.0 updated 3/22/2025, 1:28:10 AM indexed 1/6/2026, 6:16:39 AM
[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size) https://github.com/sdbds/Moore-AnimateAnyone-for-windows
Bark Voice Cloning
https://github.com/cocktailpeanutlabs/bark v1.1 updated 3/20/2025, 7:19:45 PM indexed 1/6/2026, 6:16:12 AM
Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt, and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
VideoCrafter 2
https://github.com/cocktailpeanutlabs/videocrafter2 v3.0 updated 3/20/2025, 4:04:42 AM indexed 1/6/2026, 6:15:40 AM
[Runs fast on NVIDIA GPUs; works on M1/M2/M3 Macs, but slowly] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models. https://github.com/AILab-CVC/VideoCrafter
vid2pose
https://github.com/cocktailpeanutlabs/vid2pose v1 updated 3/20/2025, 3:59:35 AM indexed 1/6/2026, 6:17:35 AM
Video to Openpose & DWPose (All OS supported) https://github.com/sdbds/vid2pose
MAGNeT
https://github.com/cocktailpeanutlabs/magnet v3.0 updated 3/20/2025, 3:10:23 AM indexed 1/6/2026, 6:17:39 AM
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md
InstantID
https://github.com/cocktailpeanutlabs/instantid v3.0 updated 3/20/2025, 3:06:48 AM indexed 1/6/2026, 6:16:41 AM
State-of-the-art tuning-free method for ID-preserving generation from only a single image, supporting various downstream tasks. https://instantid.github.io/
PCM
https://github.com/pinokiofactory/pcm v3.0 updated 3/20/2025, 3:06:05 AM indexed 1/6/2026, 6:17:06 AM
Phased Consistency Model - generate high quality images with 2 steps https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM
BRIA RMBG
https://github.com/cocktailpeanutlabs/bria-rmbg v1.1 updated 3/20/2025, 2:56:52 AM indexed 1/6/2026, 6:15:29 AM
Background removal model developed by BRIA.AI, trained on a carefully selected dataset and available as an open-source model for non-commercial use https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4
[NVIDIA GPU ONLY] LGM
https://github.com/cocktailpeanutlabs/lgm v3.0 updated 3/17/2025, 10:41:16 PM indexed 1/6/2026, 6:18:16 AM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation https://huggingface.co/spaces/ashawkey/LGM
MeloTTS
https://github.com/cocktailpeanutlabs/melotts v1.2 updated 3/17/2025, 2:35:10 AM indexed 1/6/2026, 6:18:57 AM
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean https://github.com/myshell-ai/MeloTTS
remove-video-bg
https://github.com/cocktailpeanutlabs/remove-video-bg v1.2 updated 3/17/2025, 2:34:24 AM indexed 1/6/2026, 6:17:00 AM
Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background
dust3r
https://github.com/cocktailpeanutlabs/dust3r v1.3 updated 3/17/2025, 2:33:40 AM indexed 1/6/2026, 6:17:16 AM
Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/
differential-diffusion-ui
https://github.com/cocktailpeanutlabs/differential-diffusion-ui v1.2 updated 3/17/2025, 2:32:35 AM indexed 1/6/2026, 6:14:51 AM
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
ZETA
https://github.com/cocktailpeanutlabs/zeta v1.2 updated 3/17/2025, 2:31:43 AM indexed 1/6/2026, 6:19:10 AM
Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing
Arc2Face
https://github.com/cocktailpeanutlabs/arc2face v1.5 updated 3/17/2025, 2:24:34 AM indexed 1/6/2026, 6:17:17 AM
A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face