Store
Florence-2 Image Captioning
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
No fortress, purely open ground. OpenManus is Coming.
DiffuEraser is a diffusion model for video inpainting that achieves strong content completeness and temporal consistency while maintaining acceptable efficiency.
[Nvidia GPU only] High-Quality Image Restoration Following Human Instructions
diffusers-image-fill (Featured)
Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill
Contribute to cubiq/ComfyUI_InstantID development by creating an account on GitHub.
Contribute to cubiq/ComfyUI_IPAdapter_plus development by creating an account on GitHub.
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
Spark-TTS Inference Code
Upgraded repo with more capabilities: the command-line .py scripts now function more intuitively, 147 different depth output colour map methods have been added, batch image and video processing is supported, everything is automatically saved to an outputs folder (with file-naming conventions), and I've converted the .pth models to .safetensors.
Improving Diffusion Models for Authentic Virtual Try-on in the Wild https://huggingface.co/spaces/yisol/IDM-VTON
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
A Vietnamese Voice Cloning Text-to-Speech Model ✨
Slightly improved official version for fine-tuning XTTS
StoryDiffusion Comics (Featured)
Create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.
