Store
Florence-2 Image Captioning
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
[NVIDIA ONLY] High-Quality Image Restoration Following Human Instructions
diffusers-image-fill (Featured)
Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
Improving Diffusion Models for Authentic Virtual Try-on in the Wild https://huggingface.co/spaces/yisol/IDM-VTON
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
StoryDiffusion Comics (Featured)
Create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
Deep Hermes, but without the need for a system prompt. Responds autonomously based on its OWN judgment https://github.com/cocktailpeanut/deeperhermes
moondream2 (Featured)
a tiny vision language model that kicks ass and runs anywhere https://github.com/vikhyat/moondream
[NVIDIA ONLY] AllTalk-TTS is a unified UI for F5-TTS, XTTS, VITS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
AudioSep (Featured)
Separate Anything You Describe https://huggingface.co/spaces/Audio-AGI/AudioSep
Video2Openpose (Featured)
Turn any video into Openpose video https://huggingface.co/spaces/fffiloni/video2openpose2
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation https://github.com/cumulo-autumn/StreamDiffusion
IP-Adapter-FaceID (Featured)
Enter a face image and transform it into any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID
