Explore tags
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
[Nvidia GPU only] High-Quality Image Restoration Following Human Instructions
Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
Improving Diffusion Models for Authentic Virtual Try-on in the Wild https://huggingface.co/spaces/yisol/IDM-VTON
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
browser-useFeatured
Run AI Agent in your browser. https://github.com/browser-use/web-ui
deep hermes, but without the need for a system prompt. Autonomously responds based on its OWN judgment https://github.com/cocktailpeanut/deeperhermes
macOS-useFeatured
[Mac Only] We make AI agents that control Mac apps: https://github.com/browser-use/macOS-use
moondream2Featured
a tiny vision language model that kicks ass and runs anywhere https://github.com/vikhyat/moondream
[NVIDIA ONLY] AllTalk-TTS is a unified UI for E5-TTS, XTTS, Vite TTS, Piper TTS, Parler TTS and RVC, based on CoquiTTS, including a finetune mode.
AudioSepFeatured
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
Turn any video into Openpose video https://huggingface.co/spaces/fffiloni/video2openpose2
Turn any video into Openpose video https://huggingface.co/spaces/fffiloni/video2openpose2
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation https://github.com/cumulo-autumn/StreamDiffusion
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID