Explore tags
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
No fortress, purely open ground. OpenManus is Coming.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
[Nvidia GPU only] High-Quality Image Restoration Following Human Instructions
Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill
Contribute to cubiq/ComfyUI_InstantID development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Contribute to cubiq/ComfyUI_IPAdapter_plus development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
An enhanced version of Fooocus giving you access to all of the latest AI image generation models
Spark-TTS Inference Code
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video processing, everything is automatically saved to an outputs folder (w/ file-naming conventions) & I've converted the .pth models to .safetensors.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Improving Diffusion Models for Authentic Virtual Try-on in the Wild https://huggingface.co/spaces/yisol/IDM-VTON
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
A Vietnamese Voice Cloning Text-to-Speech Model ✨
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Slightly improved official version for finetune xtts
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
browser-useFeatured
Run AI Agent in your browser. https://github.com/browser-use/web-ui