Store
Explore tags
A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
Minimal Flux Web UI powered by Gradio & Diffusers (Flux Schnell + Flux Merged)
Contribute to schrojunzhang/KarmaDock development by creating an account on GitHub.
[NVIDIA ONLY] a state-of-the-art open-source model for fast feedforward 3D mesh reconstruction from a single image, from Stability AI. https://huggingface.co/spaces/stabilityai/stable-fast-3d
Minimal Flux Web UI powered by Gradio & Diffusers
Phased Consistency Model - generate high quality images with 2 steps https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM
Contribute to compphoto/BoostingMonocularDepth development by creating an account on GitHub.
An arbitrary face-swapping framework on images and videos with one single trained model!
[NVIDIA GPU ONLY] One click installer for Intel's ldm3d
Dense Text-to-Image Generation with Attention Modulation
An open source implementation of Microsoft's VALL-E X zero-shot TTS model
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server (https://github.com/radames/Real-Time-Latent-Consistency-Model)
An AI powered mirror
A Realtime Creation Engine
Vid2DensePoseFeatured
Convert your videos to densepose and use it on MagicAnimate https://github.com/Flode-Labs/vid2densepose
Estimating the Focal Length of a Monocular Image
Integrates Florence2 and SAM2 models for detailed image captioning and object detection. Florence2 generates detailed captions that are then used to perform phrase grounding. The Segment Anything Model 2 (SAM2) converts these phrase-grounded boxes into masks. https://huggingface.co/spaces/SkalskiP/florence-sam
