Explore tags
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation https://huggingface.co/spaces/ashawkey/LGM
Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background
dust3rFeatured
Geometric 3D Vision Made Easy https://dust3r.europe.naverlabs.com/
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
ZETAFeatured
Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing
Arc2FaceFeatured
A Foundation Model of Human Faces https://huggingface.co/spaces/FoivosPar/Arc2Face
sprightFeatured
Generate images with spatial accuracy https://huggingface.co/spaces/SPRIGHT-T2I/SPRIGHT-T2I
CustomNetFeatured
A unified encoder-based framework for object customization in text-to-image diffusion models https://huggingface.co/spaces/TencentARC/CustomNet
gligenFeatured
An intuitive GUI for GLIGEN that uses ComfyUI in the backend https://github.com/mut-ex/gligen-gui
Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI, https://huggingface.co/spaces/multimodalart/cosxl
face-to-allFeatured
diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of https://huggingface.co/spaces/multimodalart/face-to-all
instantstyleFeatured
Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required https://huggingface.co/spaces/InstantX/InstantStyle
parler-ttsFeatured
a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation). https://huggingface.co/spaces/parler-tts/parler_tts_mini
ZeSTFeatured
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)
LlamaFactoryFeatured
Unify Efficient Fine-Tuning of 100+ LLMs https://github.com/hiyouga/LLaMA-Factory
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
StableAudioFeatured
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
Accelerating any conditional diffusion model for few steps image generation https://gojasper.github.io/flash-diffusion-project/
Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools