Store
Explore tags
MAGNeTFeatured
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md
PhotoMakerFeatured
Customizing Realistic Human Photos via Stacked ID Embedding https://github.com/TencentARC/PhotoMaker
VideoCrafter 2Featured
[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models https://github.com/AILab-CVC/VideoCrafter
Bark Voice CloningFeatured
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
remove-video-bgFeatured
Video background removal tool https://huggingface.co/spaces/amirgame197/Remove-Video-Background
Chatbot-OllamaFeatured
open source chat UI for Ollama https://github.com/ivanfioravanti/chatbot-ollama
differential-diffusion-uiFeatured
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region https://differential-diffusion.github.io/
ZETAFeatured
Zero-Shot Text-Based Audio Editing Using DDPM Inversion https://huggingface.co/spaces/hilamanor/audioEditing
ZeSTFeatured
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)
halloFeatured
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
PhotoMaker2Featured
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
flux-webuiFeatured
Minimal Flux Web UI powered by Gradio & Diffusers (Flux Schnell + Flux Merged)
ApplioFeatured
A simple, high-quality voice conversion tool focused on ease of use and performance.
CogStudioFeatured
[NVIDIA ONLY] Advanced Web UI for CogVideo (text to video, image to video, video to video, extend video, etc) -- Generate videos with less than 10GB VRAM
