Store
Chatterbox-Multilingual
Fast, high-quality, multilingual zero-shot voice-cloning text-to-speech with flow matching
RVC-realtime
[WINDOWS/LINUX ONLY] Easily train a good VC model with voice data <= 10 mins!: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Spanish-F5
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
StoryCraft
Generate engaging 1- to 5-minute short stories with LLMs and convert them to audio with Coqui TTS. Supports voice cloning, built-in speakers, and multiple languages.
FramePack (GitHub: lllyasviel/FramePack)
Let's make video diffusion practical!
pixel-art-react (GitHub: jvalen/pixel-art-react)
Pixel art animation and drawing web app powered by React
Ovi
Ovi is a Veo 3-like video+audio generation model that generates video and audio simultaneously from text or text+image inputs.
Uniqu3D.git
[NVIDIA ONLY] High-Quality and Efficient 3D Mesh Generation from a Single Image (Minimum requirements 12GB VRAM / 24GB RAM)
Roop-Floyd
Next-generation face-swapping and enhancement (Codeberg fork of Roop). Easy GUI for images & videos.
candy-machine
Image Dataset Tagger for Stable Diffusion / LoRA / DreamBooth Training: https://github.com/mikeknapp/candy-machine
geo-clip
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
stable-diffusion-webui-amdgpu-forge
Forge for stable-diffusion-webui-amdgpu (formerly stable-diffusion-webui-directml)
