Explore tags
Build your own voice for StyleTTS2
YuEFeatured
[NVIDIA ONLY] YuEGP--A Web UI for YuE, an Open Full-song Generation Foundation Model (10G VRAM required), via https://github.com/deepbeepmeep/YuEGP
MatAnyoneFeatured
MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
cubeFeatured
Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires 16GB+ VRAM PC or 18GB+ Memory Macs https://github.com/Roblox/cube
unoFeatured
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
AI视频剪辑
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Synchronized Translation for Videos. Video dubbing
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
Tag manager and captioner for image datasets: https://github.com/jhc13/taggui
Beta release of Archon OS - the knowledge and task management backbone for AI coding assistants.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Frontier Open-Source Text-to-Speech