Store
#ai20#image-generation6#14#gradio4#tts4#video4#video-generation4#audio3#image3#music3#music-generation3#musicgen3#song-generation3##ai-#audio-generation-#song2##ai-#image-generation2##music-generation2#3d2#ai-music2#audio-generation2#cags2#cuentos2#gaussian2#gaussian-splat2#image-edit2#lipsync2#qwen2#song2#suno2#wan2#wan2gp2
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Ultimate-TTS-StudioFeatured
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
StableDAWFeatured
Browser-based AI audio DAW for Stable Audio 3 with text-to-audio, inpainting, LoRA training, FFmpeg effects, waveform editing, sequencer, piano roll, and persistent library. https://github.com/gantasmo/stabledaw
Image to PromptFeatured
Generate editable Ideogram JSON prompts from uploaded images.
SongGeneration StudioFeatured
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
Wan2GPFeatured
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
FramePackFeatured
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
Wan2GP - AMDFeatured
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)
Stable Audio 3Featured
Launcher for Stable Audio 3 Small Music, Small SFX, and NVIDIA Medium using public cocktailpeanut Hugging Face mirrors. https://github.com/Stability-AI/stable-audio-3
StoryDiffusion ComicsFeatured
create a story by generating consistent images https://github.com/HVision-NKU/StoryDiffusion
FooocusFeatured
Minimal Stable Diffusion UI
FaceFusion 3.5.4Featured
Industry leading face manipulation platform
Qwen3-TTS MLX WebUI EnhancedFeatured
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
ACE-Step UIFeatured
Open source UI for ACE-Step 1.5 music generation.
ForgeFeatured
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
VideoCrafter 2Featured
[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models https://github.com/AILab-CVC/VideoCrafter
ACE-Step 1.5Featured
The most powerful local music generation model that outperforms most commercial alternatives.
WorldMirror 2.0Featured
[NVIDIA] Pinokio launcher for the released WorldMirror 2.0 reconstruction app from HY-World 2.0. Uses a cu128 PyTorch baseline with gsplat from PyPI/JIT. https://github.com/Tencent-Hunyuan/HY-World-2.0
WebUI for ML-Sharp (3DGS)Featured
One-click 3D Gaussian Splatting generation from a single image.
Unsloth StudioFeatured
Run and train AI models with a unified local interface. https://github.com/unslothai/unsloth
