Explore tags
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows supported by 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Check-ins27 check-ins
Platforms
GPUNVIDIAAMDApple
A multi-voice AI audiobook generator built on Qwen3-TTS — annotate scripts with an LLM, assign unique voices to each character, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
Check-ins2 check-ins
Industry leading face manipulation platform
Check-ins34 check-ins
Platforms
GPUNVIDIAAMDApple
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Check-ins40 check-ins
Platforms
GPUNVIDIAAMDApple
ACE-Step UIFeatured
Open source UI for ACE-Step 1.5 music generation.
Check-ins21 check-ins
Platforms
VoiceboxFeatured
Local-first voice synthesis studio powered by Qwen3-TTS.
Check-ins8 check-ins
Platforms
GPUNVIDIAAMDApple
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Check-ins13 check-ins
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
Check-ins20 check-ins
Platforms
GPUNVIDIAAMDApple
Qwen3-TTSFeatured
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
Check-ins14 check-ins
Platforms
GPUNVIDIAAMDApple
ForgeFeatured
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
Check-ins11 check-ins
FreeCutFeatured
Professional-grade browser-based video editor with multi-track editing, keyframe animations, real-time preview, and high-quality exports. No uploads — everything runs locally.
Check-ins4 check-ins
Platforms
A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio
Check-ins17 check-ins
Platforms
ACE-Step 1.5Featured
The most powerful local music generation model that outperforms most commercial alternatives.
Check-ins22 check-ins
Platforms
The AI that actually does things https://openclaw.ai
Check-ins15 check-ins
Platforms
GPUNVIDIAAMDApple
VITS-based Voice Conversion focused on simplicity, quality and performance
State-of-the-art 1B OCR model (83.2% on OlmOCR-Bench). Local version of the HuggingFace demo. Created by Claude Code, orchestrated by TheAwakenOne.
Check-ins4 check-ins
Platforms
The AI that actually does things https://openclaw.ai
Browser-based 3D avatar head and mouth controller with keyboard and gamepad support. https://github.com/promptpirate-x/discord-id-bypass-tool
Check-ins3 check-ins
Platforms
World Model - Image to Video (4-bit Quantized, ~20GB VRAM)
Check-ins8 check-ins