ComfyUI (Featured)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Check-ins: 27
GPU: NVIDIA, AMD, Apple
A multi-voice AI audiobook generator built on Qwen3-TTS: annotate scripts with an LLM, assign a unique voice to each character, set per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects.
Check-ins: 2
Industry-leading face manipulation platform
Check-ins: 34
GPU: NVIDIA, AMD, Apple
Highly optimized Gradio UI for AI video creation on GPU-poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, and Flux. https://github.com/deepbeepmeep/Wan2GP
Check-ins: 40
GPU: NVIDIA, AMD, Apple
ACE-Step UI (Featured)
Open source UI for ACE-Step 1.5 music generation.
Check-ins: 21
Voicebox (Featured)
Local-first voice synthesis studio powered by Qwen3-TTS.
Check-ins: 8
GPU: NVIDIA, AMD, Apple
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Check-ins: 13
High-quality text-to-speech with a beautiful web UI and API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning, plus enhanced support for saving custom voices and long-form or endless TTS streaming.
Check-ins: 20
GPU: NVIDIA, AMD, Apple
Qwen3-TTS (Featured)
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
Check-ins: 14
GPU: NVIDIA, AMD, Apple
Forge (Featured)
[NVIDIA ONLY] The most efficient way to run FLUX, optimized to run even on low-memory machines (as low as 3GB VRAM at 512x512 resolution). https://github.com/lllyasviel/stable-diffusion-webui-forge
Check-ins: 11
FreeCut (Featured)
Professional-grade browser-based video editor with multi-track editing, keyframe animations, real-time preview, and high-quality exports. No uploads — everything runs locally.
Check-ins: 4
A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio
Check-ins: 17
ACE-Step 1.5 (Featured)
The most powerful local music generation model that outperforms most commercial alternatives.
Check-ins: 22
The AI that actually does things https://openclaw.ai
Check-ins: 15
GPU: NVIDIA, AMD, Apple
VITS-based Voice Conversion focused on simplicity, quality and performance
State-of-the-art 1B OCR model (83.2% on OlmOCR-Bench). Local version of the HuggingFace demo. Created by Claude Code, orchestrated by TheAwakenOne.
Check-ins: 4
Browser-based 3D avatar head and mouth controller with keyboard and gamepad support. https://github.com/promptpirate-x/discord-id-bypass-tool
Check-ins: 3
World Model - Image to Video (4-bit Quantized, ~20GB VRAM)
Check-ins: 8
Unified AI VFX pipeline with CPE prompt engineering, storyboard canvas, and multi-node orchestrator. https://github.com/NickPittas/DirectorsConsole
Check-ins: 6