Related tags
A generative speech model for daily dialogue.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Check-ins27 check-ins
Platforms
GPUNVIDIAAMDApple
The AI that actually does things https://openclaw.ai
Check-ins15 check-ins
Platforms
GPUNVIDIAAMDApple
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
Check-ins20 check-ins
Platforms
GPUNVIDIAAMDApple
RVCFeatured
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
Wan: Open and Advanced Large-Scale Video Generative Models
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Wan2GPFeatured
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Check-ins52 check-ins
GPUNVIDIAAMDApple
A Web UI for easy subtitle using whisper model.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple