Store
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
DeepHermes, but without the need for a system prompt. Responds autonomously based on its own judgment https://github.com/cocktailpeanut/deeperhermes
Video translation & dubbing with voice cloning — 100% local, zero API. Supports 10 languages, automatic transcription, translation, and AI dubbing.
Hunyuan3D-2-LowVRAM (Featured)
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
LingBot-World NF4 (Featured)
World Model - Image to Video (4-bit Quantized, ~20GB VRAM)
MatAnyone (Featured)
MatAnyone removes video backgrounds by separating objects from the scene behind them. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
One click face-swap GUI
e2-f5-tts (Featured)
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
AI Green Screen Keyer & Alpha Generator
A multi-voice AI audiobook generator built on Qwen3-TTS: annotate scripts with an LLM, assign a unique voice to each character, add per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, train custom voices with LoRA fine-tuning, and export to MP3 or Audacity multi-track projects
Stable Diffusion web UI
Remove backgrounds from videos and images with precision AI matting. Runs locally on 12GB VRAM — Windows, Linux, and macOS.
Clarity Refiners UI (Featured)
An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)
AI Song Generation on Apple Silicon Macs, with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model.
TTS app built around the EchoTTS model. TTS, Dub, and voice cloning.
[NVIDIA ONLY] Dead simple web UI for training FLUX LoRA with low VRAM support (from 12GB)
[AMD ONLY] Super Optimized Gradio UI for AI video creation on GPU-poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows, supported on 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
Unsloth Studio (Featured)
Run and train AI models with a unified local interface. https://github.com/unslothai/unsloth
CogStudio (Featured)
[NVIDIA ONLY] Advanced Web UI for CogVideo (text to video, image to video, video to video, extend video, etc) -- Generate videos with less than 10GB VRAM
LightOnOCR-2-1B (Featured)
State-of-the-art 1B OCR model (83.2% on OlmOCR-Bench). Local version of the HuggingFace demo. Created by Claude Code, orchestrated by TheAwakenOne.
