Explore tags
Batch resize images to 512, 768, or 1024px on the longest side while preserving aspect ratio. Supports JPG, PNG, BMP, GIF, TIFF, and WebP.
🔊 PocketTTS - A lightweight, CPU-optimized Text-to-Speech (TTS) application by Kyutai Labs. Generate natural-sounding speech with low latency (~200ms), voice cloning support, and 6x real-time performance on CPU. 100M parameter model with 8 preset voices and custom voice cloning. English only. No GPU required!
Batch resize images to predefined sizes (512px, 768px, 1024px) while maintaining aspect ratio
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference. https://github.com/lllyasviel/stable-diffusion-webui-forge?tab=readme-ov-file
Local IMAP email gateway with HTTP REST API, MCP interface, and WebUI for AI agents.
Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.
Vibe KanbanFeatured
Local web UI for orchestrating AI coding agents and tasks (BloopAI/vibe-kanban).
Check-ins5 check-ins
Platforms
VITS-based Voice Conversion focused on simplicity, quality and performance
Check-ins3 check-ins
Platforms
VibeSurf - AI-powered browser assistant for surfing the web with intelligence
Flask-based web UI for AI image/video generation, chat, and text-to-speech with queue management and multi-theme system
Flask-based web UI for AI image/video generation, chat, and text-to-speech with queue management and multi-theme system
A Web UI for easy subtitle using whisper model.
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows supported by 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
Check-ins7 check-ins
GPUNVIDIAAMDApple
The AI that actually does things https://openclaw.ai
Uncensored Deepfakes for images and videos without training and an easy-to-use GUI.
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
Check-ins3 check-ins
a localized version of https://huggingface.co/spaces/multimodalart/MoDA-fast-talking-head
High-performance Local API for Proedit (Modal + Vercel Integrated)