Store
Explore tags

Batch resize images to 512, 768, or 1024px on the longest side while preserving aspect ratio. Supports JPG, PNG, BMP, GIF, TIFF, and WebP.
🔊 PocketTTS - A lightweight, CPU-optimized Text-to-Speech (TTS) application by Kyutai Labs. Generate natural-sounding speech with low latency (~200ms), voice cloning support, and 6x real-time performance on CPU. 100M parameter model with 8 preset voices and custom voice cloning. English only. No GPU required!

Batch resize images to predefined sizes (512px, 768px, 1024px) while maintaining aspect ratio
BiRefNet for background removal
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference. https://github.com/lllyasviel/stable-diffusion-webui-forge?tab=readme-ov-file

Local IMAP email gateway with HTTP REST API, MCP interface, and WebUI for AI agents.

Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.
Vibe KanbanFeatured
Local web UI for orchestrating AI coding agents and tasks (BloopAI/vibe-kanban).
VITS-based Voice Conversion focused on simplicity, quality and performance
VibeSurf - AI-powered browser assistant for surfing the web with intelligence
Flask-based web UI for AI image/video generation, chat, and text-to-speech with queue management and multi-theme system
Flask-based web UI for AI image/video generation, chat, and text-to-speech with queue management and multi-theme system
A Web UI for easy subtitle using whisper model.
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. (On Windows supported by 7900(XT), 7800(XT), 7600(XT), Phoenix, 9070(XT) and Strix Halo)
The AI that actually does things https://openclaw.ai
Uncensored Deepfakes for images and videos without training and an easy-to-use GUI.
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack

a localized version of https://huggingface.co/spaces/multimodalart/MoDA-fast-talking-head

High-performance Local API for Proedit (Modal + Vercel Integrated)