Explore tags
OpenAI-compatible TTS proxy for Ultimate TTS Studio.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
YuEFeatured
[NVIDIA ONLY] YuEGP--A Web UI for YuE, an Open Full-song Generation Foundation Model (10G VRAM required), via https://github.com/deepbeepmeep/YuEGP
The AI that actually does things https://openclaw.ai
The AI that actually does things https://openclaw.ai
A Modular AI Image Generation Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. Supports Stable Diffusion, Flux, etc. AI image models, with plans to support AI video, audio, and more in the future.
Flask-based web UI for AI image/video generation, chat, and text-to-speech with queue management and multi-theme system
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
Track the most popular Pinokio scripts from verified publishers and the community. Syncs live data from GitHub.
Check-ins4 check-ins
Platforms
AI Persona Interaction System with localized memory and emotional intelligence.
Next-generation face-swapping and enhancement (Codeberg fork of Roop). Easy GUI for images & videos.
Industry leading face manipulation platform
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)
Check-ins3 check-ins
GPUNVIDIAAMDApple
FlashVSR - Video and Image Upscaler: [Runs on 12GB vram, 32GB ram] Diffusion-Based Streaming Video Super-Resolution
Fast Lipsync application for smaller GPU's.
Use PuterOS free credits and models as back-ends to pinokio apps. VERY basic app. More coming soon!!!
Image generation using zai-org/GLM-Image with Gradio UI. Supports text-to-image and image-to-image generation.
Image Upscale is an AI-powered application designed to enhance and upscale images using advanced techniques like Stable Diffusion and Tile ControlNet. It provides high-quality image enhancement with options for HDR effects and customizable settings.
High-Quality Text-to-Speech for Indian Languages