Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
Next generation face swapper and enhancer
FlashVSR - Video and Image Upscaler: [Runs on 12GB vram, 32GB ram] Diffusion-Based Streaming Video Super-Resolution
Deepfakes Software For All
[NVIDIA ONLY] Advanced Web UI for CogVideo (text to video, image to video, video to video, extend video, etc) -- Generate videos with less than 10GB VRAM
cogvideoFeatured
[NVIDIA ONLY] Generate videos with less than 10GB VRAM https://github.com/THUDM/CogVideo
i have patched and reengineered open sora to work on windows
Minimal Flux Web UI powered by Gradio & Diffusers (Flux Schnell + Flux Merged)
ColorFusion-XL Video Colorization (PAL-stabil, RTX3080-optimiert)
AudioCraft Plus is an all-in-one WebUI for the original AudioCraft, adding many quality features on top https://github.com/GrandaddyShmax/audiocraft_plus
All in one Gradio interface for chatterbox. Voice cloning from uploaded audio samples, automatic text processing for long content and real-time speech generation with configurable parameters. (Minimum Requirements 4GB VRAM / Recommended Requirements 8GB VRAM)
VibeVoice RealtimeFeatured
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
PinokioLangGraph — Agent-StateSync for SillyTavern. A Pinokio script that runs a FastAPI + LangGraph agent as middleware between SillyTavern and your LLMs.
Swap faces in photos and videos in seconds — no training required. Powered by InsightFace and ONNX, with optional TensorRT acceleration, multi-face targeting, enhancement pipelines, and a clean one-click interface.
LTX-Desktop Video Generation + Editor - Powered By WanGP
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Wan2GPFeatured
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
LivePortraitFeatured
Bring portraits to life! https://github.com/KwaiVGI/LivePortrait
High-Quality Text-to-Speech for Indian Languages
Fast Lipsync application for smaller GPU's.
