Project updates
Latest Master Branch
Fork of https://github.com/facefusion/facefusion-pinokio which always installs / updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
# SillyTavern Character Generator
A pinokio script for https://github.com/Tremontaine/character-card-generator
When used with KoboldCPP, use http://localhost:5001/v1,
where 5001 is the port KoboldCPP reports on startup.
The Text API Key field must not be left empty: a blank key causes an error, so enter any value.
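The KoboldCPP settings above can be sketched in Python as follows. This is a minimal illustration, not part of the script itself: the function names `koboldcpp_settings` and `list_models` are hypothetical, and the `/models` route is an assumption based on how OpenAI-compatible servers usually behave.

```python
import json
import urllib.request


def koboldcpp_settings(port: int = 5001, api_key: str = "placeholder") -> dict:
    """Build the URL and key values described above (hypothetical helper).

    port: the port KoboldCPP reports on startup (5001 in the example above).
    api_key: any non-empty string; an empty key is rejected up front.
    """
    if not api_key:
        raise ValueError("Text API Key must not be empty; any value works")
    return {"base_url": f"http://localhost:{port}/v1", "api_key": api_key}


def list_models(settings: dict) -> dict:
    """Optional reachability check against a running KoboldCPP instance.

    Assumes the server answers GET <base_url>/models, as OpenAI-compatible
    endpoints typically do.
    """
    request = urllib.request.Request(
        settings["base_url"] + "/models",
        headers={"Authorization": f"Bearer {settings['api_key']}"},
    )
    with urllib.request.urlopen(request, timeout=5) as response:
        return json.load(response)
```

Calling `koboldcpp_settings(5001)` gives the URL to paste into the generator; `list_models` only works while KoboldCPP is running on that port.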
Generate music in different genres using text and audio prompts.
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
Uncensored deepfakes for images and videos, with no training required and an easy-to-use GUI.
🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.
[v0.5.1] FramePack Video App offering multiple generation types: Original, F1, video extension, end frame. Features include: LoRA support, job queueing, advanced timestamped prompts, offline mode, a post-processing suite including upscaling, interpolation, filters and more!
Autonomous 16x16 Chess-Grid research agent (KoboldCPP + Qwen GGUF). Walks a grid of Markdown knowledge cells, synthesizes short papers, scores novelty, updates a persistent soul.md.
TTS app built around the EchoTTS model. TTS, Dub, and voice cloning.
RC Stable Audio Tools (Featured)
Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools
Fooocus (Featured)
Minimal Stable Diffusion UI
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
One-click install & launch for Stable Diffusion WebUI. Free, local, no API key needed. Just type a prompt and create images.
An Efficient Framework For High Fidelity Face Swapping
Kimodo generates high-quality 3D human and robot motions and is controlled through text prompts
[NVIDIA ONLY] Stable Video Diffusion Streamlit App. Currently supports Nvidia GPU machines only.
LingBot-World NF4 (Featured)
World Model - Image to Video (4-bit Quantized, ~20GB VRAM)
Minimal Stable Diffusion UI
[NVIDIA ONLY] AllTalk-TTS is a unified UI for F5-TTS, XTTS, VITS, Piper TTS, Parler TTS and RVC, based on Coqui TTS, including a finetune mode.
[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, Wan, Nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
