Project updates
Latest Master Branch
Fork of https://github.com/facefusion/facefusion-pinokio which always installs / updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

Help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio
A full-scale college management portal with an integrated AI assistant (AIRA) and automated WhatsApp parent notifications.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team
1-click installer for kohya_ss, a Stable Diffusion LoRA & Dreambooth WebUI (https://github.com/bmaltais/kohya_ss)

Installs StreamSplat in an isolated Pinokio conda env, with the rasterizer build, the Depth Anything V2 checkpoint, and a clean reset.
IP-Adapter-FaceID [Featured]
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID
[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models https://github.com/AILab-CVC/VideoCrafter
Stable Diffusion web UI UX: https://github.com/anapnoe/stable-diffusion-webui-ux
Supertonic is a lightning-fast, on-device text-to-speech system designed for extreme performance with minimal computational overhead. Powered by ONNX Runtime,
The most powerful local music generation model that outperforms most commercial alternatives.
StableAudio [Featured]
An Open Source Model for Audio Samples and Sound Design https://github.com/Stability-AI/stable-audio-tools
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Fast Speech-to-Text Web UI with Apple MLX and OpenAI Whisper
Anime and manga media server with a local web interface. https://github.com/5rahim/seanime
Video inpainting (object removal / video completion) - sczhou/ProPainter
Community Pinokio package for OpenClaw preconfigured to use localhost-only Ollama or LM Studio endpoints.
[NVIDIA ONLY] A minimal Gradio interface for Automatic Speech Recognition. Transcribe audio in the Malayalam language.
A web interface for managing and interacting with Ollama models