Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store

Local-first Suno-style music studio powered by ACE-Step 1.5.
LightOnOCR-2-1BFeatured
State-of-the-art 1B OCR model (83.2% on OlmOCR-Bench). Local version of the HuggingFace demo. Created by Claude Code, orchestrated by TheAwakenOne.
Simple hello world app using Gradio and uv sync
One-click ComfyUI + Torch + Python installer by Inteliweb AI. https://github.com/Comfy-Org
An AI audiobook generator built on Qwen3-TTS. Annotate your book with an LLM, assign voices, per-line style instructions for delivery, clone voices from reference audio, design new voices from text descriptions, and export to MP3 or Audacity multi-track projects
A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio
LFM2.5-VL-450M (Liquid AI): compact vision–language model for image understanding. Gradio UI with upload/URL, prompt, and generation sliders.
Moore-AnimateAnyone-MiniFeatured
[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size) https://github.com/sdbds/Moore-AnimateAnyone-for-windows
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, with AMD GPU support via ROCm. Windows and Linux.
Open WebUIFeatured
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).
Launcher principal y simple para ComfyUI. Instala ComfyUI + Manager, descarga modelos listos para usar y te guia hacia las plantillas oficiales.
One-click OpenClaw gateway + Nerve launcher with no onboarding prompts.
Turn your eBooks into audiobooks using the OmniVoice text-to-speech model
Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.
Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.
Simple Gradio app for generating images with Tongyi-MAI/Z-Image-Turbo.
Pinokio wrapper for LongCat-AudioDiT with selectable 1B / 3.5B model downloads.
Goose sidecar dashboard draft with Pinokio onboarding-focused launcher.