Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
X-Voice is a multilingual text-to-speech system that enables one speaker to speak 30 languages.
Qwen3-TTS MLX WebUI EnhancedFeatured
High-quality text-to-speech with Beautiful Web UI & API, optimized for Apple Silicon using MLX. Features include Custom Voice (preset speakers), Voice Design (natural language), and Voice Cloning. With enhanced features for saving custom voices and long-form / endless TTS streaming.
A personal fork of lllyasviel/Fooocus v2.5.5 with quality-of-life features: Save Preset, CivitAI Model Settings, LoRA trigger words, Embeddings panel, Wildcards editor, Vary-with-aspect-ratio, Custom Resolution, Asset Browser, Restart UI button.
Linux-tested Pinokio launcher for FluxRT with automatic model downloads.

Local AI YouTube video generator: script, scenes, voiceover, thumbnail and MP4 export.
ACE-Step UIFeatured
Open source UI for ACE-Step 1.5 music generation.

Free LLM API server with dashboard
Based on BFS - Best Face Swap, VisoMaster, and SwapAnyHead.
Hermes Agent with modern WebUI (nesquena/hermes-webui). Persistent memory, multi-provider AI (OpenAI, Anthropic, Gemini, DeepSeek, OpenRouter), scheduled cron jobs, skills, and sessions. Three-panel interface with chat, tasks, memory, and workspace browser. https://github.com/nesquena/hermes-webui
Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI
ForgeFeatured
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
P2PCLAW Agent Benchmark — connect any LLM agent (Claude, GPT, Gemini, Qwen, Kimi, DeepSeek…) and get scored on 10 dimensions + Tribunal IQ. Dashboard runs locally on :8787, leaderboard at p2pclaw.com/app/benchmark.
Automatically clip videos and generate captions for LoRA training using advanced vision models like Gemma-3, Qwen3-VL, and Qwen2-VL.
VoiceboxFeatured
Local-first voice synthesis studio powered by Qwen3-TTS.
A local-first agentic HTML editor that turns markdown, data, and notes into designed HTML artifacts.
Identify and download missing models for ComfyUI workflows automatically.
ComfyuiFeatured
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Decentralized geospatial intelligence dashboard aggregating 60+ real-time public feeds (aircraft, ships, satellites, seismic events, fires, signals) into a unified map UI.
Director's ConsoleFeatured
Unified AI VFX pipeline with CPE prompt engineering, storyboard canvas, and multi-node orchestrator. https://github.com/NickPittas/DirectorsConsole