Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
LFM2-Audio-1.5B is Liquid AI's first end-to-end audio foundation model. Designed with low latency and real time conversation in mind
Pinokio script for https://huggingface.co/Ole1/Joy_Caption_Batch-GUI
Gradio-based web interface for the LuxTTS voice cloning and text-to-speech model, enabling users to generate customized speech from text using uploaded or recorded audio references with adjustable parameters like speed, guidance scale, and inference steps.
A tool that takes a text document containing a book or a novel, ingests it with an LLM to produce an annotated script, and then uses a TTS API to generate the voice lines, finally stitching them together into an audiobook in MP3 format.
OneTrainer para Pinokio vato loco
Imposing Consistent Light - Control lighting of images
Fast AI Video Generation per GPU poor (Wan2.1, Hunyuan, LTV). Gradio UI su http://127.0.0.1:7860
RVCFeatured
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference. https://github.com/Panchovix/stable-diffusion-webui-reForge
An open-source, modern-design ChatGPT/LLMs UI/Framework. Supports speech-synthesis, multi-modal, and extensible (function call) plugin system. https://github.com/lobehub/lobe-chat
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
One-click installer for Microsoft TRELLIS.2: High-quality 3D asset generation from images with PBR textures.
Google's official AI agent for your terminal. Access Gemini 2.5 Pro with 1M token context window directly from the command line.

Mon portail IA personnel
Secure Workflow Automation for Technical Teams
Relight any image using AI (SwitchLight-inspired)
Industry leading face manipulation platform
MuseTalk is a cutting-edge video-to-video (V2V) lip-sync solution engineered to deliver highly accurate and natural mouth movements synchronized to audio input. Precision LipSync: Realistic and seamless synchronization of speech audio to facial movements. Efficiently designed to run on 8–12 GB VRAM,
🦙 Let 2 models debate about a topic you pick. Create custom Ollama models with your own system prompts and parameters and use them to debate ot publish on ollama.com Easy-to-use Gradio interface for building personalized AI models with temperature control and custom instructions.