Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
ACE-Step: A Step Towards Music Generation Foundation Model
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Project page for ICCV 2025 paper "Controllable and Expressive One-Shot Video Head Swapping"
Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper
Unified Image Understanding and Generation. Text-to-Image Generation, In-context Generation, Instruction-guided Image Editing, Visual Understanding (Minimum Requirements 12GBV RAM / 48GB RAM, Recommended Requirements 24GB VRAM / 32GB RAM)
MIDIfren is an Audio Stem & MIDI Processor in Python🎵. Convert audio to MIDI, extract stems, sonify MIDI files ...
a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI. https://huggingface.co/spaces/stabilityai/TripoSR
Contribute to bmaltais/kohya_ss development by creating an account on GitHub.
Contribute to presenton/presenton_docker development by creating an account on GitHub.
Contribute to presenton/presenton_electron development by creating an account on GitHub.
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
Full-stack AI video generation app with image/text input and premium NSFW toggle
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
ICCV‑23 video in‑/out‑painting
Contribute to mannaandpoem/OpenManus development by creating an account on GitHub.
