Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.
LTX-Desktop Video Generation + Editor - Powered By WanGP
A Python framework for AI-driven character animation using neural networks.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
High-quality rapid TTS voice cloning model (150x+ realtime) — 48kHz speech, voice cloning
[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Wan2GPFeatured
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
LivePortraitFeatured
Bring portraits to life! https://github.com/KwaiVGI/LivePortrait
High-Quality Text-to-Speech for Indian Languages
Fast Lipsync application for smaller GPU's.
Automatically remove watermarks from videos generated by Sora AI.
Native and Compact Structured Latents for 3D Generation
[AMD ONLY] Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, Flux and more. (On Windows supported by all dedicated AMD GPUs from RDNA 2 - RDNA 4)
Flexible Automapper for Beatsaber made for any difficulty
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Contribute to shivampkumar/trellis-mac development by creating an account on GitHub.
[SIGGRAPH 2026] AniGen: Unified S^3 Fields for Animatable 3D Asset Generation
Upload a short recording of the voice you want to change and a reference clip of the target voice (or leave it blank to anonymize). Adjust simple sliders for speed, pitch, and style, then the app c...
