Project updates
Latest Master Branch
Fork from https://github.com/facefusion/facefusion-pinokio Which always installs / Updates to the latest Mast...

X-Voice The universal translator
X-Voice is a voice clone app that lets you clone voices in any language. The Zero-Shot Voice Cloning tab work...

help from devs to improve FluxRT
I’ve completed a full installer for FluxRT and have already integrated several new modes into the UI, includi...

Fooocus2026 — Pinokio launcher available now
A one-click way to install my fork of Fooocus 2.5.5, with the quality-of-life additions I kept missing in ups...
Store
リアルタイムボイスチェンジャー Realtime Voice Changer
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially for my kids.
ComfyUI Chatterbox TTS & Voice Conversion Node
🔥 [ICCV 2025 Highlight] Official open-source repo for LVFace: Progressive Cluster Optimization for Large Vision Models in Face Recognition
Contribute to ZeusDevs666/ZeusGPT development by creating an account on GitHub.
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
🎬 Professional Video Dubbing Pipeline with Parakeet-TDT-0.6b-v2, Gemini AI, and Edge TTS. Complete solution for automated video dubbing with step-by-step processing and batch video creation from multiple audio files.
The first automated AI tool discovery platform using the revolutionary .awesome-ai.md standard. Real-time GitHub scanning, automated curation, and live leaderboards for AI tools.
Audio cloning - CPU-Only Inference Fork with auto-installer and launcher
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework
Compatible with all CUDA cards. Windows and linux
MeloTTSFeatured
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS
Offline AI music recommender & recogniser. Local song ID + smart playlists (CLAP + FAISS).
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
[NVIDIA ONLY] End-to-end multimodal SVG generator capable of generating complex and detailed SVGs, from simple icons to intricate anime characters. (Minimum Requirements 12GB VRAM / 32GB RAM, Recommended Requirements 24GB VRAM / 24GB RAM)
Contribute to VikingOwl91/polyscribe development by creating an account on GitHub.
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
SkyReels-V2: Infinite-length Film Generative model
