Pinokio
Explore tags
GitHub - mcmonkeyprojects/SwarmUI: SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
https://github.com/mcmonkeyprojects/SwarmUIupdated 1/26/2026, 10:32:46 PMindexed 1/27/2026, 2:12:28 AM
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. - mcmonkeyprojects/Swa...
GitHub - deepbeepmeep/Wan2GP: A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
https://github.com/deepbeepmeep/Wan2GPupdated 1/26/2026, 8:56:26 PMindexed 1/27/2026, 12:04:11 PM
A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux. - deepbeepmeep/Wan2GP
LuxTTS Studio
https://github.com/TheAwaken1/LuxTTS-Studiov2.0updated 1/26/2026, 7:53:06 PMindexed 1/26/2026, 11:39:48 PM
Gradio-based web interface for the LuxTTS voice cloning and text-to-speech model, enabling users to generate customized speech from text using uploaded or recorded audio references with adjustable parameters like speed, guidance scale, and inference steps.
Eloquent
https://github.com/boneylizard/Eloquentv2.1updated 1/26/2026, 6:41:26 PMindexed 1/27/2026, 12:36:08 AM
Local AI Workstation
Ultimate-TTS-Studio
https://github.com/pinokiofactory/Ultimate-TTS-Studiov3.7updated 1/26/2026, 5:29:34 PMindexed 1/28/2026, 8:16:05 PM
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Videomama Pinokio
https://github.com/MustangXPress7/VideoMaMa-Pinokiov5.0updated 1/26/2026, 5:21:31 PMindexed 1/26/2026, 5:22:11 PM
open-notebook
https://github.com/lfnovo/open-notebookupdated 1/26/2026, 10:44:04 AMindexed 1/28/2026, 1:50:59 PM
An Open Source implementation of Notebook LM with more flexibility and features
GitHub - jianchang512/pyvideotrans: Translate the video from one language to another and embed dubbing & subtitles.
https://github.com/jianchang512/pyvideotransupdated 1/26/2026, 10:29:09 AMindexed 1/27/2026, 12:11:52 PM
Translate the video from one language to another and embed dubbing & subtitles. - jianchang512/pyvideotrans
GitHub - pashkov256/deletor: Manage and delete files efficiently with an interactive TUI and scriptable CLI.
https://github.com/pashkov256/deletorupdated 1/26/2026, 10:28:56 AMindexed 1/27/2026, 3:46:40 PM
Manage and delete files efficiently with an interactive TUI and scriptable CLI. - pashkov256/deletor
UVR5-UI
https://github.com/Eddycrack864/UVR5-UI-pinokiov3.2updated 1/26/2026, 9:18:53 AMindexed 1/26/2026, 7:31:27 PM
koboldcpp
https://github.com/LostRuins/koboldcppupdated 1/26/2026, 5:06:36 AMindexed 1/28/2026, 4:08:26 AM
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
GitHub - Tencent-Hunyuan/HunyuanImage-3.0: HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
https://github.com/Tencent-Hunyuan/HunyuanImage-3.0updated 1/26/2026, 2:04:27 AMindexed 1/27/2026, 4:44:54 PM
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation - Tencent-Hunyuan/HunyuanImage-3.0
HeartMuLa (HeartMuLaGen)
https://github.com/the-hornery/heartmula.pinokiov3.7updated 1/26/2026, 1:32:34 AMindexed 1/26/2026, 7:30:45 PM
Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.
PocketTTS
https://github.com/PierrunoYT/pocket-tts-pinokiov5.0updated 1/25/2026, 9:50:57 PMindexed 1/26/2026, 7:30:45 PM
馃攰 PocketTTS - A lightweight, CPU-optimized Text-to-Speech (TTS) application by Kyutai Labs. Generate natural-sounding speech with low latency (~200ms), voice cloning support, and 6x real-time performance on CPU. 100M parameter model with 8 preset voices and custom voice cloning. English only. No GPU required!
ChatterBox
https://github.com/PierrunoYT/chatterbox-tts-appv3.7updated 1/25/2026, 9:39:26 PMindexed 1/26/2026, 7:30:47 PM
Qwen3-Audiobook-Converter
https://github.com/WhiskeyCoder/Qwen3-Audiobook-Converterupdated 1/25/2026, 3:04:35 PMindexed 1/28/2026, 1:13:58 PM
Convert PDFs, EPUBs, DOCX, DOC, and TXT files into high-quality audiobooks using **Qwen3 TTS Voice Model** - an open-source voice synthesis system that excels at natural speech generation and voice cloning.
FaceFusion 3.4.1
https://github.com/facefusion/facefusion-pinokiov1.6updated 1/25/2026, 12:08:25 PMindexed 1/26/2026, 7:30:46 PM
Industry leading face manipulation platform
LivePortrait
https://github.com/6Morpheus6/liveportraitv3.7updated 1/25/2026, 5:43:01 AMindexed 1/25/2026, 5:43:39 AM
Bring portraits to life! https://github.com/KwaiVGI/LivePortrait
Orpheus-TTS-FastAPI
https://github.com/pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 1/24/2026, 11:02:12 PMindexed 1/24/2026, 11:41:38 PM
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
GitHub - NVIDIA/personaplex: PersonaPlex code.
https://github.com/NVIDIA/personaplexupdated 1/24/2026, 10:46:36 PMindexed 1/27/2026, 2:40:12 PM
PersonaPlex code. Contribute to NVIDIA/personaplex development by creating an account on GitHub.
PreviousPage 4 / 35Next