Store

GitHub - Tencent-Hunyuan/HunyuanImage-3.0: HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

https://github.com/Tencent-Hunyuan/HunyuanImage-3.0updated 1/26/2026, 2:04:27 AMindexed 1/27/2026, 4:44:54 PM

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation - Tencent-Hunyuan/HunyuanImage-3.0

HeartMuLa (HeartMuLaGen)

https://github.com/the-hornery/heartmula.pinokiov3.7updated 1/26/2026, 1:32:34 AMindexed 1/26/2026, 7:30:45 PM

Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.

PocketTTS

https://github.com/PierrunoYT/pocket-tts-pinokiov5.0updated 1/25/2026, 9:50:57 PMindexed 1/26/2026, 7:30:45 PM

🔊 PocketTTS - A lightweight, CPU-optimized Text-to-Speech (TTS) application by Kyutai Labs. Generate natural-sounding speech with low latency (~200ms), voice cloning support, and 6x real-time performance on CPU. 100M parameter model with 8 preset voices and custom voice cloning. English only. No GPU required!

ChatterBox

https://github.com/PierrunoYT/chatterbox-tts-appv3.7updated 1/25/2026, 9:39:26 PMindexed 1/26/2026, 7:30:47 PM

Wan2GP

https://github.com/6Morpheus6/wan2gpv3.7updated 1/25/2026, 6:29:25 PMindexed 1/25/2026, 6:30:11 PM

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

#wan2gp

Qwen3-Audiobook-Converter

https://github.com/WhiskeyCoder/Qwen3-Audiobook-Converterupdated 1/25/2026, 3:04:35 PMindexed 1/28/2026, 1:13:58 PM

Convert PDFs, EPUBs, DOCX, DOC, and TXT files into high-quality audiobooks using **Qwen3 TTS Voice Model** - an open-source voice synthesis system that excels at natural speech generation and voice cloning.

FaceFusion 3.4.1

https://github.com/facefusion/facefusion-pinokiov1.6updated 1/25/2026, 12:08:25 PMindexed 1/26/2026, 7:30:46 PM

Industry leading face manipulation platform

#faceswap #ai #video

LivePortrait

https://github.com/6Morpheus6/liveportraitv3.7updated 1/25/2026, 5:43:01 AMindexed 1/25/2026, 5:43:39 AM

Bring portraits to life! https://github.com/KwaiVGI/LivePortrait

Orpheus-TTS-FastAPI

https://github.com/pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 1/24/2026, 11:02:12 PMindexed 1/24/2026, 11:41:38 PM

Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS

#ai #tts

GitHub - NVIDIA/personaplex: PersonaPlex code.

https://github.com/NVIDIA/personaplexupdated 1/24/2026, 10:46:36 PMindexed 1/27/2026, 2:40:12 PM

PersonaPlex code. Contribute to NVIDIA/personaplex development by creating an account on GitHub.

Forge Neo

https://github.com/6Morpheus6/forge-neov2.0updated 1/24/2026, 4:41:48 PMindexed 1/24/2026, 4:42:02 PM

[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo

LiquidAI-LFM2.5 Playground

https://github.com/TheAwaken1/LiquidAI-LFM2.5-Playgroundv2.0updated 1/24/2026, 3:15:55 PMindexed 1/24/2026, 3:16:01 PM

Local multimodal app powered by Liquid AI LFM2.5-Audio-1.5B and LFM2.5-VL-1.6B models, delivering real-time voice chat, text-to-speech synthesis, long-form audio transcription, and multi-image vision reasoning.

Qwen3 Tts Cpu Pinokio

https://github.com/akhileshwebx/Qwen3-TTS-CPU-Pinokioupdated 1/24/2026, 12:13:29 PMindexed 1/24/2026, 12:14:00 PM

HeartMuLa/HeartMuLa-oss-3B · Hugging Face

https://huggingface.co/HeartMuLa/HeartMuLa-oss-3Bindexed 1/24/2026, 6:18:49 AM

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

VoxForge Pro

https://github.com/shinshekai/VoxForge-Prov5.0updated 1/23/2026, 10:04:39 PMindexed 1/23/2026, 10:05:00 PM

Premium AI-Powered Audiobook Generator with 47 voices, PDF processing, and voice cloning

e2-f5-tts

https://github.com/pinokiofactory/e2-f5-ttsv3.7updated 1/23/2026, 9:14:27 PMindexed 1/27/2026, 1:36:05 PM

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #ai

GitHub - mrxmentalist/Heartmula-Suno-UI: "HeartMuLa Suno UI Pro: A self-contained, high-performance music generation dashboard with built-in LLM assistants and optimized VRAM management.

https://github.com/mrxmentalist/Heartmula-Suno-UIupdated 1/23/2026, 7:59:24 PMindexed 1/23/2026, 7:59:40 PM

"HeartMuLa Suno UI Pro: A self-contained, high-performance music generation dashboard with built-in LLM assistants and optimized VRAM management. - mrxmentalist/Heartmula-Suno-UI

Hardcore Codex

https://github.com/cocktailpeanut/hardcore_codexupdated 1/23/2026, 5:18:30 PMindexed 1/23/2026, 7:47:09 PM

OpenAI Codex CLI with --dangerously-bypass-approvals-and-sandbox

ComfyDock

https://github.com/comfygit-ai/ComfyDock-Pinokiov2.0updated 1/23/2026, 3:49:54 PMindexed 1/23/2026, 7:47:43 PM

Manage your ComfyUI environments with Docker

GLM-Image

https://github.com/shinshekai/GLM-Imagev5.0updated 1/23/2026, 12:37:08 PMindexed 1/23/2026, 7:47:11 PM

Image generation using zai-org/GLM-Image with Gradio UI. Supports text-to-image and image-to-image generation.