Store
Explore tags
GitHub - Tencent-Hunyuan/HunyuanImage-3.0: HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation - Tencent-Hunyuan/HunyuanImage-3.0

HeartMuLa (HeartMuLaGen)
Pinokio wrapper: installs HeartMuLa heartlib + downloads checkpoints + launches a Gradio UI for music generation.
PocketTTS
馃攰 PocketTTS - A lightweight, CPU-optimized Text-to-Speech (TTS) application by Kyutai Labs. Generate natural-sounding speech with low latency (~200ms), voice cloning support, and 6x real-time performance on CPU. 100M parameter model with 8 preset voices and custom voice cloning. English only. No GPU required!
Wan2GP
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
Qwen3-Audiobook-Converter
Convert PDFs, EPUBs, DOCX, DOC, and TXT files into high-quality audiobooks using **Qwen3 TTS Voice Model** - an open-source voice synthesis system that excels at natural speech generation and voice cloning.
Orpheus-TTS-FastAPI
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
GitHub - NVIDIA/personaplex: PersonaPlex code.
PersonaPlex code. Contribute to NVIDIA/personaplex development by creating an account on GitHub.
Forge Neo
[NVIDIA ONLY] Stable Diffusion WebUI Forge supporting Flux, Qwen, wan, nunchaku and more in a lightweight WebUI. https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
LiquidAI-LFM2.5 Playground
Local multimodal app powered by Liquid AI LFM2.5-Audio-1.5B and LFM2.5-VL-1.6B models, delivering real-time voice chat, text-to-speech synthesis, long-form audio transcription, and multi-image vision reasoning.
HeartMuLa/HeartMuLa-oss-3B 路 Hugging Face
We鈥檙e on a journey to advance and democratize artificial intelligence through open source and open science.
VoxForge Pro
Premium AI-Powered Audiobook Generator with 47 voices, PDF processing, and voice cloning
e2-f5-tts
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
GitHub - mrxmentalist/Heartmula-Suno-UI: "HeartMuLa Suno UI Pro: A self-contained, high-performance music generation dashboard with built-in LLM assistants and optimized VRAM management.
"HeartMuLa Suno UI Pro: A self-contained, high-performance music generation dashboard with built-in LLM assistants and optimized VRAM management. - mrxmentalist/Heartmula-Suno-UI
GLM-Image
Image generation using zai-org/GLM-Image with Gradio UI. Supports text-to-image and image-to-image generation.