Store
Explore tags
ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Z-Fusion
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]
Tongyi-MAI/Z-Image · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Wan2GP
Fast AI Video Generation per GPU poor (Wan2.1, Hunyuan, LTV). Gradio UI su http://127.0.0.1:7860
Wan2GP
Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP
MuseTalk
MuseTalk is a cutting-edge video-to-video (V2V) lip-sync solution engineered to deliver highly accurate and natural mouth movements synchronized to audio input. Precision LipSync: Realistic and seamless synchronization of speech audio to facial movements. Efficiently designed to run on 8–12 GB VRAM,
LiquidAI - LFM2-Audio-1.5B
LFM2-Audio-1.5B is Liquid AI's first end-to-end audio foundation model. Designed with low latency and real time conversation in mind
Ollama Model Creator
🦙 Let 2 models debate about a topic you pick. Create custom Ollama models with your own system prompts and parameters and use them to debate ot publish on ollama.com Easy-to-use Gradio interface for building personalized AI models with temperature control and custom instructions.
city96/FLUX.1-schnell-gguf · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
HeartMuLa Studio
A professional, Suno-like music generation studio for HeartLib. https://github.com/fspecii/HeartMuLa-Studio
PersonaPlex
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices. Requires powerful NVIDIA GPU (16-24GB VRAM), 32GB RAM, and Hugging Face account.
Qwen3-TTS
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
FunAudioLLM/Fun-CosyVoice3-0.5B-2512 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
TTS-Story
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Skywork/SkyReels-V2-DF-14B-720P · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.