Store
PersonaPlex
🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices. Requires a powerful NVIDIA GPU (16-24GB VRAM), 32GB RAM, and a Hugging Face account.
FunAudioLLM/Fun-CosyVoice3-0.5B-2512 · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Skywork/SkyReels-V2-DF-14B-720P · Hugging Face
TRELLIS.2
One-click installer for Microsoft TRELLIS.2: High-quality 3D asset generation from images with PBR textures.
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
mlx-community/Qwen3-TTS-12Hz-1.7B-CustomVoice-8bit · Hugging Face
Z-Fusion
Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+ VRAM, 16GB+ RAM]
huihui-ai/Qwen3-32B-abliterated · Hugging Face
TTS-Story
Multi-Voice Text-to-Speech for Stories and Audiobooks. Supports Kokoro and Chatterbox TTS engines with GPU acceleration.
SongGeneration Studio
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
jordand/echo-tts-base · Hugging Face
Echo-TTS Preview - a Hugging Face Space by jordand
Fast, multi-speaker TTS (44.1kHz) with voice cloning
