RC Stable Audio Tools
Advanced Gradio UI for Stable Audio https://github.com/RoyalCities/RC-stable-audio-tools
LCM
Fast Image generator using Latent consistency models https://replicate.com/blog/run-latent-consistency-model-on-mac
Tetris
🎮 Classic Tetris game with modern web UI. Play the timeless puzzle game in your browser with smooth controls, score tracking, and beautiful graphics.
RMBG-2-Studio
Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0
OpenVoice
Instantly clone any voice from any text to any speech, in any language https://huggingface.co/spaces/myshell-ai/OpenVoice
PhotoMaker2
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
FlashFace
Human Image Personalization with High-fidelity Identity Preservation: https://github.com/ali-vilab/FlashFace
Direct3D-S2
[NVIDIA ONLY] Direct3D-S2 is a scalable 3D shape generation framework leveraging sparse volumetric representations for high-resolution outputs. It features Spatial Sparse Attention (SSA), a novel mechanism that accelerates Diffusion Transformer computations on sparse data, achieving up to 9.6× speedup in training. The unified Sparse VAE architecture maintains a consistent sparse volumetric format across input, latent, and output stages, significantly improving efficiency and stability.
CosXL
Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI, https://huggingface.co/spaces/multimodalart/cosxl
flashdiffusion
Accelerating any conditional diffusion model for few steps image generation https://gojasper.github.io/flash-diffusion-project/
VyvoTTS LFM2
High-quality Text-to-Speech powered by VyvoTTS LFM2 model with easy-to-use web interface
