Store
Explore tags
diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of https://huggingface.co/spaces/multimodalart/face-to-all
Collection of the best Applio plugins.
Fair-code workflow automation platform with 400+ integrations and native AI capabilities
Added support for russian language in train/inference scripts + example of train 60 hours
[NVIDIA ONLY] Gradio demo for Flux Kontext based on Diffusers with single and multiple images.
AudioX Diffusion Transformer for Anything-to-Audio Generation
Contribute to remphanstar/long-nose development by creating an account on GitHub.
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters. https://docs.sillytavern.app/
[ICCV 2025] STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
ACE-Step: A Step Towards Music Generation Foundation Model
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM
Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper
Unified Image Understanding and Generation. Text-to-Image Generation, In-context Generation, Instruction-guided Image Editing, Visual Understanding (Minimum Requirements 12GBV RAM / 48GB RAM, Recommended Requirements 24GB VRAM / 32GB RAM)
