Store
Explore tags
A multi-voice TTS system trained with an emphasis on quality
Search the web and your self-hosted apps using local AI agents.
InstantIR: Blind Image Restoration with Instant Generative Reference 🔥
Foundational Models for State-of-the-Art Speech and Text Translation
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
Amphion: An Open-Source Audio, Music, and Speech Generation Toolkit: https://github.com/open-mmlab/Amphion
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+ Storage -- Read the README for more!
Text-to-Speech for languages of India
HallucinatorFeatured
[NVIDIA ONLY] Autocomplete any voice(s), powered by Hertz AI (Standard Intelligence)
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
real time face swap and one-click video deepfake with only a single image
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Fooocus web UI for Stable Diffusion
Discover scalable Generative AI and LLM projects for innovative NLP applications, focusing on language understanding and transformation. - adityapatils/LLM-GEN-AI
Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning
