Explore tags
A multi-voice TTS system trained with an emphasis on quality
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Search the web and your self-hosted apps using local AI agents.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
InstantIR: Blind Image Restoration with Instant Generative Reference 🔥
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Foundational Models for State-of-the-Art Speech and Text Translation
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
Amphion: An Open-Source Audio, Music, and Speech Generation Toolkit: https://github.com/open-mmlab/Amphion
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+ Storage -- Read the README for more!
Text-to-Speech for languages of India
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
HallucinatorFeatured
[NVIDIA ONLY] Autocomplete any voice(s), powered by Hertz AI (Standard Intelligence)
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
real time face swap and one-click video deepfake with only a single image
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Fooocus web UI for Stable Diffusion
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Discover scalable Generative AI and LLM projects for innovative NLP applications, focusing on language understanding and transformation. - adityapatils/LLM-GEN-AI
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple