Store
Explore tags
A Web UI for easy subtitle using fish-speech model.
Easily train a good VC model with voice data <= 10 mins!
chat-with-mlxFeatured
[Mac Onlyl] An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework. https://github.com/qnguyen3/chat-with-mlx
Search the web and your self-hosted apps using local AI agents.
Foundational Models for State-of-the-Art Speech and Text Translation
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
Amphion: An Open-Source Audio, Music, and Speech Generation Toolkit: https://github.com/open-mmlab/Amphion
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+ Storage -- Read the README for more!
Text-to-Speech for languages of India
HallucinatorFeatured
[NVIDIA ONLY] Autocomplete any voice(s), powered by Hertz AI (Standard Intelligence)
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
real time face swap and one-click video deepfake with only a single image
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Fooocus web UI for Stable Diffusion
Discover scalable Generative AI and LLM projects for innovative NLP applications, focusing on language understanding and transformation. - adityapatils/LLM-GEN-AI
