CogView4, CogView3-Plus, and CogView3 (ECCV 2024)
Check-ins: No check-ins yet
Platforms
GPU: NVIDIA, AMD, Apple
chucuoi1/F5-TTS (GitHub repository)
GPU Poor Version of Hunyuan3D-2
Open-Sora: Democratizing Efficient Video Production for All
lossless-group/lossless-data: A repo of tons of data generated by AI labs, feeding into site.
AudioSep (Featured)
Separate Anything You Describe (https://huggingface.co/spaces/Audio-AGI/AudioSep)
Turn any video into Openpose video https://huggingface.co/spaces/fffiloni/video2openpose2
dreamtalk (Featured)
When Expressive Talking Head Generation Meets Diffusion Probabilistic Models (https://github.com/ali-vilab/dreamtalk)
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation https://github.com/cumulo-autumn/StreamDiffusion
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID
[NVIDIA GPU ONLY] Unofficial Implementation of Animate Anyone https://github.com/MooreThreads/Moore-AnimateAnyone
[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size) https://github.com/sdbds/Moore-AnimateAnyone-for-windows
An Astro SSG on LLM content automation (mpstaton/lossless-public on GitHub).
Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt, and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
vid2pose (Featured)
Video to Openpose & DWPose (All OS supported) https://github.com/sdbds/vid2pose
InstantID (Featured)
A state-of-the-art, tuning-free method for ID-preserving generation from only a single image, supporting various downstream tasks. https://instantid.github.io/
PCM (Featured)
Phased Consistency Model: generate high-quality images in 2 steps https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM
BRIA RMBG (Featured)
Background removal model developed by BRIA.AI, trained on a carefully selected dataset and available as an open-source model for non-commercial use https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and WAV file maintenance. It can also be used with third-party software via JSON calls.