Explore tags
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
Contribute to sukebenet/instruct-pix2pix development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Contribute to andrewyng/translation-agent development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Fake Cam is a Python application that simulates a virtual camera by broadcasting images or videos from your computer. It utilizes OpenCV for video/image capture and manipulation, and PyVirtualCam to create a virtual camera device that can be used in various applications.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
A webui for propainter. Easily pick up objects from the video and eliminate them.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
[NVIDIA ONLY] Stable Video Diffusion Streamlit App. Currently supports Nvidia GPU machines only.
Newer GUI version available at https://github.com/Michael-Sebero/PrivateGPT4Linux
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Contribute to AIFSH/ComfyUI-Hallo development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Declaratively define and modify agents and multi-agent workflows through a point and click, drag and drop interface (e.g., you can select the parameters of two agents that will communicate to solve your task).
PyTorch implementation of Real-ESRGAN model
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Turn any image into a video! (Web UI created by fffiloni: https://huggingface.co/spaces/fffiloni/MS-Image2Video)
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
[Nvidia GPU only] One click installer for AudioLDM 2 Gradio UI
Contribute to camenduru/Lumina-Next-T2I-hf development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
This codebase is for a React and Electron-based app that executes the FreedomGPT LLM locally (offline and private) on Mac and Windows using a chat-based interface
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple