Explore tags
Customized ID Consistent for human: https://github.com/JackAILab/ConsistentID
Stable Diffusion web UI UX: https://github.com/anapnoe/stable-diffusion-webui-ux
Contribute to yohanshin/WHAM development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
A Gradio UI for XTTSv2 and RVC, allowing for real-time voice conversion.
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Swift client for the fal.ai model APIs
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
🔊 Text-Prompted Generative Audio Model
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
one-click face swap
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
SuperPrompter is a Python-based application that utilises the SuperPrompt-v1 model to generate optimised text prompts for AI/LLM image generation (for use with Stable Diffusion etc...) from user prompts.
AnimateDiff for Stable Diffusion WebUI Forge, mirror for https://github.com/continue-revolution/sd-webui-animatediff/tree/forge/master
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
DeOldify for Stable Diffusion WebUI:This is an extension for StableDiffusion's AUTOMATIC1111 web-ui that allows colorize of old photos and old video. It is based on deoldify.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
The script utilizes various deep learning models to create detailed character cards, including names, summaries, personalities, greeting messages, and character avatars.
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Next generation face swapper and enhancer
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners. - blazzbyte/Open...
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple
llama.cpp with BakLLaVA model describes what does it see (https://github.com/Fuzzy-Search/realtime-bakllava)
Contribute to cocktailpeanut/stable-diffusion-webui-forge development by creating an account on GitHub.
Check-insNo check-ins yet
Platforms
GPUNVIDIAAMDApple