Store
Explore tags
A Gradio UI for XTTSv2 and RVC, allowing for real-time voice conversion.
SuperPrompter is a Python-based application that utilises the SuperPrompt-v1 model to generate optimised text prompts for AI/LLM image generation (for use with Stable Diffusion etc...) from user prompts.
The script utilizes various deep learning models to create detailed character cards, including names, summaries, personalities, greeting messages, and character avatars.
Next generation face swapper and enhancer
llama.cpp with BakLLaVA model describes what does it see (https://github.com/Fuzzy-Search/realtime-bakllava)
Stable Diffusion UI with patches by lllyasviel
Flexible Automapper for Beatsaber made for any difficulty
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1

Massively Multilingual Speech (MMS): Persian Text-to-Speech

Efficiently separate audio tracks with Spleeter
Next generation face swapper and enhancer
Unlock the new experience of Housing App Android Setup with Automation using Pinokio
A Realtime Creation Engine
Next generation face swapper and enhancer

Open Vocabulary Image Segmentation using Segment Anything Model and MetaCLIP combo
3D Gaussian Splatting for Real-Time Radiance Field Rendering
DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Next generation face swapper and enhancer
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing https://rese1f.github.io/StableVideo/