Pinokio

cocktailpeanut

@cocktailpeanut
46 posts · 53 checkpoints · Joined 1/20/2026, 9:27:14 AM
pinokio creator https://x.com/cocktailpeanut
Creations by @cocktailpeanut
99 total
sdxl turbo · Updated 2 years ago
A Real-Time Text-to-Image Generation Model
Realtime BakLLaVA · Updated 2 years ago
llama.cpp with the BakLLaVA model describes what it sees (https://github.com/Fuzzy-Search/realtime-bakllava)
stable-diffusion-webui-forge · Updated 2 years ago
Contribute to cocktailpeanut/stable-diffusion-webui-forge development by creating an account on GitHub.
Whisper-WebUI · Updated 2 years ago
A Web UI for easy subtitling using the Whisper model.
Moondream1 · Updated 2 years ago
moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVA training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1
Fooocus · Updated 2 years ago
Focus on prompting and generating
FaceFusion · Updated 2 years ago
Next generation face swapper and enhancer
DEUS · Updated 2 years ago
A Realtime Creation Engine
SAM_and_MetaCLIP · Updated 2 years ago
Open Vocabulary Image Segmentation using Segment Anything Model and MetaCLIP combo
kohya_ss · Updated 3 years ago
Contribute to cocktailpeanut/kohya_ss development by creating an account on GitHub.
facefusion · Updated 3 years ago
Next generation face swapper and enhancer
StableVideo · Updated 3 years ago
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing https://rese1f.github.io/StableVideo/
audiocraft · Updated 3 years ago
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Stable Diffusion web UI · Updated 3 years ago
AUTOMATIC1111/stable-diffusion-webui
Audiocraft · Updated 3 years ago
Text to audio, open sourced by Meta