Pinokio
Explore tags
Bark Voice Cloning
https://github.com/cocktailpeanut/bark.pinokioupdated 6/17/2024, 5:30:55 PMindexed 1/23/2026, 7:46:14 PM
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
AudioLDM 2
https://github.com/cocktailpeanut/AudioLDM2.pinokioupdated 6/17/2024, 5:30:43 PMindexed 1/23/2026, 7:46:55 PM
[Nvidia GPU only] One click installer for AudioLDM 2 Gradio UI
ModelScope Video2Video (Nvidia GPU only)
https://github.com/cocktailpeanut/ms-video2video.pinokioupdated 6/12/2024, 4:50:32 AMindexed 1/23/2026, 7:47:05 PM
enhance the resolution and spatiotemporal continuity of text-generated videos and image-generated videos
Xorbits Inference
https://github.com/cocktailpeanut/xinference.pinokioupdated 6/11/2024, 7:05:48 AMindexed 1/23/2026, 7:45:25 PM
LLM Web UI and API
Rope
https://github.com/Hillobar/Ropeupdated 5/27/2024, 7:46:33 PMindexed 1/27/2026, 6:36:48 PM
GUI-focused roop
paligemma
https://github.com/cocktailpeanutlabs/paligemmav1.5updated 5/15/2024, 6:31:03 PMindexed 1/20/2026, 9:13:29 AM
an open vision-language model by Google. PaliGemma is designed as a versatile model for transfer to a wide range of vision-language tasks such as image and short video caption, visual question answering, text reading, object detection and object segmentation https://huggingface.co/spaces/google/paligemma
sillytavern-pinokio
https://github.com/supersonic13/sillytavern-pinokiov1.5updated 5/14/2024, 8:46:04 AMindexed 1/23/2026, 7:48:37 PM
Brought to you by Cohee, RossAscends, and the SillyTavern community, SillyTavern is a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters.
InvokeAI
https://github.com/cocktailpeanutlabs/invokeaiv1.1updated 5/10/2024, 7:10:52 AMindexed 1/20/2026, 9:14:59 AM
Generative AI for Professional Creatives
GitHub - VHDsdk2/Fooocus-pony-diffusion-v6-xl: fooocus but with pony diffusion (mainly for colab)
https://github.com/VHDsdk2/Fooocus-pony-diffusion-v6-xlupdated 5/9/2024, 5:03:30 PMindexed 1/26/2026, 11:12:02 PM
fooocus but with pony diffusion (mainly for colab) - VHDsdk2/Fooocus-pony-diffusion-v6-xl
ConsistentID
https://github.com/Feedjer/ConsistentID.pinokiov1.5updated 5/5/2024, 6:42:02 PMindexed 1/23/2026, 7:48:38 PM
Customized ID Consistent for human: https://github.com/JackAILab/ConsistentID
Singing_SongStarter
https://github.com/Shahnab/singing-songstarter.pinokiov1.5updated 4/23/2024, 2:31:23 PMindexed 1/23/2026, 7:48:41 PM
XTTS-RVC
https://github.com/Shyk92/XTTS-RVC-UI.pinokioupdated 4/13/2024, 6:46:21 PMindexed 1/23/2026, 7:45:39 PM
A Gradio UI for XTTSv2 and RVC, allowing for real-time voice conversion.
Real-ESRGAN
https://github.com/xinntao/Real-ESRGANupdated 4/2/2024, 4:39:11 PMindexed 1/28/2026, 1:04:36 PM
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Fooocus
https://github.com/cocktailpeanut/fooocus.pinokioupdated 4/1/2024, 5:10:12 PMindexed 1/23/2026, 7:46:36 PM
Minimal Stable Diffusion UI
roop
https://github.com/s0md3v/roopupdated 3/25/2024, 4:01:32 AMindexed 1/31/2026, 11:06:05 PM
one-click face swap
SuperPrompter
https://github.com/supersonic13/superprompter-pinokiov1.0updated 3/22/2024, 11:24:09 AMindexed 1/23/2026, 7:48:42 PM
SuperPrompter is a Python-based application that utilises the SuperPrompt-v1 model to generate optimised text prompts for AI/LLM image generation (for use with Stable Diffusion etc...) from user prompts.
OneTrainer
https://github.com/supersonic13/onetrainer-pinokiov1.2updated 3/14/2024, 11:49:48 AMindexed 1/23/2026, 7:48:43 PM
The script utilizes various deep learning models to create detailed character cards, including names, summaries, personalities, greeting messages, and character avatars.
FaceFusion 2.3.0
https://github.com/ngoqquyen/facefusion-pinokiov1updated 3/14/2024, 6:03:00 AMindexed 1/23/2026, 7:48:44 PM
Next generation face swapper and enhancer
GitHub - blazzbyte/OpenInterpreterUI: Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners.
https://github.com/blazzbyte/OpenInterpreterUIupdated 3/3/2024, 11:50:33 PMindexed 1/27/2026, 2:55:26 PM
Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners. - blazzbyte/Open...
Realtime BakLLaVA
https://github.com/cocktailpeanut/bakllava.pinokioupdated 2/26/2024, 10:51:32 PMindexed 1/23/2026, 7:44:47 PM
llama.cpp with BakLLaVA model describes what does it see (https://github.com/Fuzzy-Search/realtime-bakllava)
PreviousPage 38 / 40Next