Store
VyvoTTS LFM2
High-quality text-to-speech powered by the VyvoTTS LFM2 model, with an easy-to-use web interface
Moondream3 Gradio UI
A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.
AudioGradio
One-click installer for the AudioCraft MusicGen and AudioGen Gradio UI (requires at least Pinokio v0.0.56)
IndexTTS-2
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application

ComfyUI Image to 3D
ComfyUI with TRELLIS2, GeometryPack, and UniRig custom nodes for image-to-3D generation
PhotoMaker2
Customizing Realistic Human Photos via Stacked ID Embedding https://huggingface.co/spaces/TencentARC/PhotoMaker-V2
FramePack
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
Audio Flamingo 3
NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface
Umo
Multi-Identity Consistency for Image Customization via Matching Reward https://github.com/bytedance/UMO
SillyTavern Character Generator
A Pinokio script for https://github.com/Tremontaine/character-card-generator
When used with KoboldCPP, use http://localhost:5001/v1, where 5001 is the port KoboldCPP reports at startup.
The Text API Key field must be filled in with something. If it is left empty, the app gives an error, so enter any value.
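The two settings above can be sketched in a few lines of Python. This is only an illustration of the values the note describes, not part of the script itself: the helper names `koboldcpp_base_url` and `text_api_key` and the filler string `"placeholder"` are hypothetical, and the default port 5001 should be replaced with whatever port KoboldCPP actually reports.

```python
from urllib.parse import urlunsplit


def koboldcpp_base_url(port: int = 5001, host: str = "localhost") -> str:
    """Build the /v1 URL for a local KoboldCPP server (hypothetical helper)."""
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    # Assemble scheme, host:port, and the /v1 path; no query or fragment.
    return urlunsplit(("http", f"{host}:{port}", "/v1", "", ""))


def text_api_key(value: str = "", placeholder: str = "placeholder") -> str:
    """Return a non-empty key: the field only has to contain something."""
    value = value.strip()
    return value if value else placeholder


print(koboldcpp_base_url(5001))
print(text_api_key(""))
```

If KoboldCPP reports a different port on startup, pass that number instead of 5001.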
GLM-TTS
Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.