Pinokio
e2-f5-tts
https://github.com/pinokiofactory/e2-f5-tts | v3.7 | updated 1/23/2026, 9:14:27 PM | indexed 1/27/2026, 1:36:05 PM
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
Curated by @pinokio
VibeVoice Realtime
https://github.com/pinokiofactory/vibevoice-realtime | v5.0 | updated 12/22/2025, 10:00:08 PM | indexed 1/20/2026, 9:13:58 AM
Realtime streaming TTS demo using microsoft/VibeVoice-Realtime-0.5B
Curated by @pinokio
Hunyuan3D-2-LowVRAM
https://github.com/pinokiofactory/Hunyuan3d-2-lowvram | v3.7 | updated 12/27/2025, 8:44:51 PM | indexed 1/20/2026, 9:14:34 AM
Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP
Curated by @pinokio
OpenAudio
https://github.com/pinokiofactory/openaudio | v3.7 | updated 1/3/2026, 1:47:18 PM | indexed 1/27/2026, 9:08:41 AM
Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech
Curated by @pinokio
Forge
https://github.com/pinokiofactory/stable-diffusion-webui-forge | v2.0 | updated 1/7/2026, 1:28:44 AM | indexed 1/20/2026, 9:14:53 AM
[NVIDIA ONLY] The most efficient way to run FLUX (Optimized to run even on low memory machines, as low as 3GB VRAM with 512x512 resolution) https://github.com/lllyasviel/stable-diffusion-webui-forge
Curated by @pinokio
Whisper-WebUI
https://github.com/pinokiofactory/whisper-webui | v3.7 | updated 1/20/2026, 11:36:49 PM | indexed 1/23/2026, 7:45:51 PM
A web UI for easy subtitle generation using the Whisper model.
Curated by @pinokio
ComfyUI
https://github.com/pinokiofactory/comfy | v3.7 | updated 1/14/2026, 11:37:40 AM | indexed 1/24/2026, 11:52:22 PM
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
Curated by @pinokio
MagicQuill
https://github.com/pinokiofactory/MagicQuill | v3.7 | updated 1/11/2026, 8:07:39 PM | indexed 1/20/2026, 9:13:50 AM
An intelligent, interactive image editing system. Easily erase and add objects through a user-friendly interface.
Curated by @pinokio
aura-sr-upscaler
https://github.com/pinokiofactory/aura-sr-upscaler | v3.7 | updated 1/13/2026, 4:59:11 PM | indexed 1/20/2026, 9:12:03 AM
AuraSR-v2 - An open reproduction of the GigaGAN Upscaler from fal.ai https://huggingface.co/spaces/gokaygokay/AuraSR-v2
Curated by @pinokio
Wan2GP
https://github.com/pinokiofactory/wan | v3.7 | updated 1/28/2026, 9:41:31 AM | indexed 1/28/2026, 4:16:21 PM
Super-optimized Gradio UI for AI video creation on GPU-poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video, and Flux. https://github.com/deepbeepmeep/Wan2GP
Curated by @pinokio
SongGeneration Studio
https://github.com/BazedFrog/SongGeneration-Studio | v3.7 | updated 1/27/2026, 8:47:53 PM | indexed 1/27/2026, 9:46:50 PM
AI Song Generation with Full Style Control - Generate complete songs with lyrics, vocals, and instrumental tracks using Tencent AI Lab's SongGeneration (LeVo) model. [NVIDIA ONLY]
Curated by @pinokio