Pinokio

Launcher updates

DocToSpeech

@c0m3b4ck10h ago

About the app - 18-07-2026

The app is made so that you don't have to go through these annoying online converters to get a text file for ...

Maestro

@blizaine21h ago

Maestro v1.3.0 is out: SCAIL-2 character animation, 100% LOCAL, FREE & EASY!

(NEW) "Recast": swap anyone in a video for your own character. Drop a clip, type who to replace ("the woman",...

Underfit

@cocktailpeanut2d ago

Train StableAudio 3 on your Mac with Underfit!

Underfit has shipped MLX support, and now Mac users can train their own StableAudio3 Loras! https://github.co...

Bonsai Demo

@godwish3d ago

PrismML 8B,27b, Bonsai, Ternary

Test New Bonsai 27B

Wan2GP - AMD

@morpheus4d ago

Improved GPU detection

Formerly, if someone had an IGPU and a dedicated GPU from AMD, the GPU detection failed. Pinokio 8 allows us ...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Higgs Audio TTS

Paxurux/higgs-audio-v2-uiv3.7updated 10mo ago

Higgs Audio Text-to-Speech Playground (Requires Python 3.10+)

0 check-insNVIDIAAMDApple

FaceFusion

alexwyattdev/facefusion-pinokiov1.5updated 10mo ago

Industry leading face manipulation platform

0 check-insNVIDIAAMDApple

Stable Diffusion web UI

cocktailpeanutlabs/automatic1111v1.1updated 10mo ago

One-click launcher for Stable Diffusion web UI (AUTOMATIC1111/stable-diffusion-webui)

1 check-inNVIDIAAMDApple

Allegro TI2V (Pinokio)

MaximilianGardiewski/allegro-ti2v.pinokiov1.0.0updated 10mo ago

Text+Image → Video with Allegro-TI2V (Rhymes AI), local one-click via Pinokio

0 check-insNVIDIAAMDApple

Re-Size-Image-Outpaint-app

SUP3RMASS1VE/Re-Size-Image-Outpaintv3.7updated 10mo ago

A powerful tool for extending images to different aspect ratios using Stable Diffusion XL.

@sup3rmass1ve0 check-insNVIDIAAMDApple

YuE-UI

liinlin88888-bot/YuE-UIupdated 10mo ago

Gradio UI for YuE music generation model

1 check-inNVIDIAAMDApple

YuE (for Windows, tuned for RTX 4060 Ti 16GB)

liinlin88888-bot/YuE-for-windowsv0.2updated 10mo ago

Pinokio app to install and run sdbds/YuE-for-windows, tuned defaults for a single RTX 4060 Ti 16GB GPU. Uses Torch 2.5.1+cu124 and requirements-uv.txt.

0 check-insNVIDIAAMDApple

ComfyUI

cocktailpeanutlabs/comfyuiv1.3updated 10mo ago

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface https://github.com/comfyanonymous/ComfyUI

2 check-insNVIDIAAMDApple

DreamO

SUP3RMASS1VE/DreamOv3.7updated 10mo ago

DreamO: A Unified Framework for Image Customization

@sup3rmass1ve0 check-insNVIDIAAMDApple

AIraoke

TheAwaken1/AIraoke-Pinokiov2.0updated 10mo ago

Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.

@theawakenone

1 check-inNVIDIAAMDApple

DetailGen3D

Deathdadev/DetailGen3Dv3.7updated 10mo ago

@death0 check-insNVIDIAAMDApple

Dough Pinokio

banodoco/Dough-pinokiov1updated 10mo ago

Dough is a open source tool for steering AI animations with precision

0 check-insNVIDIAAMDApple

fluxgym

huqianghui/fluxgymv3.2updated 10mo ago

[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)

0 check-insNVIDIAAMDApple

Wav2Lip (macOS Intel, CPU)

kmrvitrine-lab/wav2lip-pinokiov1.0.0updated 10mo ago

Lip-sync vidéo avec Wav2Lip en CPU sur macOS (Intel)

1 check-inNVIDIAAMDApple

Bolt.new

gotoolkits/bolt.newv2.0updated 10mo ago

1 check-inNVIDIAAMDApple

vevo-gui

chameleon-ai/vevo-pinokiov3.2updated 10mo ago

0 check-insNVIDIAAMDApple

Realtime-Transcription

SUP3RMASS1VE/Realtime-Transcriptionv3.6updated 10mo ago

Real Time Speech Transcription

@sup3rmass1ve0 check-insNVIDIAAMDApple

llamacpp

cocktailpeanut/llamacpp.pinokioupdated 10mo ago

Port of Facebook's LLaMA model in C/C++

@cocktailpeanut1 check-inNVIDIAAMDApple

CC Fee Letter Agent

0xyacob/CCFeeAgentv3.7.0updated 10mo ago

Professional fee letter generation and email automation for CC Growth EIS Fund. Automatically generates and sends professional fee letters via Microsoft Graph API with Excel data integration.

1 check-inNVIDIAAMDApple

MeloTTS

cocktailpeanutlabs/melottsv1.2updated 11mo ago

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean https://github.com/myshell-ai/MeloTTS

#ai #tts

2 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5ChatTTS

A generative speech model for daily dialogue.

#6GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#7Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#8openmed

open-source healthcare ai

#9diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Latest posts from the community.

Missing MIDI results

@jacekmachura · RC Stable Audio Tools

What happened? It works fine except it doesn't generate MIDI files Steps to reproduce 1. Click Genera...

installation fails

@biomaurone · Forge Neo

Not installing python 3.11 Steps to reproduce 1. installation Your system (OS / GPU / RAM / VRAM / et...

Downloads slow as hell

@biomaurone · Maestro

A continuous and tedious download suspension INFO: 127.0.0.1:39536 - "GET /api/v1/system-stats HTTP/1...

DocToSpeech: pdftotext.cpp(3): fatal error C1083: �� 㤠�� 䠩� ��祭��: poppler/cpp/poppler-document.h…

@evgrizli · DocToSpeech1

App: DocToSpeech (DocToSpeech-Pinokio.git) Repo: https://github.com/C0m3b4ck/DocToSpeech-Pinokio.git ...

Wan2GP: python: can't open file 'C:\\pinokio\\api\\wan.git\\app\\wgp.py': [Errno 2] No such file or dir…

@johnsonlazarus · Wan2GP

App: Wan2GP (wan.git) Repo: https://github.com/pinokiofactory/wan.git Generated: 2026-07-18T09:50:44....

Global radar

Projects people are discovering or following now.

Followed1 min

Maestro

An all-in-one, 100% local AI video, image & music studio. Its Director mode turns a single prompt into a full music video or short film — LLM-planned, shot by shot. Built on the WanGP pipeline (Wan 2.1/2.2, LTX-2.3, Qwen, Hunyuan Video, Flux). Requires an NVIDIA GPU (6GB+ VRAM).

Followed4 min

Wan 2.1

[NVIDIA ONLY] Super Optimized Gradio UI for Wan2.1 video for GPU poor machines (5GB+ VRAM). Generate up to 12 sec videos https://github.com/deepbeepmeep/Wan2GP

Followed5 min

Open WebUI

User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui

Followed7 min

Pocket TTS Studio

Professional text-to-speech with voice cloning — powered by Kyutai Pocket TTS

Followed7 min

Underfit

LoRA fine-tuning dashboard for Stable Audio 3

Launcher updates

Store