Pinokio

Launcher updates

DocToSpeech

@c0m3b4ck7h ago

About the app - 18-07-2026

The app is made so that you don't have to go through these annoying online converters to get a text file for ...

Maestro

@blizaine17h ago

Maestro v1.3.0 is out: SCAIL-2 character animation, 100% LOCAL, FREE & EASY!

(NEW) "Recast": swap anyone in a video for your own character. Drop a clip, type who to replace ("the woman",...

Underfit

@cocktailpeanut1d ago

Train StableAudio 3 on your Mac with Underfit!

Underfit has shipped MLX support, and now Mac users can train their own StableAudio3 Loras! https://github.co...

Bonsai Demo

@godwish3d ago

PrismML 8B,27b, Bonsai, Ternary

Test New Bonsai 27B

Wan2GP - AMD

@morpheus4d ago

Improved GPU detection

Formerly, if someone had an IGPU and a dedicated GPU from AMD, the GPU detection failed. Pinokio 8 allows us ...

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

diffusers-image-fill

pinokiofactory/diffusers-image-fillv3.7updated 1mo ago

Remove objects from an image https://huggingface.co/spaces/OzzyGT/diffusers-image-fill

#ai #image-edit

0 check-insNVIDIAAMDApple

zonos

pinokiofactory/zonosv3.7updated 1mo ago

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos

#ai #tts

6 check-insNVIDIAAMDApple

Dia

pinokiofactory/diav3.7updated 1mo ago

Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia

#ai #tts

0 check-insNVIDIAAMDApple

Euraika Avatar Studio

Euraika-Labs/duix-avatar-pinokiov7.0updated 1mo ago

Local-first AI avatar video studio powered by duixcom/Duix-Avatar, Docker, and a consent-aware browser studio.

0 check-insNVIDIAAMDApple

ComfyComfyUI

drago87/ComfyComfyUIv7.0updated 1mo ago

A web control panel for ComfyUI

@drago870 check-insNVIDIAAMDApple

Whisper-WebUI

6Morpheus6/whisper-webuiv3.7updated 1mo ago

A Web UI for easy subtitle using whisper model.

@morpheus

2 check-insNVIDIAAMDApple

e2-f5-tts

pinokiofactory/e2-f5-ttsv3.7updated 1mo ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS

#tts #voice-clone #ai

16 check-insNVIDIAAMDApple

IndexTTS-2

6Morpheus6/IndexTTS2v3.7updated 1mo ago

Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application

@morpheus

1 check-inNVIDIAAMDApple

Sam3D

6Morpheus6/Sam3D-bodyv3.7updated 1mo ago

Create 3D Meshes of Body Poses from Images.

#3d

@morpheus

1 check-inNVIDIAAMDApple

Lens

tehmod/lens-pinokiov7.0updated 1mo ago

Unofficial Pinokio launcher for Microsoft Lens text-to-image inference. Tested on Linux with an RTX 5090.

1 check-inNVIDIAAMDApple

T2I-L2P

tehmod/t2i-l2p-pinokiov7.0updated 1mo ago

L2P pixel-space text-to-image generation demo

1 check-inNVIDIAAMDApple

SRT 字幕校正（LM Studio）

vincentchiou/srt-correctionv2.0updated 1mo ago

使用本地 LM Studio AI 免費校正 ASR 課程字幕，支援 PDF 參考資料，不需 API Key

0 check-insNVIDIAAMDApple

X-Voice

6Morpheus6/X-Voicev5.0updated 1mo ago

X-Voice is a multilingual text-to-speech system that enables one speaker to speak 27 languages.

@morpheus

2 check-insNVIDIAAMDApple

Kokoro-FastAPI

6Morpheus6/Kokoro-FastAPIv3.7updated 1mo ago

A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.

@morpheus

1 check-inNVIDIAAMDApple

OpenClaw (aka ClawdBot)

stoutimon/stoutimon-openclaw.pinokiov1.0.1updated 1mo ago

The AI that actually does things https://openclaw.ai

1 check-inNVIDIAAMDApple

3D Gen Studio

hoodtronik/3DGenStudio-pinokiov7.0updated 1mo ago

Local web UI for orchestrating 3D generation pipelines via ComfyUI / Tripo / Tencent. https://github.com/visualbruno/3DGenStudio

@hoodtronik0 check-insNVIDIAAMDApple

NeuralSampling / LatentGranular

JbfProductions/latentgranular-pinokiov7.0updated 1mo ago

Local Pinokio launcher for naotokui/latentgranular.

0 check-insNVIDIAAMDApple

Gitea

lemanschik/gitea.pinokiov1.0updated 1mo ago

A lightweight, painless, self-hosted Git service written in Go. Installs and launches in seconds, completely offline and locally.

0 check-insNVIDIAAMDApple

Moondream3 Gradio UI

mikecastrodemaria/moondream-3-improvedv5.0updated 1mo ago

A web interface for the Moondream3 vision-language model featuring image captioning, visual question answering, object detection, and object pointing.

@supersoniquestudio

2 check-insNVIDIAAMDApple

glitchframe

OlaProeis/Glitchframev3.7updated 2mo ago

Local GPU-accelerated music video generator: Gradio UI, analysis, SDXL backgrounds, NVENC output.

0 check-insNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5ChatTTS

A generative speech model for daily dialogue.

#6GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#7Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#8openmed

open-source healthcare ai

#9diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Latest posts from the community.

installation fails

@biomaurone · Forge Neo

Not installing python 3.11 Steps to reproduce 1. installation Your system (OS / GPU / RAM / VRAM / et...

Downloads slow as hell

@biomaurone · Maestro

A continuous and tedious download suspension INFO: 127.0.0.1:39536 - "GET /api/v1/system-stats HTTP/1...

DocToSpeech: pdftotext.cpp(3): fatal error C1083: �� 㤠�� 䠩� ��祭��: poppler/cpp/poppler-document.h…

@evgrizli · DocToSpeech1

App: DocToSpeech (DocToSpeech-Pinokio.git) Repo: https://github.com/C0m3b4ck/DocToSpeech-Pinokio.git ...

Wan2GP: python: can't open file 'C:\\pinokio\\api\\wan.git\\app\\wgp.py': [Errno 2] No such file or dir…

@johnsonlazarus · Wan2GP

App: Wan2GP (wan.git) Repo: https://github.com/pinokiofactory/wan.git Generated: 2026-07-18T09:50:44....

Wan 2.1: TypeError: cannot unpack non-iterable NoneType object

@ridwan · Wan 2.1

App: Wan 2.1 (For-Gemini.git) Repo: https://github.com/remphanstar/For-Gemini.git Generated: 2026-07-...

Global radar

Projects people are discovering or following now.

Followed4 min

browser-use

Run AI Agent in your browser. https://github.com/browser-use/web-ui

Followed5 min

Maestro

An all-in-one, 100% local AI video, image & music studio. Its Director mode turns a single prompt into a full music video or short film — LLM-planned, shot by shot. Built on the WanGP pipeline (Wan 2.1/2.2, LTX-2.3, Qwen, Hunyuan Video, Flux). Requires an NVIDIA GPU (6GB+ VRAM).

Followed7 min

Wan 2.1

[NVIDIA ONLY] Super Optimized Gradio UI for Wan2.1 video for GPU poor machines (5GB+ VRAM). Generate up to 12 sec videos https://github.com/deepbeepmeep/Wan2GP

Followed9 min

Ultimate-TTS-Studio

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

Followed11 min

Wan2GP

Super Optimized Gradio UI for AI video creation for GPU poor machines (6GB+ VRAM). Supports Wan 2.1/2.2, Qwen, Hunyuan Video, LTX Video and Flux. https://github.com/deepbeepmeep/Wan2GP

Launcher updates

Store