Pinokio

Launcher updates

DocToSpeech

@c0m3b4ck · 7h ago

About the app - 18-07-2026

The app is made so that you don't have to go through these annoying online converters to get a text file for ...

Maestro

@blizaine · 18h ago

Maestro v1.3.0 is out: SCAIL-2 character animation, 100% LOCAL, FREE & EASY!

(NEW) "Recast": swap anyone in a video for your own character. Drop a clip, type who to replace ("the woman",...

Underfit

@cocktailpeanut · 1d ago

Train StableAudio 3 on your Mac with Underfit!

Underfit has shipped MLX support, so Mac users can now train their own StableAudio 3 LoRAs! https://github.co...

Bonsai Demo

@godwish · 3d ago

PrismML 8B, 27B, Bonsai, Ternary

Test the new Bonsai 27B

Wan2GP - AMD

@morpheus · 4d ago

Improved GPU detection

Previously, GPU detection failed if a system had both an iGPU and a dedicated GPU from AMD. Pinokio 8 allows us ...

RVC

cocktailpeanut/rvc.pinokio · v3.7 · updated 15d ago

1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

#rvc #voice-clone #voice #ai

@cocktailpeanut

16 check-ins · NVIDIA · AMD · Apple

BlueMagpie-TTS

openformosa/bluemagpie-tts · updated 16d ago

BlueMagpie-TTS

0 check-ins · NVIDIA · AMD · Apple

Higgs Audio v3 TTS

PierrunoYT/HiggsAudioV3-Pinokio · v7.0 · updated 16d ago

Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.

@pierrunoyt · 1 check-in · NVIDIA · AMD · Apple

OmniVoice

PierrunoYT/OmniVoice-Pinokio · v5.0 · updated 16d ago

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

@pierrunoyt · 5 check-ins · NVIDIA · AMD · Apple

Transcribr

PierrunoYT/Transcribr-Pinokio · v5.0 · updated 16d ago

Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.

@pierrunoyt · 0 check-ins · NVIDIA · AMD · Apple

PersonaPlex

PierrunoYT/PersonaPlex-Pinokio · v5.0 · updated 16d ago

🗣️ PersonaPlex - NVIDIA's real-time speech-to-speech conversational AI model. Natural full-duplex conversations with customizable personas and voices.

@pierrunoyt · 3 check-ins · NVIDIA · AMD · Apple

Anima-Standalone-Trainer

gazingstars123/anima-standalone-trainer · updated 16d ago

Standalone Anima LoRA trainer with GUI

0 check-ins · NVIDIA · AMD · Apple

DavidAU/Qwen3.5-9B-Claude-4.6-HighIQ-THINKING-HERETIC-UNCENSORED · Hugging Face

huggingface.co/davidau/qwen3.5-9b-claude-4.6-highiq-thinking-heretic-uncensored

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

0 check-ins · NVIDIA · AMD · Apple

StyleTTS2 Studio - a Hugging Face Space by Wismut

huggingface.co/spaces/wismut/styletts2_studio

Build custom voices in StyleTTS 2

0 check-ins · NVIDIA · AMD · Apple

CLIProxyAPI

router-for-me/cliproxyapi · updated 16d ago

Wrap Antigravity, ChatGPT Codex, Claude Code, and Grok Build as an OpenAI/Gemini/Claude/Codex-compatible API service, giving you access to the free Gemini 3.1 Pro, GPT 5.5, Grok 4.3, and Claude models through an API

0 check-ins · NVIDIA · AMD · Apple

Fizgig

shootthesound/fizgig · updated 16d ago

Krea 2 & Klein 9B LoRA Studio — train, profile, repair, and extract Krea 2 & Flux 2 Klein 9B LoRAs

0 check-ins · NVIDIA · AMD · Apple

GPT-SoVITS

RVC-Boss/GPT-SoVITS · updated 16d ago

As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).

0 check-ins · NVIDIA · AMD · Apple

BadToBest/EchoMimicV2 · Hugging Face

huggingface.co/badtobest/echomimicv2

0 check-ins · NVIDIA · AMD · Apple

Argus

realsee-developer/argus · updated 17d ago

[ECCV 2026] Argus: Metric Panoramic 3D Reconstruction for Indoor Scenes

0 check-ins · NVIDIA · AMD · Apple

SearXNG

cocktailpeanut/searxng.pinokio · v7.0 · updated 17d ago

A privacy-respecting metasearch engine that runs locally.

#search #searchengine

@cocktailpeanut

4 check-ins · NVIDIA · AMD · Apple

Audiochunker

manat0912/audiochunker · v5.0 · updated 17d ago

Slice any audio file into fixed-length (10-60s) clips and export them all at once. Powered by ffmpeg.

@manatheturipa

1 check-in · NVIDIA · AMD · Apple

OBLITERATUS/Gemma-4-12B-OBLITERATED · Hugging Face

huggingface.co/obliteratus/gemma-4-12b-obliterated

0 check-ins · NVIDIA · AMD · Apple

Ollama Chat

dogman189/astral · v7.0 · updated 17d ago

Electron chat UI for local LLMs via Ollama

0 check-ins · NVIDIA · AMD · Apple

Wan2.2 14B Fast Preview - a Hugging Face Space by EldMans

huggingface.co/spaces/eldmans/wan2.2_14b_i2v_480p_lightning_nsfw_diffusers

Generate a video from an image with a text prompt.

0 check-ins · NVIDIA · AMD · Apple

FunASR

modelscope/funasr · updated 18d ago

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

0 check-ins · NVIDIA · AMD · Apple

Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

#5 ChatTTS

A generative speech model for daily dialogue.

#6 GitHub - rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

#7 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#8 openmed

Open-source healthcare AI

#9 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#10 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan, and Andrea Vedaldi.

Global feed

Latest posts from the community.

installation fails

@biomaurone · Forge Neo

Not installing python 3.11 Steps to reproduce 1. installation Your system (OS / GPU / RAM / VRAM / et...

Downloads slow as hell

@biomaurone · Maestro

A continuous and tedious download suspension INFO: 127.0.0.1:39536 - "GET /api/v1/system-stats HTTP/1...

DocToSpeech: pdftotext.cpp(3): fatal error C1083: Cannot open include file: poppler/cpp/poppler-document.h…

@evgrizli · DocToSpeech

App: DocToSpeech (DocToSpeech-Pinokio.git) Repo: https://github.com/C0m3b4ck/DocToSpeech-Pinokio.git ...

Wan2GP: python: can't open file 'C:\\pinokio\\api\\wan.git\\app\\wgp.py': [Errno 2] No such file or dir…

@johnsonlazarus · Wan2GP

App: Wan2GP (wan.git) Repo: https://github.com/pinokiofactory/wan.git Generated: 2026-07-18T09:50:44....

Wan 2.1: TypeError: cannot unpack non-iterable NoneType object

@ridwan · Wan 2.1

App: Wan 2.1 (For-Gemini.git) Repo: https://github.com/remphanstar/For-Gemini.git Generated: 2026-07-...

Global radar

Projects people are discovering or following now.

Followed just now

BHS Ultimate HeadSwap & FaceSwap

Based on BFS - Best Face Swap, VisoMaster, and SwapAnyHead.

Followed 1 min ago

OpenAudio

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech

Followed 3 min ago

Ultimate-TTS-Studio

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app

Followed 4 min ago

ChatterBox

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.

Followed 4 min ago

Wan 2.1

[NVIDIA ONLY] Super Optimized Gradio UI for Wan2.1 video for GPU poor machines (5GB+ VRAM). Generate up to 12 sec videos https://github.com/deepbeepmeep/Wan2GP
