Pinokio

Launcher updates

DocToSpeech

@c0m3b4ck · 1h ago

About the app - 18-07-2026

The app is made so that you don't have to go through these annoying online converters to get a text file for ...

Maestro

@blizaine · 12h ago

Maestro v1.3.0 is out: SCAIL-2 character animation, 100% LOCAL, FREE & EASY!

(NEW) "Recast": swap anyone in a video for your own character. Drop a clip, type who to replace ("the woman",...

Underfit

@cocktailpeanut · 1d ago

Train StableAudio 3 on your Mac with Underfit!

Underfit has shipped MLX support, so Mac users can now train their own StableAudio 3 LoRAs! https://github.co...

Bonsai Demo

@godwish · 3d ago

PrismML 8B, 27B, Bonsai, Ternary

Test New Bonsai 27B

Wan2GP - AMD

@morpheus · 4d ago

Improved GPU detection

Previously, GPU detection failed on systems that had both an iGPU and a dedicated GPU from AMD. Pinokio 8 allows us ...

FrameCrop

arnold2006/framecrop · v4.0 · updated 14d ago

Batch-crop images to a chosen aspect ratio using a draggable/resizable crop overlay on each image thumbnail.

0 check-ins · NVIDIA · AMD · Apple

SmolLM3-3B Chatbot

PierrunoYT/SmolLM3-3B-Pinokio · v5.0 · updated 14d ago

Advanced 3B parameter language model with Gradio web interface, GPU acceleration, and complete privacy

@pierrunoyt · 2 check-ins · NVIDIA · AMD · Apple

dots.tts-base

PierrunoYT/dots.tts-Pinokio · v5.0 · updated 15d ago

2B-parameter fully continuous, end-to-end autoregressive text-to-speech with zero-shot voice cloning. https://huggingface.co/rednote-hilab/dots.tts-base

@pierrunoyt · 1 check-in · NVIDIA · AMD · Apple

OmniVoice Studio

PierrunoYT/OmniVoice-Studio-Pinokio · v7.0 · updated 15d ago

The open-source ElevenLabs alternative. Local voice cloning, video dubbing, and real-time dictation — 646 languages, no API keys.

@pierrunoyt

2 check-ins · NVIDIA · AMD · Apple

PRX Pixel

PierrunoYT/PRX-Pixel-Pinokio · v5.0 · updated 15d ago

Pixel-space PRX text-to-image pipeline (~7B params, Qwen3-VL text encoder, no VAE)

@pierrunoyt · 1 check-in · NVIDIA · AMD · Apple

MOSS-TTS

PierrunoYT/MossTTS-Pinokio · v5.0 · updated 15d ago

All-in-one Gradio UI for the MOSS-TTS Family: voice cloning, dialogue generation, voice design from text, and sound effects.

@pierrunoyt

4 check-ins · NVIDIA · AMD · Apple

Sana

PierrunoYT/Sana-Pinokio · v5.0 · updated 15d ago

Fast Image Generation with Sana Diffusion Model

@pierrunoyt

2 check-ins · NVIDIA · AMD · Apple

Pocket TTS Studio

aashutoshdahal1/Voice-Clone-Generator · v3.7 · updated 15d ago

Professional text-to-speech with voice cloning — powered by Kyutai Pocket TTS

0 check-ins · NVIDIA · AMD · Apple

Z-Fusion

ai-anchorite/Z-Fusion · v3.7 · updated 15d ago

Z-Image, Flux2 Klein, & SeedVR2 with a Gradio UI. Uses a built-in ComfyUI backend for speed and efficiency! [8GB+VRAM, 16GB+ RAM]

#image-generation

@anchorite

22 check-ins · NVIDIA · AMD · Apple

حامی شبکه ماسو (Masu Network Local)

helpsystem/Masu · updated 15d ago

Fully local and offline version of the Masu Network's secure emergency counseling system for victims of domestic violence

0 check-ins · NVIDIA · AMD · Apple

Adoption & Child Care

Brianmwanza-bit/adoption-and-child-care-main · v7.0 · updated 15d ago

Android app for adoption and child-care management with AI-assisted coding.

0 check-ins · NVIDIA · AMD · Apple

DramaBox

PierrunoYT/DramaBox-TTS-Pinokio · v5.0 · updated 15d ago

Expressive TTS with voice cloning, prompt-driven speech synthesis built on LTX-2.3 by Resemble AI

#ai #tts #voice-clone

@pierrunoyt

5 check-ins · NVIDIA · AMD · Apple

Cohere Transcribe

PierrunoYT/cohere-transcribe-pinokio · v5.0 · updated 15d ago

State-of-the-art open-source speech recognition model supporting 14 languages. 2B parameter ASR model from Cohere Labs.

@pierrunoyt · 1 check-in · NVIDIA · AMD · Apple

ChatterBox

PierrunoYT/chatterbox-tts-pinokio · v5.0 · updated 15d ago

AI-Powered Text-to-Speech with Voice Cloning using Chatterbox TTS and a Gradio interface. Includes Turbo, Multilingual (23+ languages), and Original models. Runs locally; CUDA GPU recommended, CPU supported. Windows, Mac, and Linux.

@pierrunoyt · 0 check-ins · NVIDIA · AMD · Apple

Audio Flamingo 3

PierrunoYT/Audio-Flamingo-3-Pinokio · v7.0 · updated 15d ago

NVIDIA's Audio Flamingo 3 - Large Audio-Language Model for speech, sound, and music understanding with Gradio web interface

@pierrunoyt · 0 check-ins · NVIDIA · AMD · Apple

MLX Media

CharafChnioune/AceJAM-Studio · v7.0 · updated 15d ago

Create songs, albums and artwork locally on Apple MLX with ACE-Step v1.5, MFLUX, local agents and LoRA training.

0 check-ins · NVIDIA · AMD · Apple

RVC

cocktailpeanut/rvc.pinokio · v3.7 · updated 15d ago

1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

#rvc #voice-clone #voice #ai

@cocktailpeanut

16 check-ins · NVIDIA · AMD · Apple

Higgs Audio v3 TTS

PierrunoYT/HiggsAudioV3-Pinokio · v7.0 · updated 15d ago

Pinokio launcher for Higgs Audio v3 TTS with Gradio UI, SGLang-Omni backend, and automatic model download.

@pierrunoyt · 1 check-in · NVIDIA · AMD · Apple

OmniVoice

PierrunoYT/OmniVoice-Pinokio · v5.0 · updated 15d ago

Zero-shot multilingual TTS (600+ languages) with voice cloning and voice design — Gradio UI (app/app.py)

@pierrunoyt · 5 check-ins · NVIDIA · AMD · Apple

Transcribr

PierrunoYT/Transcribr-Pinokio · v5.0 · updated 15d ago

Bulk transcribe many YouTube videos, whole playlists, or your own uploaded audio/video files at once with faster-whisper. Outputs txt, srt, vtt, or json.

@pierrunoyt · 0 check-ins · NVIDIA · AMD · Apple

Most wanted

Follow to get notified when a launcher drops.

#1 pinokio

AI Browser

#2 OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3 Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4 MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large language models.

#5 ChatTTS

A generative speech model for daily dialogue.

#6 rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

#7 Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#8 openmed

Open-source healthcare AI

#9 diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#10 meshflow

Repository for the CVPR 2026 paper "MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer" by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan, and Andrea Vedaldi.

Global feed

Latest posts from the community.

Wan2GP: raise AssertionError("Torch not compiled with CUDA enabled")

@johnsonlazarus · Wan2GP

App: Wan2GP (wan.git) Repo: https://github.com/pinokiofactory/wan.git Generated: 2026-07-18T08:57:20....

About the app - 18-07-2026

@c0m3b4ck · DocToSpeech

The app is made so that you don't have to go through these annoying online converters to get a text f...

ID LoRA support and DramaBox bug fixes

@helenfromua · Maestro

Hi, Maestro community! I recently switched from WanGP to Maestro for a cleaner, no-nonsense interface...

StableDAW: error: Failed to spawn: `uvicorn`

@evgrizli · StableDAW

App: StableDAW (stabledaw.pinokio.git) Repo: https://github.com/cocktailpeanut/stabledaw.pinokio.git ...

TripoSR: raise RuntimeError(f"Patch target not found for {label}: {path}")

@nicedragon9 · TripoSR

App: TripoSR (TripoSR-Pinokio.git) Repo: https://github.com/hoodtronik/TripoSR-Pinokio.git Generated:...

Global radar

Projects people are discovering or following now.

Followed · 3 min

Hunyuan3D-2

[NVIDIA ONLY] Requires 24GB VRAM (use the lowvram option; it has the same quality). High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/Tencent/Hunyuan3D-2

Followed · 4 min

Hunyuan3D-2-LowVRAM

Text/Image to 3D (Cross Platform: Mac + Windows + Linux): High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. https://github.com/deepbeepmeep/Hunyuan3D-2GP

Followed · 8 min

SadTalker

Fast lip-sync application for smaller GPUs.

Followed · 11 min

OpenAudio

Multilingual Text-to-Speech with Voice Cloning (Supports: English, Japanese, Korean, Chinese, French, German, Arabic, and Spanish) https://github.com/fishaudio/fish-speech

Followed · 11 min

Ultimate-TTS-Studio

Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
