Pinokio

Launcher updates

NEXUS OS

@ramshi23h ago

3D Model Generation Implemented

First steps to get 3D model generation working on Nexus OS. Still more work to be done on it, but it's a step...

DocToSpeech (Coqui)

@c0m3b4ck1d ago

Make TTS from scanned documents WITH EASE!

Wanted TTS of an old 60s manual? Or maybe a handwritten poem? Thanks to the addition of pytesseract to the pr...

AgentsView

@cocktailpeanut2d ago

AgentsView: View all your AI agent history in one place.

AI coding agents are great at moving work forward. Remembering where that work happened is harder. A useful f...

Maestro

@blizaine4d ago

Maestro v1.3.0 is out: SCAIL-2 character animation, 100% LOCAL, FREE & EASY!

(NEW) "Recast": swap anyone in a video for your own character. Drop a clip, type who to replace ("the woman",...

Bonsai Demo

@godwish6d ago

PrismML 8B,27b, Bonsai, Ternary

Test New Bonsai 27B

Type:api

Platform:All

GPU:All

Recommended Latest Check-ins

Sort:Latest

Open WebUI

pinokiofactory/open-webuiv3.4.0updated 3mo ago

User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui

#ui #llm #ai

25 check-insNVIDIAAMDApple

VoxCPM 2

PierrunoYT/VoxCPM-1.5-Pinokiov5.0updated 3mo ago

Tokenizer-free TTS for context-aware speech, voice cloning, and voice design. 2B params, 48kHz, 30 languages (Gradio UI).

@pierrunoyt1 check-inNVIDIAAMDApple

ComfyUI

fakumax/comfyui-pinokiov5.0updated 3mo ago

Launcher principal y simple para ComfyUI. Instala ComfyUI + Manager, descarga modelos listos para usar y te guia hacia las plantillas oficiales.

4 check-insNVIDIAAMDApple

OpenClaw + Nerve

neviah/OpenClaw_Nerve_Pinokiov1.0updated 3mo ago

One-click OpenClaw gateway + Nerve launcher with no onboarding prompts.

@ramshi0 check-insNVIDIAAMDApple

Ebook to Audiobook with OmniVoice

quantumlump/Ebook-to-Audiobook-with-OmniVoicev5.0updated 3mo ago

Turn your eBooks into audiobooks using the OmniVoice text-to-speech model

@wildflower

2 check-insNVIDIAAMDApple

LongCat AudioDiT

bbecausereasonss/LongCat-TTS-Pinokiov5.0updated 3mo ago

Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.

@becausereasons

1 check-inNVIDIAAMDApple

LongCat AudioDiT

vdruts/LongCat-TTS-Pinokiov5.0updated 3mo ago

Diffusion-based TTS with zero-shot voice cloning (1B / 3.5B). Upload a voice reference, auto-transcribe, and generate matching speech for video pickups and ADR.

0 check-insNVIDIAAMDApple

Z-Image-Turbo

pinokiofactory/z-image-turbov2.0updated 3mo ago

Simple Gradio app for generating images with Tongyi-MAI/Z-Image-Turbo.

0 check-insNVIDIAAMDApple

LongCat-AudioDiT

r4dius/LongCat-AudioDiT-pinokiov1.0.0updated 3mo ago

Pinokio wrapper for LongCat-AudioDiT with selectable 1B / 3.5B model downloads.

0 check-insNVIDIAAMDApple

BuffedGoose

neviah/BuffedGoosev5.0updated 3mo ago

Goose sidecar dashboard draft with Pinokio onboarding-focused launcher.

@ramshi0 check-insNVIDIAAMDApple

XTTS

cocktailpeanut/xtts.pinokiov3.0updated 3mo ago

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

#ai #tts

@cocktailpeanut1 check-inNVIDIAAMDApple

Open-Hivemind

matthewhand/open-hivemindv1.0updated 3mo ago

Run the Open-Hivemind multi-agent orchestrator locally with Pinokio.

0 check-insNVIDIAAMDApple

Audiobook Studio

senigami/audiobook-studio.pinokiov3.7updated 3mo ago

Local-first AI audiobook production with voice cloning and chapter repair tools. This is the easiest way to install locally, including an optional demo voice library so you can start exploring right away. Live demo: senigami.github.io/audiobook-studio

@senigami

8 check-insNVIDIAAMDApple

Transcribe Studio

PierrunoYT/Download-Transcribe-Translate-Pinokiov5.0updated 3mo ago

YouTube to MP3, Cohere transcription, TranslateGemma translation.

@pierrunoyt0 check-insNVIDIAAMDApple

GLM-TTS

6morpheus6/glm-tts-pinokiov1.0.0updated 3mo ago

🎙️ Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning. High-quality text-to-speech synthesis supporting zero-shot voice cloning and streaming inference with natural emotional expression.

@morpheus0 check-insNVIDIAAMDApple

Video Dubbing Pipeline

Paxurux/Videodubbing-with-geminiv3.7updated 3mo ago

🎬 Professional Video Dubbing Pipeline with Parakeet-TDT-0.6b-v2, Gemini AI, and Edge TTS. Complete solution for automated video dubbing with step-by-step processing and batch video creation from multiple audio files.

2 check-insNVIDIAAMDApple

FunClip Auto

MiguelPR-99/FunClip-Streamer-Tweakedv1.0updated 3mo ago

Open-source, accurate and easy-to-use video clipping tool by Alibaba ModelScope.

0 check-insNVIDIAAMDApple

Comfy LTX Desktop

ArtDesignAwesome/Comfy-LTX-Desktop-INT8-GGUF-Pinokiov5.0updated 3mo ago

Pinokio launcher for Comfy LTX Desktop with GGUF and INT8 support.

1 check-inNVIDIAAMDApple

REAL-Video-Enhancer

manat0912/RVE_Video-upscalerv5.0updated 3mo ago

Interpolate, Upscale, Decompress, and Denoise videos Locally on Linux/Windows/MacOS.

@manatheturipa

3 check-insNVIDIAAMDApple

HY-WorldPlay

manat0912/HY-Worldplay-Pinokio_testv5.0updated 3mo ago

test to get this app working on pinokio

@manatheturipa1 check-inNVIDIAAMDApple

WantedFeed

Most wanted

Follow to get notified when a launcher drops.

#1pinokio

AI Browser

#2OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

#3Local-AI-Image-Generator

A fully self-contained, offline AI image generation studio for Windows. Runs Stable Diffusion (Safetensors/GGUF) locally with zero manual setup. Auto-configures CUDA for Nvidia GPUs and Vulkan for AMD/Intel Arc cards. Zero system-wide dependencies required.

#4MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

#5ChatTTS

A generative speech model for daily dialogue.

#6GitHub - rsxdalv/TTS-WebUI: A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, ...

#7openmed

open-source healthcare ai

#8Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents using any LLM.

#9diffusiongemma-lab

Local web UI for the DiffusionGemma diffusion LLM — watch answers crystallize out of noise

#10meshflow

Repository for the CVPR 2026 paper MeshFlow Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer by Weiyu Li, Antoine Toisoul, Tom Monnier, Roman Shapovalov, Rakesh Ranjan, Ping Tan and Andrea Vedaldi.

Global feed

Launcher updates

Store