Pinokio
ai-toolkit
https://github.com/ai-anchorite/ai-toolkit | v3.7 | updated 12/13/2025, 3:33:21 AM | indexed 1/23/2026, 7:47:32 PM
AI Toolkit by Ostris
MuseTalk
https://github.com/manat0912/TalkingMuse | v3.7 | updated 12/12/2025, 11:41:39 AM | indexed 1/23/2026, 7:45:08 PM
VoxCPM
https://github.com/Paxurux/Voxcpm | v3.7 | updated 12/11/2025, 6:20:28 PM | indexed 1/23/2026, 7:46:57 PM
Voice Synthesis Platform with Smart Chunking, Batch Processing, and Voice Cloning capabilities.
RVC
https://github.com/cocktailpeanut/rvc.pinokio | v3.7 | updated 12/11/2025, 2:33:17 PM | indexed 1/23/2026, 7:44:52 PM
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
MLX Whisper WebUI
https://github.com/dadleo/mlx-whisper-webui-pinokio | v1.1 | updated 12/10/2025, 7:33:38 PM | indexed 1/23/2026, 7:47:35 PM
Fast Speech-to-Text Web UI with Apple MLX and OpenAI Whisper
VoxCPM-1.5
https://github.com/PierrunoYT/VoxCPM-1.5-Pinokio | v1.0.0 | updated 12/9/2025, 5:03:11 PM | indexed 1/23/2026, 7:47:32 PM
🎙️ Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning. Features a 44.1 kHz sampling rate and a 6.25 Hz token rate, and supports both SFT and LoRA fine-tuning. Built on the MiniCPM-4 backbone for highly expressive, natural speech synthesis.
Dia
https://github.com/pinokiofactory/dia | v3.7 | updated 12/7/2025, 7:54:59 PM | indexed 1/20/2026, 9:12:46 AM
Dia is a 1.6B parameter text to speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc. https://github.com/nari-labs/dia
FramePack-Studio
https://github.com/FP-Studio/fp-studio | v3.7 | updated 12/7/2025, 10:46:08 AM | indexed 1/23/2026, 7:45:22 PM
[v0.5.1] FramePack Video App offering multiple generation types: Original, F1, video extension, end frame. Features include: LoRA support, job queueing, advanced timestamped prompts, offline mode, a post-processing suite including upscaling, interpolation, filters and more!
Ollama Web Interface
https://github.com/JL-Bones/Ollama_Web | updated 12/6/2025, 10:59:27 PM | indexed 1/20/2026, 9:14:54 AM
A web interface for managing and interacting with Ollama models
zonos
https://github.com/pinokiofactory/zonos | v3.7 | updated 12/6/2025, 10:44:22 PM | indexed 1/20/2026, 9:11:12 AM
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
bolt.diy
https://github.com/pinokiofactory/bolt | v3.4.0 | updated 12/6/2025, 9:59:32 PM | indexed 1/20/2026, 9:13:03 AM
Prompt, run, edit, and deploy full-stack web apps. https://github.com/stackblitz-labs/bolt.diy
Music Video Cutter
https://github.com/6Morpheus6/mvc | v3.7 | updated 12/6/2025, 8:17:35 PM | indexed 1/23/2026, 7:47:47 PM
Automatically create music videos. Synchronize the cuts to the music's beat.
ACE-Step
https://github.com/pinokiofactory/ACE-Step | v3.7 | updated 12/6/2025, 11:35:01 AM | indexed 1/23/2026, 7:45:55 PM
A Step Towards Music Generation Foundation Model
echomimic2
https://github.com/pinokiofactory/echomimic2 | v3.7 | updated 12/6/2025, 5:47:56 AM | indexed 1/20/2026, 9:14:28 AM
[NVIDIA ONLY] Make virtual avatars say whatever you want with an image and an audio clip. https://github.com/antgroup/echomimic_v2
PRX-1024 Text-to-Image
https://github.com/PierrunoYT/Photoroom-PRX-Pinokio | v1.0.0 | updated 12/5/2025, 7:25:29 PM | indexed 1/23/2026, 7:46:45 PM
Gradio web interface for Photoroom's PRX-1024-t2i-beta text-to-image model
Vibevoice Realtime Pinokio
https://github.com/SUP3RMASS1VE/VibeVoice-Realtime-Pinokio | v4.0 | updated 12/5/2025, 3:38:20 PM | indexed 1/23/2026, 7:47:34 PM
DiffRhythm
https://github.com/pinokiofactory/diffrhythm | v3.7 | updated 12/5/2025, 1:50:16 AM | indexed 1/20/2026, 9:09:38 AM
Generate songs with AI (up to 4 min 45 sec), either with lyrics or instrumental. https://github.com/ASLP-lab/DiffRhythm
pyramidflow
https://github.com/pinokiofactory/pyramidflow | v3.7 | updated 12/4/2025, 6:27:40 PM | indexed 1/20/2026, 9:12:09 AM
Pyramid Flow Video Generation AI (text-to-video & image-to-video) https://github.com/jy0205/Pyramid-Flow
Pentest Quick Recon (Lab Only)
https://github.com/IlyasAlla/Pentest-Recon-Pinokio | updated 12/4/2025, 6:02:25 AM | indexed 1/23/2026, 7:47:34 PM
One-click, permissioned recon: nmap + web enum, logs to /reports. For authorized testing only.
Invoke
https://github.com/6Morpheus6/invoke | v3.7 | updated 12/4/2025, 5:43:02 AM | indexed 1/25/2026, 3:22:15 AM
The Gen AI Platform for Pro Studios https://github.com/invoke-ai/InvokeAI