Pinokio
ai-video-composer
https://github.com/pinokiofactory/ai-video-composer | v3.7 | updated 12/2/2025, 10:14:35 PM | indexed 1/20/2026, 9:11:19 AM
The ultimate video editor powered by natural language and FFmpeg https://huggingface.co/spaces/huggingface-projects/ai-video-composer
Curated by @pinokio
MMAudio
https://github.com/pinokiofactory/MMAudio | v3.7 | updated 12/2/2025, 10:00:13 PM | indexed 1/23/2026, 7:45:21 PM
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
Curated by @pinokio
StyleTTS2 Studio
https://github.com/pinokiofactory/StyleTTS2_Studio | v3.7 | updated 1/4/2026, 5:07:11 AM | indexed 1/23/2026, 7:46:14 PM
Build your own voice for StyleTTS2
Curated by @pinokio
bolt.diy
https://github.com/pinokiofactory/bolt | v3.4.0 | updated 12/6/2025, 9:59:32 PM | indexed 1/20/2026, 9:13:03 AM
Prompt, run, edit, and deploy full-stack web apps. https://github.com/stackblitz-labs/bolt.diy
Curated by @pinokio
Open WebUI
https://github.com/pinokiofactory/open-webui | v3.4.0 | updated 11/25/2025, 11:47:39 AM | indexed 1/20/2026, 9:13:00 AM
User-friendly WebUI for LLMs. Supported LLM runners include Ollama and OpenAI-compatible APIs. https://github.com/open-webui/open-webui
Curated by @pinokio
YuE
https://github.com/pinokiofactory/yue | v3.7 | updated 12/2/2025, 9:56:04 PM | indexed 1/20/2026, 9:14:03 AM
[NVIDIA ONLY] YuEGP: a Web UI for YuE, an open full-song generation foundation model (10 GB VRAM required), via https://github.com/deepbeepmeep/YuEGP
Curated by @pinokio
browser-use
https://github.com/pinokiofactory/browser-use | v3.6 | updated 4/1/2025, 4:25:09 AM | indexed 1/20/2026, 9:15:25 AM
Run an AI agent in your browser. https://github.com/browser-use/web-ui
Curated by @pinokio
zonos
https://github.com/pinokiofactory/zonos | v3.7 | updated 12/6/2025, 10:44:22 PM | indexed 1/20/2026, 9:11:12 AM
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers. https://github.com/Zyphra/Zonos
Curated by @pinokio
macOS-use
https://github.com/pinokiofactory/macOS-use | v3.6 | updated 4/1/2025, 4:14:57 AM | indexed 1/20/2026, 9:14:07 AM
[Mac Only] We make AI agents that control Mac apps: https://github.com/browser-use/macOS-use
Curated by @pinokio
MatAnyone
https://github.com/pinokiofactory/MatAnyone | v3.3 | updated 12/2/2025, 9:43:31 PM | indexed 1/23/2026, 7:45:54 PM
MatAnyone is an AI tool for editing videos by separating objects from their backgrounds, which makes it effective for removing video backgrounds. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
Curated by @pinokio
DiffRhythm
https://github.com/pinokiofactory/diffrhythm | v3.7 | updated 12/5/2025, 1:50:16 AM | indexed 1/20/2026, 9:09:38 AM
Generate songs with AI (up to 4 min 45 sec), either with lyrics or instrumental. https://github.com/ASLP-lab/DiffRhythm
Curated by @pinokio
cube
https://github.com/pinokiofactory/cube | v3.7 | updated 12/2/2025, 9:01:20 PM | indexed 1/20/2026, 9:10:59 AM
Roblox Foundation Model for 3D Intelligence. Cross-platform (Mac, Windows, Linux): requires a PC with 16 GB+ VRAM or a Mac with 18 GB+ memory. https://github.com/Roblox/cube
Curated by @pinokio
HunyuanVideo
https://github.com/pinokiofactory/hunyuanvideo | v3.7 | updated 1/19/2026, 4:49:14 PM | indexed 1/27/2026, 8:39:57 PM
[NVIDIA ONLY] Super-optimized Gradio UI for the Hunyuan Video Generator that works on GPU-poor machines. Generate videos of up to 10 to 14 seconds. https://github.com/deepbeepmeep/HunyuanVideoGP
Curated by @pinokio
uno
https://github.com/pinokiofactory/uno | v3.7 | updated 12/2/2025, 8:55:24 PM | indexed 1/20/2026, 9:15:25 AM
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
Curated by @pinokio
Dia
https://github.com/pinokiofactory/dia | v3.7 | updated 12/7/2025, 7:54:59 PM | indexed 1/20/2026, 9:12:46 AM
Dia is a 1.6B-parameter text-to-speech model created by Nari Labs. Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communication such as laughter, coughing, and throat clearing. https://github.com/nari-labs/dia
Curated by @pinokio
FramePack
https://github.com/pinokiofactory/Frame-Pack | v3.7 | updated 12/18/2025, 10:04:25 AM | indexed 1/23/2026, 7:46:30 PM
[NVIDIA ONLY] Generate Video Progressively. FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively. https://github.com/lllyasviel/FramePack
Curated by @pinokio
FaceFusion 3.4.1
https://github.com/facefusion/facefusion-pinokio | v1.6 | updated 1/25/2026, 12:08:25 PM | indexed 1/26/2026, 7:30:46 PM
Industry-leading face manipulation platform
Curated by @pinokio
Ultimate-TTS-Studio
https://github.com/pinokiofactory/Ultimate-TTS-Studio | v3.7 | updated 1/26/2026, 5:29:34 PM | indexed 1/28/2026, 8:16:05 PM
Kokoro, KittenTTS, Higgs audio, Chatterbox/Multi, Fish-Speech, F5 & index-tts & indextts2, VoxCPM and VibeVoice in one app
Curated by @pinokio
WebUI for ML-Sharp (3DGS)
https://github.com/francescofugazzi/ml-sharp-pinokio | v0.3 | updated 1/27/2026, 3:26:19 AM | indexed 1/27/2026, 8:09:44 AM
One-click 3D Gaussian Splatting generation from a single image.
Curated by @pinokio
Orpheus-TTS-FastAPI
https://github.com/pinokiofactory/Orpheus-TTS-FastAPI | v3.7 | updated 1/24/2026, 11:02:12 PM | indexed 1/24/2026, 11:41:38 PM
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis. https://github.com/canopyai/Orpheus-TTS
Curated by @pinokio