facepoke
https://github.com/pinokiofactory/facepokev3.7updated 12/4/2025, 3:24:36 AMindexed 1/6/2026, 6:19:24 AM
[NVIDIA Only] Select a portrait, click to move the head around https://github.com/jbilcke-hf/FacePoke
Kokoro-TTS-Multilingual.git
https://github.com/6Morpheus6/Kokoro-TTS-Multilingualv3.7updated 12/4/2025, 2:47:04 AMindexed 1/6/2026, 6:16:43 AM
Super fast Multilingual TTS supporting 54 voices across 8 languages.
Fara-7B Computer Use Agent
https://github.com/neviah/Fara-Pinokiov3.7updated 12/3/2025, 9:25:51 PMindexed 1/6/2026, 6:15:35 AM
Microsoft's 7B parameter computer use agent with Gradio interface
omnigen
https://github.com/pinokiofactory/omnigenv3.7updated 12/2/2025, 11:04:47 PMindexed 1/6/2026, 6:19:48 AM
A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen
InstantIR
https://github.com/pinokiofactory/instantirv3.7updated 12/2/2025, 10:52:14 PMindexed 1/6/2026, 6:18:51 AM
restore low-res images, restore broken images, recreate a new version of the image with a prompt https://huggingface.co/spaces/fffiloni/InstantIR
RMBG-2-Studio
https://github.com/pinokiofactory/RMBG-2-Studiov3.7updated 12/2/2025, 10:50:55 PMindexed 1/6/2026, 6:17:31 AM
Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0
Clarity Refiners UI
https://github.com/pinokiofactory/clarity-refiners-uiv3.7updated 12/2/2025, 10:18:27 PMindexed 1/6/2026, 6:19:13 AM
An enhanced local port of finegrain-image-enhancer powered by Refiners (https://huggingface.co/spaces/finegrain/finegrain-image-enhancer), which was adapted from philz1337x's Clarity Upscaler (https://github.com/philz1337x/clarity-upscaler)
ai-video-composer
https://github.com/pinokiofactory/ai-video-composerv3.7updated 12/2/2025, 10:14:35 PMindexed 1/6/2026, 6:19:16 AM
The ultimate video editor powered by natural language and FFMPEG https://huggingface.co/spaces/huggingface-projects/ai-video-composer
MMAudio
https://github.com/pinokiofactory/MMAudiov3.7updated 12/2/2025, 10:00:10 PMindexed 1/6/2026, 6:15:25 AM
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
StyleTTS2 Studio
https://github.com/pinokiofactory/StyleTTS2_Studiov3.7updated 12/2/2025, 9:58:13 PMindexed 1/6/2026, 6:17:31 AM
Build your own voice for StyleTTS2
YuE
https://github.com/pinokiofactory/yuev3.7updated 12/2/2025, 9:56:04 PMindexed 1/6/2026, 6:19:18 AM
[NVIDIA ONLY] YuEGP--A Web UI for YuE, an Open Full-song Generation Foundation Model (10G VRAM required), via https://github.com/deepbeepmeep/YuEGP
MatAnyone
https://github.com/pinokiofactory/MatAnyonev3.3updated 12/2/2025, 9:43:27 PMindexed 1/6/2026, 6:18:15 AM
MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
Comfyui
https://github.com/pinokiofactory/comfyv3.7updated 12/2/2025, 9:35:15 PMindexed 1/6/2026, 6:14:48 AM
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. https://github.com/comfyanonymous/ComfyUI
cube
https://github.com/pinokiofactory/cubev3.7updated 12/2/2025, 9:01:20 PMindexed 1/6/2026, 6:14:39 AM
Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires 16GB+ VRAM PC or 18GB+ Memory Macs https://github.com/Roblox/cube
HunyuanVideo
https://github.com/pinokiofactory/hunyuanvideov3.7updated 12/2/2025, 8:58:55 PMindexed 1/6/2026, 6:16:51 AM
[NVIDIA ONLY] Super Optimized Gradio UI for Hunyuan Video Generator that works on GPU poor machines. Generate up to 10~14 sec videos https://github.com/deepbeepmeep/HunyuanVideoGP
Orpheus-TTS-FastAPI
https://github.com/pinokiofactory/Orpheus-TTS-FastAPIv3.7updated 12/2/2025, 8:56:20 PMindexed 1/6/2026, 6:19:43 AM
Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
uno
https://github.com/pinokiofactory/unov3.7updated 12/2/2025, 8:55:24 PMindexed 1/6/2026, 6:19:17 AM
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
Wan22 Brkn Prompt Helper
https://github.com/NUVoize/wan22-brkn-prompt-helperupdated 12/1/2025, 4:12:09 PMindexed 1/6/2026, 6:15:27 AM
hallo
https://github.com/pinokiofactory/hallov3.7updated 11/27/2025, 9:34:29 PMindexed 1/6/2026, 6:18:16 AM
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
LatentSync
https://github.com/manat0912/LatentSync-Pinokiov3.7updated 11/27/2025, 10:23:23 AMindexed 1/6/2026, 6:17:34 AM
High quality LipSync Application with a simple UI
PreviousPage 4 / 18Next