Pinokio

Store

Feed
Latest
Tag:#aix
Related tags
diamond
https://github.com/pinokiofactory/diamondv3.7updated 12/2/2025, 11:46:31 PMindexed 1/27/2026, 4:09:42 PM
Diffusion for World Modeling https://diamond-wm.github.io/
omnigen
https://github.com/pinokiofactory/omnigenv3.7updated 12/2/2025, 11:04:47 PMindexed 1/20/2026, 9:15:20 AM
A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen
InstantIR
https://github.com/pinokiofactory/instantirv3.7updated 12/2/2025, 10:52:14 PMindexed 1/20/2026, 9:11:10 AM
restore low-res images, restore broken images, recreate a new version of the image with a prompt https://huggingface.co/spaces/fffiloni/InstantIR
RMBG-2-Studio
https://github.com/pinokiofactory/RMBG-2-Studiov3.7updated 12/2/2025, 10:50:55 PMindexed 1/20/2026, 9:13:10 AM
Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0
ai-video-composer
https://github.com/pinokiofactory/ai-video-composerv3.7updated 12/2/2025, 10:14:35 PMindexed 1/20/2026, 9:11:19 AM
The ultimate video editor powered by natural language and FFMPEG https://huggingface.co/spaces/huggingface-projects/ai-video-composer
MMAudio
https://github.com/pinokiofactory/MMAudiov3.7updated 12/2/2025, 10:00:13 PMindexed 1/23/2026, 7:45:21 PM
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
YuE
https://github.com/pinokiofactory/yuev3.7updated 12/2/2025, 9:56:04 PMindexed 1/20/2026, 9:14:03 AM
[NVIDIA ONLY] YuEGP--A Web UI for YuE, an Open Full-song Generation Foundation Model (10G VRAM required), via https://github.com/deepbeepmeep/YuEGP
MatAnyone
https://github.com/pinokiofactory/MatAnyonev3.3updated 12/2/2025, 9:43:31 PMindexed 1/23/2026, 7:45:54 PM
MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
cube
https://github.com/pinokiofactory/cubev3.7updated 12/2/2025, 9:01:20 PMindexed 1/20/2026, 9:10:59 AM
Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires 16GB+ VRAM PC or 18GB+ Memory Macs https://github.com/Roblox/cube
uno
https://github.com/pinokiofactory/unov3.7updated 12/2/2025, 8:55:24 PMindexed 1/20/2026, 9:15:25 AM
[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO
Bark Voice Cloning
https://github.com/6Morpheus6/barkv3.7updated 11/30/2025, 4:33:53 AMindexed 1/22/2026, 2:20:35 AM
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
hallo
https://github.com/pinokiofactory/hallov3.7updated 11/27/2025, 9:34:29 PMindexed 1/20/2026, 9:13:30 AM
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo
SillyTavern
https://github.com/pinokiofactory/sillytavernv1.5updated 11/26/2025, 11:36:47 PMindexed 1/20/2026, 9:10:36 AM
a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters. https://docs.sillytavern.app/
Open WebUI
https://github.com/pinokiofactory/open-webuiv3.4.0updated 11/25/2025, 11:47:39 AMindexed 1/20/2026, 9:13:00 AM
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui
StyleAligned
https://github.com/cocktailpeanut/StyleAligned.pinokiov3.0updated 11/19/2025, 11:40:05 AMindexed 1/23/2026, 7:45:10 PM
Style Aligned Image Generation via Shared Attention https://style-aligned-gen.github.io/
XTTS
https://github.com/cocktailpeanut/xtts.pinokiov3.0updated 11/10/2025, 4:28:58 AMindexed 1/23/2026, 7:47:04 PM
clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)
Florence2
https://github.com/pinokiofactory/florence2v3.7updated 11/6/2025, 4:47:33 AMindexed 1/20/2026, 9:13:56 AM
An advanced vision foundation model from MicroSoft https://huggingface.co/spaces/gokaygokay/Florence-2
lavie
https://github.com/cocktailpeanut/lavie.pinokioupdated 5/27/2025, 4:47:46 AMindexed 1/23/2026, 7:44:45 PM
Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie
artist
https://github.com/pinokiofactory/artistv3.0updated 4/23/2025, 9:33:31 PMindexed 1/20/2026, 9:13:04 AM
Artist is a training-free text-driven image stylization method. You give an image and input a prompt describing the desired style, Artist give you the stylized image in that style. The detail of the original image and the style you provide is harmonically integrated https://huggingface.co/spaces/fffiloni/Artist
brushnet
https://github.com/cocktailpeanutlabs/brushnetv3.0updated 4/21/2025, 4:12:20 PMindexed 1/20/2026, 9:14:06 AM
A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion https://huggingface.co/spaces/TencentARC/BrushNet
PreviousPage 3 / 6Next