Store

Tag:#aix

diamond

https://github.com/pinokiofactory/diamondv3.7updated 12/2/2025, 11:46:31 PMindexed 1/27/2026, 4:09:42 PM

Diffusion for World Modeling https://diamond-wm.github.io/

#ai #game-generation #world-generation

omnigen

https://github.com/pinokiofactory/omnigenv3.7updated 12/2/2025, 11:04:47 PMindexed 1/20/2026, 9:15:20 AM

A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. https://huggingface.co/spaces/Shitao/OmniGen

#ai #image-generation

InstantIR

https://github.com/pinokiofactory/instantirv3.7updated 12/2/2025, 10:52:14 PMindexed 1/20/2026, 9:11:10 AM

restore low-res images, restore broken images, recreate a new version of the image with a prompt https://huggingface.co/spaces/fffiloni/InstantIR

#ai #image-edit

RMBG-2-Studio

https://github.com/pinokiofactory/RMBG-2-Studiov3.7updated 12/2/2025, 10:50:55 PMindexed 1/20/2026, 9:13:10 AM

Enhanced background remove and replace app built around BRIA-RMBG-2.0 https://huggingface.co/briaai/RMBG-2.0

#ai #image-edit #remove-background

ai-video-composer

https://github.com/pinokiofactory/ai-video-composerv3.7updated 12/2/2025, 10:14:35 PMindexed 1/20/2026, 9:11:19 AM

The ultimate video editor powered by natural language and FFMPEG https://huggingface.co/spaces/huggingface-projects/ai-video-composer

#ai #ffmpeg #video-edit

MMAudio

https://github.com/pinokiofactory/MMAudiov3.7updated 12/2/2025, 10:00:13 PMindexed 1/23/2026, 7:45:21 PM

Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio

#ai #audio-generation #video-to-audio

YuE

https://github.com/pinokiofactory/yuev3.7updated 12/2/2025, 9:56:04 PMindexed 1/20/2026, 9:14:03 AM

[NVIDIA ONLY] YuEGP--A Web UI for YuE, an Open Full-song Generation Foundation Model (10G VRAM required), via https://github.com/deepbeepmeep/YuEGP

#ai #song-generation

MatAnyone

https://github.com/pinokiofactory/MatAnyonev3.3updated 12/2/2025, 9:43:31 PMindexed 1/23/2026, 7:45:54 PM

MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git

#ai #edit #video-edit

cube

https://github.com/pinokiofactory/cubev3.7updated 12/2/2025, 9:01:20 PMindexed 1/20/2026, 9:10:59 AM

Roblox Foundation Model for 3D Intelligence --- Cross Platform (Mac, Windows, Linux): Requires 16GB+ VRAM PC or 18GB+ Memory Macs https://github.com/Roblox/cube

#3d #3d-generation #3dgen #ai

uno

https://github.com/pinokiofactory/unov3.7updated 12/2/2025, 8:55:24 PMindexed 1/20/2026, 9:15:25 AM

[NVIDIA ONLY] Generate an image from multiple images https://github.com/bytedance/UNO

#ai #image-generation

Bark Voice Cloning

https://github.com/6Morpheus6/barkv3.7updated 11/30/2025, 4:33:53 AMindexed 1/22/2026, 2:20:35 AM

Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning

hallo

https://github.com/pinokiofactory/hallov3.7updated 11/27/2025, 9:34:29 PMindexed 1/20/2026, 9:13:30 AM

[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation https://github.com/fudan-generative-vision/hallo

#ai #lipsync #video-generation

SillyTavern

https://github.com/pinokiofactory/sillytavernv1.5updated 11/26/2025, 11:36:47 PMindexed 1/20/2026, 9:10:36 AM

a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters. https://docs.sillytavern.app/

#llm #ai

Open WebUI

https://github.com/pinokiofactory/open-webuiv3.4.0updated 11/25/2025, 11:47:39 AMindexed 1/20/2026, 9:13:00 AM

User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs https://github.com/open-webui/open-webui

#ai #llm

StyleAligned

https://github.com/cocktailpeanut/StyleAligned.pinokiov3.0updated 11/19/2025, 11:40:05 AMindexed 1/23/2026, 7:45:10 PM

Style Aligned Image Generation via Shared Attention https://style-aligned-gen.github.io/

#image-generation #ai

XTTS

https://github.com/cocktailpeanut/xtts.pinokiov3.0updated 11/10/2025, 4:28:58 AMindexed 1/23/2026, 7:47:04 PM

clone voices into different languages by using just a quick 3-second audio clip. (a local version of https://huggingface.co/spaces/coqui/xtts)

#ai #tts

Florence2

https://github.com/pinokiofactory/florence2v3.7updated 11/6/2025, 4:47:33 AMindexed 1/20/2026, 9:13:56 AM

An advanced vision foundation model from MicroSoft https://huggingface.co/spaces/gokaygokay/Florence-2

#ai #vision #vlm

lavie

https://github.com/cocktailpeanut/lavie.pinokioupdated 5/27/2025, 4:47:46 AMindexed 1/23/2026, 7:44:45 PM

Text-to-Video (T2V) generation framework from Vchitect https://github.com/Vchitect/LaVie

#ai #video-generation

artist

https://github.com/pinokiofactory/artistv3.0updated 4/23/2025, 9:33:31 PMindexed 1/20/2026, 9:13:04 AM

Artist is a training-free text-driven image stylization method. You give an image and input a prompt describing the desired style, Artist give you the stylized image in that style. The detail of the original image and the style you provide is harmonically integrated https://huggingface.co/spaces/fffiloni/Artist

#ai #image-edit #image-generation

brushnet

https://github.com/cocktailpeanutlabs/brushnetv3.0updated 4/21/2025, 4:12:20 PMindexed 1/20/2026, 9:14:06 AM

A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion https://huggingface.co/spaces/TencentARC/BrushNet

#ai #image-edit #image-generation