Store

Bagel-DFloat11

https://github.com/SUP3RMASS1VE/Bagel-DFloat11 | v3.7 | updated 6/26/2025, 2:37:42 PM | indexed 1/6/2026, 6:14:54 AM

[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM

Untitled

https://github.com/cocktailpeanutlabs/gh | v4.0 | updated 6/26/2025, 10:51:00 AM | indexed 1/6/2026, 6:15:32 AM

Omnigen-2

https://github.com/pipeob0/Omnigen-2 | v3.7 | updated 6/25/2025, 4:57:58 PM | indexed 1/6/2026, 6:19:24 AM

Omnigen 2

https://github.com/6Morpheus6/omnigen2 | v3.7 | updated 6/25/2025, 7:46:35 AM | indexed 1/6/2026, 6:15:43 AM

Unified Image Understanding and Generation. Text-to-Image Generation, In-context Generation, Instruction-guided Image Editing, Visual Understanding (Minimum Requirements 12GB VRAM / 48GB RAM, Recommended Requirements 24GB VRAM / 32GB RAM)

Prototype

https://github.com/cocktailpeanutlabs/proto | v4.0 | updated 6/24/2025, 11:35:15 AM | indexed 1/6/2026, 6:14:53 AM

Slides2Video

https://github.com/elloza/slides2video-pinokio-script | v3.2 | updated 6/19/2025, 7:47:38 AM | indexed 1/6/2026, 6:18:14 AM

Ovis2-8B

https://github.com/SUP3RMASS1VE/Ovis2-8B- | v3.6 | updated 6/18/2025, 10:34:25 AM | indexed 1/6/2026, 6:17:30 AM

A script for interacting with the Ovis2-8B model. It lets users load the model, process image and video inputs, and generate text responses through a conversational chatbot.

Nemoml

https://github.com/6Morpheus6/Nemoml | v3.7 | updated 6/12/2025, 7:56:40 PM | indexed 1/6/2026, 6:19:13 AM

[NVIDIA ONLY] A minimal Gradio interface for Automatic Speech Recognition. Transcribes audio in the Malayalam language.

Fooocus-API

https://github.com/6Morpheus6/Fooocus-API | v3.7 | updated 6/11/2025, 10:28:30 AM | indexed 1/6/2026, 6:17:29 AM

Fooocus powered by FastAPI

fluxgym

https://github.com/mgalore/fluxgym-enhanced | v2.1 | updated 6/11/2025, 8:51:00 AM | indexed 1/6/2026, 6:17:53 AM

[NVIDIA ONLY] Dead simple web UI for training FLUX LoRA with low VRAM support (from 12GB)

HunyuanPortrait

https://github.com/SUP3RMASS1VE/HunyuanPortrait | v3.7 | updated 6/5/2025, 9:20:42 PM | indexed 1/6/2026, 6:19:04 AM

Fish-Speech

https://github.com/SUP3RMASS1VE/Fish-Speech | v3.7 | updated 6/5/2025, 5:35:39 PM | indexed 1/6/2026, 6:15:22 AM

Direct3D-S2

https://github.com/Deathdadev/Direct3D-S2-Pinokio | v3.7 | updated 6/2/2025, 3:08:57 PM | indexed 1/6/2026, 6:16:10 AM

[NVIDIA ONLY] Direct3D-S2 is a scalable 3D shape generation framework leveraging sparse volumetric representations for high-resolution outputs. It features Spatial Sparse Attention (SSA), a novel mechanism that accelerates Diffusion Transformer computations on sparse data, achieving up to 9.6× speedup in training. The unified Sparse VAE architecture maintains a consistent sparse volumetric format across input, latent, and output stages, significantly improving efficiency and stability.

🎬 AutoGif

https://github.com/TheAwaken1/AutoGif-Pinokio | v2.0 | updated 6/1/2025, 10:08:35 AM | indexed 1/6/2026, 6:16:40 AM

Transform YouTube videos into stunning animated GIFs with perfectly-timed, stylized subtitles and eye-catching effects.

InfernoSaber Automapper

https://github.com/fred-brenner/InfernoSaber-App | updated 5/31/2025, 12:35:02 PM | indexed 1/6/2026, 6:17:15 AM

Flexible automapper for Beat Saber, made for any difficulty

Chatterbox

https://github.com/Deathdadev/chatterbox | v3.7 | updated 5/29/2025, 11:35:55 AM | indexed 1/6/2026, 6:15:11 AM

https://github.com/peanutcocktail/ghtest | v3.7 | updated 5/29/2025, 11:33:08 AM | indexed 1/6/2026, 6:16:45 AM

github

DreamO

https://github.com/petermg/DreamO_Pinokio | v3.7 | updated 5/26/2025, 4:16:31 AM | indexed 1/6/2026, 6:17:29 AM

AIraoke

https://github.com/TheAwaken1/AIraoke-Pinokio | v2.0 | updated 5/25/2025, 7:20:04 PM | indexed 1/6/2026, 6:19:27 AM

Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.

OneTrainerPinokio

https://github.com/odiseum7/OneTrainerPinokio | v3.7 | updated 5/21/2025, 7:24:12 PM | indexed 1/6/2026, 6:15:21 AM

OneTrainer for Pinokio, vato loco