Bagel-DFloat11
https://github.com/SUP3RMASS1VE/Bagel-DFloat11v3.7updated 6/26/2025, 2:37:42 PMindexed 1/6/2026, 6:14:54 AM
[NVIDIA ONLY] Image generation, image editing and free-form manipulation with a VLM
Untitled
https://github.com/cocktailpeanutlabs/ghv4.0updated 6/26/2025, 10:51:00 AMindexed 1/6/2026, 6:15:32 AM
Omnigen-2
https://github.com/pipeob0/Omnigen-2v3.7updated 6/25/2025, 4:57:58 PMindexed 1/6/2026, 6:19:24 AM
Omnigen 2
https://github.com/6Morpheus6/omnigen2v3.7updated 6/25/2025, 7:46:35 AMindexed 1/6/2026, 6:15:43 AM
Unified Image Understanding and Generation. Text-to-Image Generation, In-context Generation, Instruction-guided Image Editing, Visual Understanding (Minimum Requirements 12GBV RAM / 48GB RAM, Recommended Requirements 24GB VRAM / 32GB RAM)
Prototype
https://github.com/cocktailpeanutlabs/protov4.0updated 6/24/2025, 11:35:15 AMindexed 1/6/2026, 6:14:53 AM
Slides2Video
https://github.com/elloza/slides2video-pinokio-scriptv3.2updated 6/19/2025, 7:47:38 AMindexed 1/6/2026, 6:18:14 AM
Ovis2-8B
https://github.com/SUP3RMASS1VE/Ovis2-8B-v3.6updated 6/18/2025, 10:34:25 AMindexed 1/6/2026, 6:17:30 AM
interacting with the Ovis2-8B model. The script allows users to load the model, process image and video inputs, and generate text-based responses using a conversational chatbot.
Nemoml
https://github.com/6Morpheus6/Nemomlv3.7updated 6/12/2025, 7:56:40 PMindexed 1/6/2026, 6:19:13 AM
[NVIDIA ONLY] A minimal Gradio interface for Automatic Speech Recognition. Transcribe Audio in Malayalam language.
Fooocus-API
https://github.com/6Morpheus6/Fooocus-APIv3.7updated 6/11/2025, 10:28:30 AMindexed 1/6/2026, 6:17:29 AM
Fooocus powered by FastAPI
fluxgym
https://github.com/mgalore/fluxgym-enhancedv2.1updated 6/11/2025, 8:51:00 AMindexed 1/6/2026, 6:17:53 AM
[NVIDIA Only] Dead simple web UI for training FLUX LoRA with LOW VRAM support (From 12GB)
HunyuanPortrait
https://github.com/SUP3RMASS1VE/HunyuanPortraitv3.7updated 6/5/2025, 9:20:42 PMindexed 1/6/2026, 6:19:04 AM
Fish-Speech
https://github.com/SUP3RMASS1VE/Fish-Speechv3.7updated 6/5/2025, 5:35:39 PMindexed 1/6/2026, 6:15:22 AM
Direct3D-S2
https://github.com/Deathdadev/Direct3D-S2-Pinokiov3.7updated 6/2/2025, 3:08:57 PMindexed 1/6/2026, 6:16:10 AM
[NVIDIA ONLY] Direct3D-S2 is a scalable 3D shape generation framework leveraging sparse volumetric representations for high-resolution outputs. It features Spatial Sparse Attention (SSA), a novel mechanism that accelerates Diffusion Transformer computations on sparse data, achieving up to 9.6× speedup in training. The unified Sparse VAE architecture maintains a consistent sparse volumetric format across input, latent, and output stages, significantly improving efficiency and stability.
🎬 AutoGif
https://github.com/TheAwaken1/AutoGif-Pinokiov2.0updated 6/1/2025, 10:08:35 AMindexed 1/6/2026, 6:16:40 AM
Transform YouTube videos into stunning animated GIFs with perfectly-timed, stylized subtitles and eye-catching effects.
InfernoSaber Automapper
https://github.com/fred-brenner/InfernoSaber-Appupdated 5/31/2025, 12:35:02 PMindexed 1/6/2026, 6:17:15 AM
Flexible Automapper for Beatsaber made for any difficulty
Chatterbox
https://github.com/Deathdadev/chatterboxv3.7updated 5/29/2025, 11:35:55 AMindexed 1/6/2026, 6:15:11 AM
gh
https://github.com/peanutcocktail/ghtestv3.7updated 5/29/2025, 11:33:08 AMindexed 1/6/2026, 6:16:45 AM
github
DreamO
https://github.com/petermg/DreamO_Pinokiov3.7updated 5/26/2025, 4:16:31 AMindexed 1/6/2026, 6:17:29 AM
AIraoke
https://github.com/TheAwaken1/AIraoke-Pinokiov2.0updated 5/25/2025, 7:20:04 PMindexed 1/6/2026, 6:19:27 AM
Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
OneTrainerPinokio
https://github.com/odiseum7/OneTrainerPinokiov3.7updated 5/21/2025, 7:24:12 PMindexed 1/6/2026, 6:15:21 AM
OneTrainer para Pinokio vato loco
PreviousPage 7 / 18Next