Pinokio

Moondream1

v1.1
https://github.com/cocktailpeanutlabs/moondream1updated 7/11/2024, 10:59:29 AMindexed 1/20/2026, 9:15:23 AM

moondream1 is a tiny (1.6B parameter) vision language model trained by @vikhyatk that performs on par with models twice its size. It is trained on the LLaVa training dataset, and initialized with SigLIP as the vision tower and Phi-1.5 as the text encoder. https://huggingface.co/spaces/vikhyatk/moondream1

TypeApps
Community tagsLoading...
Check-in
Sort
Loading…