Pinokio

paligemma

v1.5
https://github.com/cocktailpeanutlabs/paligemmaupdated 5/15/2024, 6:31:03 PMindexed 1/20/2026, 9:13:29 AM

an open vision-language model by Google. PaliGemma is designed as a versatile model for transfer to a wide range of vision-language tasks such as image and short video caption, visual question answering, text reading, object detection and object segmentation https://huggingface.co/spaces/google/paligemma

TypeApps
Community tagsLoading...
Check-in
Sort
Loading…