paligemma

Installable

https://github.com/cocktailpeanutlabs/paligemmaupdated 5/15/2024, 6:31:03 PMindexed 1/20/2026, 9:13:29 AM

an open vision-language model by Google. PaliGemma is designed as a versatile model for transfer to a wide range of vision-language tasks such as image and short video caption, visual question answering, text reading, object detection and object segmentation https://huggingface.co/spaces/google/paligemma

Check-in

Repos Used By This App1

PaliGemma Demo - a Hugging Face Space by cocktailpeanuthuggingface.co/spaces/cocktailpeanut/paligemmagit clone in install.js0 check-ins

Community tagsLoading...

Community

Post about paligemma...Post