Type: Apps

paligemma

v1.5
https://github.com/cocktailpeanutlabs/paligemma
Updated: 5/15/2024, 6:31:03 PM
Indexed: 1/6/2026, 6:18:06 AM

An open vision-language model by Google. PaliGemma is designed as a versatile model for transfer to a wide range of vision-language tasks, such as image and short-video captioning, visual question answering, text reading, object detection, and object segmentation.

https://huggingface.co/spaces/google/paligemma
