Adding PaliGemma2 to KerasHub #1998

divyashreepathihalli · 2024-12-05T04:08:14Z

Sanity Check Colab : https://colab.sandbox.google.com/drive/1xejmaZvLgMFrzIrIRm2gHzrXF5cprp7P
Model summary
PaliGemma 2 is an update of the PaliGemma vision-language model (VLM) which incorporates the capabilities of the Gemma 2 models. The PaliGemma family of models is inspired by PaLI-3 and based on open components such as the SigLIP vision model and Gemma 2 language models. It takes both image and text as input and generates text as output, supporting multiple languages. It is designed for class-leading fine-tune performance on a wide range of vision-language tasks such as image and short video caption, visual question answering, text reading, object detection and object segmentation.

Model architecture
PaliGemma 2 is the composition of a Transformer decoder and a Vision Transformer image encoder. The text decoder is initialized from Gemma 2 in the 2B, 9B, and 27B parameter sizes. The image encoder is initialized from SigLIP-So400m/14. Similar to the original PaliGemma model, PaliGemma 2 is trained following the PaLI-3 recipes.

Inputs and outputs
Input: Image and text string, such as a prompt to caption the image, or a question. Output: Generated text in response to the input, such as a caption of the image, an answer to a question, a list of object bounding box coordinates, or segmentation codewords.

Model implementation author: @james77777778
KerasHub PaliGemma implementation lead: @divyashreepathihalli

* Add PaliGemma2 arch * Enable mixed precision check for PaliGemma * Add conversion script * Revert ImageConverter and reduce mem usage in the conversion script * Remove `compute_output_spec` * Fix `compute_output_shape` issue for keras 3.1 * Add model cards and update conversion script * update presets --------- Co-authored-by: divyashreepathihalli <[email protected]>

mattdangerw

Lgtm!

* Add PaliGemma2 (keras-team#96) * Add PaliGemma2 arch * Enable mixed precision check for PaliGemma * Add conversion script * Revert ImageConverter and reduce mem usage in the conversion script * Remove `compute_output_spec` * Fix `compute_output_shape` issue for keras 3.1 * Add model cards and update conversion script * update presets --------- Co-authored-by: divyashreepathihalli <[email protected]> * Update pali_gemma_presets.py - remove mix presets * Update pali_gemma_presets.py * Update convert_pali_gemma2_checkpoints.py --------- Co-authored-by: james77777778 <[email protected]>

* Adding PaliGemma2 to KerasHub (#1998) * Add PaliGemma2 (#96) * Add PaliGemma2 arch * Enable mixed precision check for PaliGemma * Add conversion script * Revert ImageConverter and reduce mem usage in the conversion script * Remove `compute_output_spec` * Fix `compute_output_shape` issue for keras 3.1 * Add model cards and update conversion script * update presets --------- Co-authored-by: divyashreepathihalli <[email protected]> * Update pali_gemma_presets.py - remove mix presets * Update pali_gemma_presets.py * Update convert_pali_gemma2_checkpoints.py --------- Co-authored-by: james77777778 <[email protected]> * Version bump to 0.18.0 * Update pali_gemma_presets.py (#2003) * Update pali_gemma_presets.py * code reformat * Adding PaliGemma2 to KerasHub (#1998) * Add PaliGemma2 (#96) * Add PaliGemma2 arch * Enable mixed precision check for PaliGemma * Add conversion script * Revert ImageConverter and reduce mem usage in the conversion script * Remove `compute_output_spec` * Fix `compute_output_shape` issue for keras 3.1 * Add model cards and update conversion script * update presets --------- Co-authored-by: divyashreepathihalli <[email protected]> * Update pali_gemma_presets.py - remove mix presets * Update pali_gemma_presets.py * Update convert_pali_gemma2_checkpoints.py --------- Co-authored-by: james77777778 <[email protected]> * Update pali_gemma_presets.py (#2003) * Update pali_gemma_presets.py * code reformat --------- Co-authored-by: james77777778 <[email protected]>

james77777778 and others added 3 commits December 4, 2024 01:14

Merge remote-tracking branch 'upstream/master'

ab99ea8

Update pali_gemma_presets.py - remove mix presets

1e6b260

github-actions bot added the Gemma Gemma model specific issues label Dec 5, 2024

divyashreepathihalli requested a review from mattdangerw December 5, 2024 04:08

mattdangerw approved these changes Dec 5, 2024

View reviewed changes

divyashreepathihalli added 2 commits December 4, 2024 20:11

Update pali_gemma_presets.py

c974797

Update convert_pali_gemma2_checkpoints.py

359f9e5

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Dec 5, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Dec 5, 2024

divyashreepathihalli merged commit f251ed3 into keras-team:master Dec 5, 2024
7 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding PaliGemma2 to KerasHub #1998

Adding PaliGemma2 to KerasHub #1998

divyashreepathihalli commented Dec 5, 2024 •

edited

Loading

mattdangerw left a comment

Adding PaliGemma2 to KerasHub #1998

Adding PaliGemma2 to KerasHub #1998

Conversation

divyashreepathihalli commented Dec 5, 2024 • edited Loading

mattdangerw left a comment

Choose a reason for hiding this comment

divyashreepathihalli commented Dec 5, 2024 •

edited

Loading