Google has introduced PaliGemma 2, a family of vision-language models built for transfer to a wide range of tasks. The models come in three sizes and three resolutions, allowing for transfer learning across diverse domains. PaliGemma 2 achieved state-of-the-art results in several fields, including molecular structure recognition and table structure analysis. The models are designed for accessibility and can run in low-precision formats for on-device inference.