<ul><li>Google has introduced PaliGemma 2, a family of vision-language models built for transfer to downstream tasks.</li><li>The models come in three sizes (3B, 10B, and 28B parameters) and three input resolutions (224, 448, and 896 pixels), so they can be fine-tuned for tasks across diverse domains.</li><li>PaliGemma 2 achieved state-of-the-art results on several tasks, including molecular structure recognition and table structure recognition.</li><li>The models are designed for accessibility and can run in low-precision formats for on-device inference.</li></ul>

Google Unveils PaliGemma 2 Vision-Language Models for Advanced Task Transfer
