Arxiv

Weak Scaling Capability in Token Space: An Observation from Large Vision Language Model

  • The study investigates the scaling capability of vision-language models with respect to the number of vision tokens.
  • The model scales only weakly with the number of vision tokens, with performance approximately following a power-law relationship.
  • The scaling behavior remains unaffected by the inclusion or exclusion of the user's question in the input.
  • Fusing the user's question with the vision tokens can enhance model performance when the question is relevant.
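
The power-law relationship mentioned above can be illustrated with a small fitting sketch. The summary does not give the exact functional form, so the code assumes S(N) ≈ (c / N)^α, where N is the number of vision tokens, S is a hypothetical error-like metric, and c and α are fitted constants. The function name and the data points are invented for illustration and are not results from the study.

```python
import math


def fit_power_law(num_tokens, scores):
    """Fit S(N) = (c / N) ** alpha by least squares in log-log space.

    Assumed form (not confirmed by the summary): taking logs gives
    log S = alpha * log c - alpha * log N, a straight line in log N.
    """
    xs = [math.log(n) for n in num_tokens]
    ys = [math.log(s) for s in scores]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count

    # Ordinary least-squares slope and intercept of log S against log N.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    alpha = -slope                      # slope of the line is -alpha
    c = math.exp(intercept / alpha)     # intercept is alpha * log c
    return c, alpha


# Synthetic, noise-free data generated from the assumed form itself.
tokens = [16, 32, 64, 128, 256, 512]
true_c, true_alpha = 4.0, 0.1
synthetic_scores = [(true_c / n) ** true_alpha for n in tokens]

c_hat, alpha_hat = fit_power_law(tokens, synthetic_scores)
print(f"c = {c_hat:.3f}, alpha = {alpha_hat:.3f}")
```

A small α in such a fit is what "weak scaling" would look like: doubling the number of vision tokens changes the metric by a factor of only 2^(-α).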
