menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Assessing ...
source image

Arxiv

5d

read

375

img
dot

Image Credit: Arxiv

Assessing Modality Bias in Video Question Answering Benchmarks with Multimodal Large Language Models

  • Multimodal large language models (MLLMs) can process visual, textual, and auditory data.
  • Existing video question-answering benchmarks often exhibit bias towards a single modality.
  • The modality importance score (MIS) is introduced to identify and assess modality bias.
  • MLLM-derived MIS can guide the curation of modality-balanced datasets to enhance multimodal learning.

Read Full Article

like

22 Likes

For uninterrupted reading, download the app