menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Modality-B...
source image

Arxiv

4d

read

195

img
dot

Image Credit: Arxiv

Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining

  • A new study introduces Modality-Balancing Preference Optimization (MBPO) to address modality imbalance in Large Multimodal Models (LMMs).
  • MBPO generates hard negatives to counter biases in Large Language Model (LLM) backbones and incorporates online responses with verified rewards using Group Relative Policy Optimization (GRPO).
  • The method aims to improve reasoning capabilities in LMMs and reduce hallucinations by balancing language prior biases over visual inputs.
  • Experiments show that MBPO enhances performance on vision-language tasks and effectively combats modality imbalance in LMMs.

Read Full Article

like

11 Likes

For uninterrupted reading, download the app