
Hackernoon



Confronting Multimodal LLM Challenges: Reasoning Gaps and Safety Trade-offs in Phi-3-Vision

  • Phi-3-Vision, a multimodal LLM, performs well across many tasks but struggles with high-level reasoning and occasionally generates ungrounded outputs, which raises reliability concerns in sensitive fields such as finance.
  • Safety post-training has helped, but Phi-3-Vision still struggles to decline harmful or sensitive questions, highlighting a trade-off between helpfulness and harmlessness.
  • Future work will add more reasoning-focused and hallucination-related DPO (Direct Preference Optimization) data to post-training to address these limitations.
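
For readers unfamiliar with DPO, here is a minimal sketch of the standard DPO loss for a single preference pair. It is not taken from the Phi-3-Vision training code: the function name `dpo_loss`, the `beta` default, and the log-probability values in the usage note are all illustrative assumptions. "Chosen" stands for the preferred response (for example, a grounded answer) and "rejected" for the dispreferred one (for example, a hallucinated answer).

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (hypothetical helper).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an assumed value, not one reported for Phi-3-Vision.
    """
    # How much more the policy favors each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between the chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)), computed in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

With made-up log-probabilities, `dpo_loss(-1.0, -5.0, -2.0, -2.0)` is lower than `dpo_loss(-5.0, -1.0, -2.0, -2.0)`: the loss falls as the policy shifts probability toward the chosen response relative to the reference model, which is how additional preference pairs targeting hallucination would push the model toward grounded answers.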
