menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

From Passi...
source image

Arxiv

1d

read

22

img
dot

Image Credit: Arxiv

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

  • Existing benchmarks primarily assess passive reasoning abilities of large language models (LLMs), providing all necessary information.
  • A new benchmark called AR-Bench is introduced to evaluate LLMs' active reasoning skills by requiring interaction with external systems to acquire missing evidence.
  • AR-Bench comprises task families like detective cases, situation puzzles, and guessing numbers to measure performance across various reasoning challenges.
  • Empirical evaluation on AR-Bench shows that current LLMs struggle with active reasoning, indicating a need for advancing methodology to enhance their capabilities.

Read Full Article

like

1 Like

For uninterrupted reading, download the app