menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

VeriThinke...
source image

Arxiv

1d

read

345

img
dot

Image Credit: Arxiv

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

  • VeriThinker is introduced as a novel approach for compressing Chain-of-Thought (CoT) reasoning, which helps in making Large Reasoning Models (LRMs) more efficient.
  • Unlike traditional methods, VeriThinker fine-tunes LRMs solely through an auxiliary verification task instead of using synthetic concise CoT data, leading to a reduction in overthinking and unnecessary reasoning chain lengths.
  • Experiments show that VeriThinker significantly reduces reasoning chain lengths while maintaining or slightly improving accuracy when applied to tasks like MATH500 and AIME25, showcasing its effectiveness in enhancing model efficiency.
  • VeriThinker can also be generalized to speculative reasoning and the code is available at https://github.com/czg1225/VeriThinker.

Read Full Article

like

20 Likes

For uninterrupted reading, download the app