menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

SORREL: Su...
source image

Arxiv

3d

read

353

img
dot

Image Credit: Arxiv

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

  • Mixed Integer Linear Program (MILP) solvers heavily rely on hand-crafted heuristics for branching.
  • Data-driven approaches have been used to automatically learn these heuristics.
  • Suboptimal-Demonstration-Guided Reinforcement Learning (SORREL) is proposed to learn branching using suboptimal demonstrations.
  • SORREL shows advanced performance in branching quality and training efficiency for various MILPs.

Read Full Article

like

21 Likes

For uninterrupted reading, download the app