menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Distributi...
source image

Arxiv

1w

read

304

img
dot

Image Credit: Arxiv

Distributionally Robust Policy Learning under Concept Drifts

  • Distributionally robust policy learning aims to find a policy that performs well under the worst-case distributional shift.
  • Existing methods for robust policy learning consider the worst-case joint distribution of the covariate and the outcome, which can be unnecessarily conservative.
  • This paper focuses on robust policy learning under concept drift, where only the conditional relationship between the outcome and the covariate changes.
  • The paper proposes a learning algorithm that maximizes the estimated policy value within a given policy class, with an optimal sub-optimality gap.

Read Full Article

like

18 Likes

For uninterrupted reading, download the app