menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Planning a...
source image

Arxiv

6d

read

222

img
dot

Image Credit: Arxiv

Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem

  • In this work, a generalized restless multi-arm bandit problem with risk-awareness is addressed.
  • Indexability conditions for risk-aware objective are established and a solution based on Whittle index is provided.
  • A Thompson sampling approach is proposed for the learning problem with unknown transition probabilities, achieving bounded regret.
  • Numerical experiments illustrate the efficacy of the method in reducing risk exposure in various applications.

Read Full Article

like

13 Likes

For uninterrupted reading, download the app