menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Regret Bou...
source image

Arxiv

1w

read

380

img
dot

Image Credit: Arxiv

Regret Bounds for Robust Online Decision Making

  • A framework is proposed for robust online decision making, allowing multivalued models.
  • The framework introduces convex sets of probability distributions for decision outcomes.
  • Nature can choose distributions from the set in an arbitrary and non-oblivious manner.
  • The framework demonstrates improved regret bounds in robust linear bandits and tabular robust online reinforcement learning.

Read Full Article

like

22 Likes

For uninterrupted reading, download the app