menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Offline Sa...
source image

Arxiv

3d

read

301

img
dot

Image Credit: Arxiv

Offline Safe Reinforcement Learning Using Trajectory Classification

  • Offline safe reinforcement learning (RL) is a promising approach for learning safe behaviors without risky online interactions with the environment.
  • Existing methods in offline safe RL often result in overly conservative policies or safety constraint violations.
  • This paper proposes a new approach to offline safe RL that learns a policy generating desirable trajectories and avoiding undesirable ones.
  • The approach involves partitioning a pre-collected dataset into desirable and undesirable subsets, and using a classifier to score the trajectories.

Read Full Article

like

18 Likes

For uninterrupted reading, download the app