menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

MAD: A Mag...
source image

Arxiv

1M

read

327

img
dot

Image Credit: Arxiv

MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning

  • A new policy parameterization called magnitude and direction (MAD) policies is introduced for reinforcement learning (RL).
  • MAD policies preserve Lp closed-loop stability for nonlinear dynamical systems, introducing explicit feedback on state-dependent features without compromising stability.
  • The control input magnitude is described with a disturbance-feedback Lp-stable operator, while the direction is selected based on state-dependent features using a universal function approximator.
  • MAD policies maintain closed-loop stability in model-free RL pipelines without requiring model information beyond open-loop stability.

Read Full Article

like

19 Likes

For uninterrupted reading, download the app