menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

TD3-BST: A...
source image

Marktechpost

2w

read

297

img
dot

TD3-BST: A Machine Learning Algorithm to Adjust the Strength of Regularization Dynamically Using Uncertainty Model

  • Researchers from Imperial College London introduced TD3-BST (TD3 with Behavioral Supervisor Tuning), an algorithm that uses an uncertainty model to adjust the strength of regularization dynamically.
  • TD3-BST helps adjust regularization dynamically using an uncertainty network, optimizing Q-values around dataset modes.
  • TD3-BST outperforms other methods and showcases state-of-the-art performance when tested on D4RL datasets.
  • The integration of policy regularization with an ensemble-based source of uncertainty enhances the performance of TD3-BST.

Read Full Article

like

17 Likes

For uninterrupted reading, download the app