menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

World Mode...
source image

Arxiv

1M

read

350

img
dot

Image Credit: Arxiv

World Model Agents with Change-Based Intrinsic Motivation

  • A technique called Change Based Exploration Transfer (CBET) is adapted for world model algorithms like DreamerV3.
  • CBET shows potential in improving DreamerV3's returns in the sparse reward environment of Crafter, but reduces returns and leads to suboptimal policies in Minigrid.
  • Pre-training DreamerV3 with intrinsic rewards does not immediately maximize extrinsic rewards in Minigrid.
  • CBET has a positive impact on DreamerV3 in more complex environments like Crafter, but may be detrimental in environments like Minigrid.

Read Full Article

like

21 Likes

For uninterrupted reading, download the app