Source: arXiv

1M

read

142

img
dot

Image Credit: Arxiv

Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

  • Traffic simulation aims to learn a policy for traffic agents that, when unrolled in closed-loop, faithfully recovers the joint distribution of trajectories observed in the real world.
  • Tokenized multi-agent policies have become the state of the art in traffic simulation, but they suffer from covariate shift when executed in closed-loop during simulation.
  • A new strategy called Closest Among Top-K (CAT-K) rollouts is presented to mitigate covariate shift, enabling improved performance of tokenized traffic simulation policies.
  • A policy fine-tuned with CAT-K outperforms larger models on the Waymo Sim Agent Challenge leaderboard, taking the top spot.
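
The summary names CAT-K rollouts but does not spell out the procedure. Going by the name, one plausible reading is that at each step the rollout keeps the K most likely tokens under the current policy and then picks the one that lands closest to the logged ground-truth trajectory, so the rollout stays near real data while still visiting states the policy itself would reach. The sketch below illustrates that reading for a single agent in 2D. Everything in it is an assumption made for illustration: the names `cat_k_step` and `cat_k_rollout`, the toy displacement vocabulary, and the fixed toy policy are hypothetical and are not taken from the paper's code. How the rolled-out states are then used for fine-tuning is not covered by the summary and is not shown.

```python
import math


def cat_k_step(token_probs, vocab, current_pos, gt_next_pos, k):
    """Among the k most likely tokens, return the index of the token whose
    resulting position is closest to the ground-truth next position."""
    # Indices of the k highest-probability tokens under the policy.
    top_k = sorted(range(len(token_probs)),
                   key=lambda i: token_probs[i], reverse=True)[:k]

    def distance_to_gt(i):
        dx, dy = vocab[i]
        next_pos = (current_pos[0] + dx, current_pos[1] + dy)
        return math.dist(next_pos, gt_next_pos)

    return min(top_k, key=distance_to_gt)


def cat_k_rollout(policy, vocab, start_pos, gt_positions, k):
    """Unroll the policy in closed loop, choosing one token per step with
    cat_k_step. Returns the chosen token indices and the visited positions."""
    pos = start_pos
    tokens, states = [], [start_pos]
    for gt_next_pos in gt_positions:
        probs = policy(pos)  # policy sees its own rolled-out state
        tok = cat_k_step(probs, vocab, pos, gt_next_pos, k)
        dx, dy = vocab[tok]
        pos = (pos[0] + dx, pos[1] + dy)
        tokens.append(tok)
        states.append(pos)
    return tokens, states


# Hypothetical toy setup: each token is a fixed (dx, dy) displacement.
VOCAB = [(1, 0), (0, 1), (-1, 0), (1, 1)]


def toy_policy(pos):
    # Assumed stand-in for a learned model: the same probabilities everywhere.
    return [0.4, 0.3, 0.2, 0.1]


tokens, states = cat_k_rollout(toy_policy, VOCAB, (0, 0), [(0, 1), (1, 1)], k=2)
print(tokens)  # [1, 0]
print(states)  # [(0, 0), (0, 1), (1, 1)]
```

In this toy run with K = 2, the first step picks token 1 even though token 0 is more likely, because token 1 lands exactly on the ground truth. With K = 1 the procedure would reduce to always taking the most likely token, and with K equal to the vocabulary size it would always take the token closest to the ground truth regardless of the policy.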
