Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Realtime environments change as agents perform action inference and learning, requiring high interaction frequencies to minimize regret.
Recent advances in machine learning involve larger neural networks with longer inference times, raising concerns about their applicability in realtime systems.
Proposed algorithms for staggering asynchronous inference processes ensure consistent time intervals for actions, enabling use of models with high inference times.