Researchers propose the Peer-Aware Cost Estimation (PACE) framework for learning the cost parameters of another agent in a linear quadratic differential game with incomplete information.
PACE treats the other agent as a learning agent rather than a stationary optimal agent and models their learning dynamics to infer their cost function parameters.
The PACE framework enables agents to adapt their control policies based on real-time inference of each other's objective functions, using only previous state observations.
Numerical studies show that modeling the learning dynamics of the other agent improves stability and convergence speed compared to approaches assuming complete information.