An optimization framework is developed focusing on transferring control authority to a parametric policy to create an autonomous dynamical system.
The framework allows optimizing policy parameters independently of controls or actions, without relying on approximate Dynamic Programming and Reinforcement Learning.
Simpler algorithms at the autonomous system level are derived, performing computations equivalent to policy gradients, Hessians, and other optimization methods.
The framework is applicable to various tasks like behavioral cloning, mechanism design, system identification, and tuning generative AI models.