Southwest Airlines faced a series of failures during the holiday season, resulting in losses of $750 million and stranding 2 million passengers.
MIT researchers developed a computational system to pinpoint root causes of rare failures using sparse data combined with extensive normal operations data.
The findings were presented at the International Conference on Learning Representations by MIT doctoral student Charles Dawson and professor Chuchu Fan along with colleagues.
The goal was to create a diagnostic tool for real-world systems to predict failures.
The research focused on cyber-physical problems where automated systems interact with real-world complexities, causing unexpected failures.
Unlike robotic systems, airline scheduling lacks comprehensive models, making failure prediction challenging with limited data.
Southwest's scheduling crisis was influenced by the deployment of reserve aircraft, which played a significant role in the system breakdown.
The research team's computational model analyzed public data on flight operations to identify hidden parameters related to aircraft reserves.
The study revealed how failures in one part of the network led to cascading effects, necessitating a drastic system reset by Southwest Airlines.
The research could lead to real-time monitoring systems for preemptive measures in detecting and preventing failures in complex cyber-physical systems.