AsyncFlow is an asynchronous streaming RL framework designed for efficient post-training of large language models.
It aims to address scalability bottlenecks faced by traditional RL frameworks and challenges in complex dataflows, resource idling, and workload imbalance.
AsyncFlow introduces distributed data storage and transfer modules, automated pipeline overlapping, and producer-consumer-based asynchronous workflows for improved computational efficiency.
The framework is decoupled from underlying training and inference engines, allowing for modular and customizable user experiences. Extensive experiments have shown a significant throughput improvement compared to existing baselines.