Ferret is a comprehensive framework designed to enhance online accuracy of Online Continual Learning (OCL) algorithms while adapting to varying memory budgets.
Ferret employs fine-grained pipeline parallelism and an iterative gradient compensation algorithm to handle high-frequency data with minimal latency and counteract the challenge of stale gradients in parallel training.
The automated model partitioning and pipeline planning of Ferret optimize performance regardless of memory limitations.
Experimental results demonstrate that Ferret achieves up to 3.7 times lower memory overhead and outperforms competing methods across diverse memory budgets, making it an efficient and adaptive OCL framework for real-time environments.