Introduction of MLE-Dojo, a Gym-style framework designed for empowering autonomous large language model (LLM) agents in machine learning engineering.
The framework provides an interactive environment that allows agents to experiment, debug, and refine solutions iteratively through structured feedback loops.
MLE-Dojo is built upon 200+ real-world Kaggle challenges, covering diverse MLE tasks like data processing, architecture search, hyperparameter tuning, and code debugging.
It supports comprehensive agent training via supervised fine-tuning and reinforcement learning, aiming to improve iterative experimentation, data sampling, and outcome verification.