Teaching Large Language Models (LLMs) to reason on graphs using Reinforcement Learning (RL).
Introducing G1 approach demonstrating significant improvement in LLMs' graph reasoning abilities through RL training on synthetic graph-theoretic tasks.
Curated Erdős dataset comprising diverse graph tasks used for RL training, resulting in substantial enhancements in graph reasoning.
RL-trained models exhibit strong generalization to unseen tasks, domains, and graph encoding schemes, indicating potential for building strong graph reasoners by finetuning LLMs with RL.