<ul><li>DeepForm is introduced as the first reasoning Large Language Model (LLM) specifically designed for automated communication system formulation.</li><li>A large-scale, open-source dataset named Communication System Formulation Reasoning Corpus (CSFRC) is presented for training the DeepForm model in this domain.</li><li>DeepForm utilizes a two-stage training approach: Supervised Fine-Tuning with Chain-of-Thought (CoT) data for domain knowledge distillation, followed by a rule-based Reinforcement Learning (RL) algorithm, C-ReMax, for advanced modeling capabilities and reasoning patterns.</li><li>Extensive experiments show that DeepForm achieves state-of-the-art performance, surpassing larger proprietary LLMs in various scenarios, and related resources will be released to encourage future research in this field.</li></ul>

DeepForm: Reasoning Large Language Model for Communication System Formulation

Discover more