DeepForm is introduced as the first reasoning Large Language Model (LLM) specifically designed for automated communication system formulation.
A large-scale, open-source dataset named Communication System Formulation Reasoning Corpus (CSFRC) is presented for training the DeepForm model in this domain.
DeepForm utilizes a two-stage training approach: Supervised Fine-Tuning with Chain-of-Thought (CoT) data for domain knowledge distillation, followed by a rule-based Reinforcement Learning (RL) algorithm, C-ReMax, for advanced modeling capabilities and reasoning patterns.
Extensive experiments show that DeepForm achieves state-of-the-art performance, surpassing larger proprietary LLMs in various scenarios, and related resources will be released to encourage future research in this field.