7B Fully Open Source Moxin-LLM -- From Pretraining to GRPO-based Reinforcement Learning Enhancement

  • Moxin-LLM is a fully open-source Large Language Model (LLM) adhering to the principles of open science, open source, open data, and open access.
  • Moxin-LLM addresses transparency concerns by releasing its pre-training code, configurations, datasets, and checkpoints, enabling further innovation on LLMs.
  • Moxin-LLM undergoes several fine-tuning stages to enhance its reasoning capability, utilizing a post-training framework, instruction data, and Group Relative Policy Optimization (GRPO); a sketch of the group-relative update appears after this list.
  • Experiments show that Moxin-LLM achieves superior performance in zero-shot, few-shot, and Chain-of-Thought (CoT) evaluations.
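
To make the GRPO step concrete, here is a minimal Python sketch of the group-relative advantage computation and the clipped policy-gradient loss that GRPO builds on. The function names, tensor shapes, and the omission of the KL-to-reference penalty are illustrative assumptions, not Moxin-LLM's released implementation.

```python
import torch

def grpo_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """Group-relative advantages for G sampled responses to one prompt.

    GRPO drops the learned value-function baseline used by PPO and instead
    standardizes each response's reward against its own group's statistics.
    """
    return (rewards - rewards.mean()) / (rewards.std() + 1e-8)

def grpo_loss(logp_new: torch.Tensor,
              logp_old: torch.Tensor,
              advantages: torch.Tensor,
              clip_eps: float = 0.2) -> torch.Tensor:
    """PPO-style clipped surrogate averaged over the group (KL term omitted).

    logp_new / logp_old are the summed token log-probabilities of each
    response under the current and the sampling policy, both of shape (G,).
    """
    ratio = torch.exp(logp_new - logp_old)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return -torch.min(unclipped, clipped).mean()

# Example: four sampled answers to the same prompt, scored for correctness.
rewards = torch.tensor([1.0, 0.0, 1.0, 0.5])
print(grpo_advantages(rewards))  # positive for above-group-average answers
```

Because the baseline comes from the sampled group itself, this update needs no separate value network, which is part of GRPO's appeal for reasoning-focused post-training.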
