De novo molecular design has extensive applications in drug discovery and materials science.
Efficient molecular generation and screening methods are essential for accelerating drug discovery and reducing costs.
Direct Preference Optimization (DPO) is adopted from NLP to optimize molecular properties by maximizing the likelihood difference between high- and low-quality molecules.
The proposed method achieved excellent scores on the GuacaMol Benchmark and demonstrated practical efficacy in target protein binding experiments.