A new method called GP-MoLFormer-Sim has been introduced for molecular optimization by leveraging contextual similarity guidance.
The method uses a generative Chemical Language Model (CLM) to navigate and sample from the molecular space while preserving similarity to a target molecule.
GP-MoLFormer-Sim adjusts the autoregressive sampling strategy based on molecular similarity estimates to maintain similarity in generated molecules.
The method, combined with a genetic algorithm (GA), outperforms existing training-free baseline methods in various molecular optimization benchmarks.