TRIDENT is a new framework for molecular representation learning that integrates molecular SMILES, textual descriptions, and taxonomic functional annotations to learn rich molecular representations.
The framework utilizes a volume-based alignment objective to align tri-modal features globally and introduces a local alignment objective to capture detailed relationships between molecular substructures and their corresponding sub-textual descriptions.
TRIDENT achieves state-of-the-art performance on 11 downstream tasks, showcasing the benefits of combining multiple modalities for molecular property prediction.
The article presents a new approach in molecular property prediction that takes into account textual and taxonomic information, leading to improved performance across various tasks.