ProCALM (Protein Conditionally Adapted Language Model) is introduced as an approach for generating proteins with desired functions using adapters with protein language models.
Existing methods in protein language models for conditional generation are limited and lack generalization to unseen functions.
ProCALM utilizes adapters to facilitate the conditional generation of proteins based on versatile representations of protein function like enzyme family, taxonomy, or natural language descriptions.
The approach involves finetuning ProGen2 to enable generation conditioned on specific functions, showcasing improved performance compared to current methods.
ProCALM demonstrates the ability to generalize to rare and unseen functions, surpassing existing approaches in conditional sequence generation.
The method is highlighted for its flexibility, computational efficiency, and potential applicability to a broad array of generative language models.