A recent study by researchers from Hainan University reports an advance in plant genomics: using large language models (LLMs) to decode plant DNA.
Traditional machine learning techniques have struggled to process plant genomic datasets, which are vast and complex and, unlike human languages, lack an explicit grammar.
The research demonstrates that LLMs can identify regulatory elements in genomic sequences, which improves understanding of gene expression and cellular function.
Different LLM architectures, such as encoder-only and decoder-only models, have been applied to plant genomic analysis and show promise for agricultural innovation.
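To make the distinction between the two architecture families concrete, here is a minimal sketch of how training examples might be built from a DNA sequence in each case. It is illustrative only: the overlapping k-mer tokenization, the 15% mask rate, and the function names `kmer_tokenize`, `masked_example`, and `causal_examples` are assumptions made for this example, not details taken from the study.

```python
import random


def kmer_tokenize(seq, k=3):
    """Split a DNA sequence into overlapping k-mers (an assumed tokenization)."""
    seq = seq.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def masked_example(tokens, mask_rate=0.15, seed=0):
    """Encoder-only style: hide some tokens; the model sees context on both sides."""
    rng = random.Random(seed)
    inputs, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_rate:
            inputs.append("[MASK]")
            targets[i] = tok  # position -> original token to recover
        else:
            inputs.append(tok)
    return inputs, targets


def causal_examples(tokens):
    """Decoder-only style: predict each token from the tokens that precede it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


if __name__ == "__main__":
    tokens = kmer_tokenize("ATGCGTACGTTAGC")
    print(masked_example(tokens))
    print(causal_examples(tokens)[:2])
```

In this sketch, the masked objective lets a model use sequence on both sides of a hidden position, while the causal objective only uses sequence to the left and so lends itself to generating sequences.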