An AI model named Darwin outperformed other models in text tasks related to materials science, achieving higher F1 scores in NER and RE tasks.
Entity resolution significantly improved performance in ER tasks by refining the inference outputs and correcting incorrect inferences.
Ablation experiments highlighted the effectiveness of each step in the standardization process, with optimizing through the expert dictionary showing the most improvement in ER.
Evaluation of the LLM classifier using manually labeled data demonstrated strong performance in classification tasks.
The Functional Materials Knowledge Graph (FMKG) containing 162,605 nodes and 731,772 edges stored vast information on functional materials like batteries, catalysts, and solar cells.
Analysis of the FMKG revealed Co2O3 as the most frequent material in the battery domain, with lithium-ion batteries being the most prevalent application.
Validation of the FMKG involved experts checking 500 triples, resulting in high entity and relation accuracy, especially in categories like 'Application' and 'Structure/Phase.'
While some labels like 'Descriptor' and 'Property' showed less precision due to broad diversity, overall results remained satisfactory in entity and relation accuracy.
Issues with misclassifications mainly stemmed from 'Formula' entities, impacting 'Name' and 'Acronym' classifications, but the overall impact was deemed insignificant.
The paper on this study is available on arXiv under a CC BY 4.0 DEED license, providing open access to the research findings.