LGMTRL-SCIJan 23, 2022

ULSA: Unified Language of Synthesis Actions for Representation of Synthesis Protocols

arXiv:2201.09329v1
AI Analysis

This work addresses the problem of lacking high-quality datasets for AI-driven materials synthesis prediction, providing a foundation for autonomous robotic synthesis, though it is incremental as it focuses on a specific domain.

The authors tackled the challenge of extracting synthesis actions from scientific publications by proposing the Unified Language of Synthesis Actions (ULSA) for ceramics synthesis, creating a dataset of 3,040 annotated procedures and building a neural network model to map synthesis paragraphs into ULSA, which successfully captured essential vocabulary and features of synthesis protocols.

Applying AI power to predict syntheses of novel materials requires high-quality, large-scale datasets. Extraction of synthesis information from scientific publications is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. In this work, we propose the first Unified Language of Synthesis Actions (ULSA) for describing ceramics synthesis procedures. We created a dataset of 3,040 synthesis procedures annotated by domain experts according to the proposed ULSA scheme. To demonstrate the capabilities of ULSA, we built a neural network-based model to map arbitrary ceramics synthesis paragraphs into ULSA and used it to construct synthesis flowcharts for synthesis procedures. Analysis for the flowcharts showed that (a) ULSA covers essential vocabulary used by researchers when describing synthesis procedures and (b) it can capture important features of synthesis protocols. This work is an important step towards creating a synthesis ontology and a solid foundation for autonomous robotic synthesis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes