DBMTRL-SCIDLNov 3, 2025

Towards Defect Phase Diagrams: From Research Data Management to Automated Workflows

arXiv:2511.01942h-index: 71
AI Analysis

This work addresses data management bottlenecks for materials science researchers, particularly in collaborative projects like CRC 1394, by providing an automated workflow system to streamline data integration and reuse.

The paper tackles the challenge of integrating heterogeneous experimental and simulation data from distributed sources to construct defect phase diagrams for materials design, by establishing a comprehensive research data management infrastructure that reduces friction in data capture and curation, enabling traceable and reusable datasets.

Defect phase diagrams provide a unified description of crystal defect states for materials design and are central to the scientific objectives of the Collaborative Research Centre (CRC) 1394. Their construction requires the systematic integration of heterogeneous experimental and simulation data across research groups and locations. In this setting, research data management (RDM) is a key enabler of new scientific insight by linking distributed research activities and making complex data reproducible and reusable. To address the challenge of heterogeneous data sources and formats, a comprehensive RDM infrastructure has been established that links experiment, data, and analysis in a seamless workflow. The system combines: (1) a joint electronic laboratory notebook and laboratory information management system, (2) easy-to-use large-object data storage, (3) automatic metadata extraction from heterogeneous and proprietary file formats, (4) interactive provenance graphs for data exploration and reuse, and (5) automated reporting and analysis workflows. The two key technological elements are the openBIS electronic laboratory notebook and laboratory information management system, and a newly developed companion application that extends openBIS with large-scale data handling, automated metadata capture, and federated access to distributed research data. This integrated approach reduces friction in data capture and curation, enabling traceable and reusable datasets that accelerate the construction of defect phase diagrams across institutions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes