LGCHEM-PHJun 23, 2025

Federated Learning from Molecules to Processes: A Perspective

arXiv:2506.18525v12 citationsh-index: 13
Originality Synthesis-oriented
AI Analysis

This is an incremental application of an existing method to a new domain, aiming to solve data privacy and collaboration issues for chemical companies.

The paper addresses the problem of data silos in chemical engineering by proposing federated learning to enable collaborative model training without sharing proprietary data, demonstrating in case studies that federated models achieve significantly higher accuracy than individual company models and perform similarly to models trained on combined datasets.

We present a perspective on federated learning in chemical engineering that envisions collaborative efforts in machine learning (ML) developments within the chemical industry. Large amounts of chemical and process data are proprietary to chemical companies and are therefore locked in data silos, hindering the training of ML models on large data sets in chemical engineering. Recently, the concept of federated learning has gained increasing attention in ML research, enabling organizations to jointly train machine learning models without disclosure of their individual data. We discuss potential applications of federated learning in several fields of chemical engineering, from the molecular to the process scale. In addition, we apply federated learning in two exemplary case studies that simulate practical scenarios of multiple chemical companies holding proprietary data sets: (i) prediction of binary mixture activity coefficients with graph neural networks and (ii) system identification of a distillation column with autoencoders. Our results indicate that ML models jointly trained with federated learning yield significantly higher accuracy than models trained by each chemical company individually and can perform similarly to models trained on combined datasets from all companies. Federated learning has therefore great potential to advance ML models in chemical engineering while respecting corporate data privacy, making it promising for future industrial applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes