GUT-IS: A Data-Driven Approach to Integrating Constructs and Their Relations in Information Systems
For IS researchers, it addresses the problem of inconsistent construct definitions hindering cumulative knowledge development, but the approach is incremental.
The paper proposes a data-driven approach to integrate structural equation models by using text embeddings and clustering to group constructs, with a loss function balancing semantic purity and parsimony. Evaluated on two IS datasets, it enables analysis of how construct groupings change with different priorities.
Structural equation modeling is widely used in IS research. However, inconsistent construct definitions impede the cumulative development of knowledge. In this work, we present an approach that aims at the integration of structural equation models into a unified model: We use a combination of task-adapted text embeddings and clustering to produce a candidate set of construct groupings. Subsequently, we select the optimal solution using a loss function that explicitly trades off semantic purity and parsimony in the number of clusters. By making this trade-off explicit, our approach allows to analyze how construct groupings and their relations change as one shifts the priority from purity to parsimony. Empirically, we evaluate and explore the proposed methodology on two datasets from the IS domain.