Sreeram Marimuthu

h-index14
1paper

1 Paper

DBJun 15, 2025Code
Humans, Machine Learning, and Language Models in Union: A Cognitive Study on Table Unionability

Sreeram Marimuthu, Nina Klimenkova, Roee Shraga

Data discovery and table unionability in particular became key tasks in modern Data Science. However, the human perspective for these tasks is still under-explored. Thus, this research investigates the human behavior in determining table unionability within data discovery. We have designed an experimental survey and conducted a comprehensive analysis, in which we assess human decision-making for table unionability. We use the observations from the analysis to develop a machine learning framework to boost the (raw) performance of humans. Furthermore, we perform a preliminary study on how LLM performance is compared to humans indicating that it is typically better to consider a combination of both. We believe that this work lays the foundations for developing future Human-in-the-Loop systems for efficient data discovery.