Web Table Extraction, Retrieval and Augmentation: A Survey
The paper provides a comprehensive overview for researchers and practitioners working with web tables, although, as a survey, its contribution is incremental.
This survey synthesizes two decades of research on web tables, organizes the literature into six information access tasks (including table extraction and table search), and describes seminal approaches and relevant resources.
Tables are a powerful and popular tool for organizing and manipulating data. A vast number of tables can be found on the Web, which represent a valuable knowledge resource. The objective of this survey is to synthesize and present two decades of research on web tables. In particular, we organize existing literature into six main categories of information access tasks: table extraction, table interpretation, table search, question answering, knowledge base augmentation, and table augmentation. For each of these tasks, we identify and describe seminal approaches, present relevant resources, and point out interdependencies among the different tasks.