IRAug 29, 2019

Towards More Usable Dataset Search: From Query Characterization to Snippet Generation

arXiv:1908.11146v125 citations
AI Analysis

This work addresses the need for more effective dataset search for researchers and developers, but it is incremental as it builds on existing search engine concepts.

The paper tackled the problem of improving dataset search usability by characterizing 1,947 real queries with a fine-grained scheme and proposing a query-centered framework, including snippet generation evaluated in a preliminary user study.

Reusing published datasets on the Web is of great interest to researchers and developers. Their data needs may be met by submitting queries to a dataset search engine to retrieve relevant datasets. In this ongoing work towards developing a more usable dataset search engine, we characterize real data needs by annotating the semantics of 1,947 queries using a novel fine-grained scheme, to provide implications for enhancing dataset search. Based on the findings, we present a query-centered framework for dataset search, and explore the implementation of snippet generation and evaluate it with a preliminary user study.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes