DL · IR · Dec 16, 2016

Analyzing Web Archives Through Topic and Event Focused Sub-collections

arXiv:1612.05413v1 · 8 citations
Originality: Synthesis-oriented
AI Analysis

The paper addresses the difficulties researchers face when studying societal developments in web archives, but it appears incremental because it builds on existing archival practices.

The paper tackles the challenge of analyzing large, temporal web archives by proposing a methodology to extract and study topic- and event-focused sub-collections, though it does not report concrete numerical results.

Web archives capture the history of the Web and are therefore an important source to study how societal developments have been reflected on the Web. However, the large size of Web archives and their temporal nature pose many challenges to researchers interested in working with these collections. In this work, we describe the challenges of working with Web archives and propose the research methodology of extracting and studying sub-collections of the archive focused on specific topics and events. We discuss the opportunities and challenges of this approach and suggest a framework for creating sub-collections.
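To make the idea of a topic- and event-focused sub-collection concrete, the following is a minimal sketch, not the framework the paper proposes. It assumes a sub-collection can be approximated by keeping captures that fall inside an event's time window and mention at least one topic keyword. The `Capture` record, the `extract_subcollection` function, and the sample data are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Capture:
    """One archived snapshot of a page (hypothetical record layout)."""
    url: str
    captured_on: date
    text: str


def extract_subcollection(captures, keywords, start, end):
    """Keep captures dated within [start, end] whose text mentions any keyword.

    Matching is a plain case-insensitive substring test, an assumption made
    to keep the sketch short; a real pipeline would likely need more.
    """
    lowered = [k.lower() for k in keywords]
    return [
        c for c in captures
        if start <= c.captured_on <= end
        and any(k in c.text.lower() for k in lowered)
    ]


# Toy archive with invented example.org URLs.
archive = [
    Capture("http://example.org/a", date(2011, 3, 12), "Earthquake and tsunami hit Japan"),
    Capture("http://example.org/b", date(2011, 3, 14), "Football results"),
    Capture("http://example.org/c", date(2012, 6, 1), "Tsunami anniversary report"),
]

sub = extract_subcollection(archive, ["tsunami"], date(2011, 3, 1), date(2011, 4, 30))
```

Here only the first capture satisfies both the time window and the keyword filter. The third capture matches the topic but lies outside the event window, which illustrates why the temporal dimension matters when scoping a sub-collection.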

