Extraction of Historical Events from Wikipedia
This work addresses the need for more comprehensive structured historical data for researchers and developers, though it is incremental as it builds on existing extraction methods.
The paper tackled the problem of extracting historical events from Wikipedia articles, which are not captured by existing structured data projects like DBpedia, and resulted in extracting about 121,000 events with over 325,000 links to DBpedia entities.
The DBpedia project extracts structured information from Wikipedia and makes it available on the web. Information is gathered mainly with the help of infoboxes that contain structured information of the Wikipedia article. A lot of information is only contained in the article body and is not yet included in DBpedia. In this paper we focus on the extraction of historical events from Wikipedia articles that are available for about 2,500 years for different languages. We have extracted about 121,000 events with more than 325,000 links to DBpedia entities and provide access to this data via a Web API, SPARQL endpoint, Linked Data Interface and in a timeline application.