QuoteKG: A Multilingual Knowledge Graph of Quotes
This provides a resource for exploring historical and personal contexts of quotes, addressing issues of authenticity and cross-lingual access for researchers and the general public.
The authors tackled the problem of misattributed or out-of-context quotes by creating QuoteKG, the first multilingual knowledge graph of quotes, which includes nearly one million quotes in 55 languages from over 69,000 public figures.
Quotes of public figures can mark turning points in history. A quote can explain its originator's actions, foreshadowing political or personal decisions and revealing character traits. Impactful quotes cross language barriers and influence the general population's reaction to specific stances, always facing the risk of being misattributed or taken out of context. The provision of a cross-lingual knowledge graph of quotes that establishes the authenticity of quotes and their contexts is of great importance to allow the exploration of the lives of important people as well as topics from the perspective of what was actually said. In this paper, we present QuoteKG, the first multilingual knowledge graph of quotes. We propose the QuoteKG creation pipeline that extracts quotes from Wikiquote, a free and collaboratively created collection of quotes in many languages, and aligns different mentions of the same quote. QuoteKG includes nearly one million quotes in $55$ languages, said by more than $69,000$ people of public interest across a wide range of topics. QuoteKG is publicly available and can be accessed via a SPARQL endpoint.