Information retrieval in folktales using natural language processing
This work addresses information retrieval for literary analysis in the folktale domain. Its contribution is incremental: it applies existing NLP methods to a new, domain-specific dataset.
The paper tackles the problem of extracting information about literary characters from unstructured folktale texts, using natural language processing and a domain ontology based on Propp's model. The resulting system identifies the main characters and the passages in which they are described or act.
Our aim is to extract information about literary characters in unstructured texts. We employ natural language processing and reasoning on domain ontologies. The first task is to identify the main characters and the parts of the story where these characters are described or act. We illustrate the system in a scenario in the folktale domain. The system relies on a folktale ontology that we have developed based on Propp's model of folktale morphology.
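To make the first task concrete, the following minimal sketch shows one way that main characters and the sentences mentioning them could be located. It is not the authors' system: the lexicon `ROLE_LEXICON`, its term lists, and the functions `split_sentences`, `find_character_mentions`, and `main_characters` are all hypothetical, and a plain keyword lookup stands in for the NLP pipeline and ontology reasoning described above. Only the role names (hero, villain, helper, princess) are taken from Propp's dramatis personae.

```python
import re
from collections import Counter

# Hypothetical mini-lexicon standing in for a folktale ontology.
# Role names follow Propp; the term lists are invented for illustration.
ROLE_LEXICON = {
    "hero": {"prince"},
    "villain": {"dragon", "witch"},
    "princess": {"princess"},
    "helper": {"wolf"},
}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_character_mentions(text, lexicon=ROLE_LEXICON):
    """Map each lexicon term found in the text to the sentence indices mentioning it."""
    term_to_role = {t: role for role, terms in lexicon.items() for t in terms}
    mentions = {}
    for i, sentence in enumerate(split_sentences(text)):
        # Use a set so a term counts at most once per sentence.
        for token in set(re.findall(r"[a-z]+", sentence.lower())):
            if token in term_to_role:
                mentions.setdefault(token, []).append(i)
    return mentions


def main_characters(text, top_n=2, lexicon=ROLE_LEXICON):
    """Rank characters by the number of sentences that mention them."""
    term_to_role = {t: role for role, terms in lexicon.items() for t in terms}
    mentions = find_character_mentions(text, lexicon)
    counts = Counter({term: len(idx) for term, idx in mentions.items()})
    return [(term, term_to_role[term], mentions[term])
            for term, _ in counts.most_common(top_n)]


story = ("Once there was a prince. The prince met a wolf. "
         "A dragon took the princess. The prince fought the dragon.")
for term, role, sentence_ids in main_characters(story):
    print(term, role, sentence_ids)
```

Ranking by sentence count is only a crude proxy for "main character". A fuller approach would presumably need coreference resolution (so that "he" counts as a mention of the prince) and ontology reasoning to assign roles from actions rather than from fixed keywords.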