Whodunnit? Crime Drama as a Case for Natural Language Understanding
This work addresses the challenge of complex inferences in natural language understanding for AI researchers, though it is incremental as it applies existing methods to a new domain-specific dataset.
The paper tackles the problem of natural language understanding by proposing crime drama as a new inference task, developing a dataset from CSI episodes and an LSTM-based model that uses multi-modal data, with results showing that an incremental inference strategy improves accuracy.
In this paper we argue that crime drama exemplified in television programs such as CSI:Crime Scene Investigation is an ideal testbed for approximating real-world natural language understanding and the complex inferences associated with it. We propose to treat crime drama as a new inference task, capitalizing on the fact that each episode poses the same basic question (i.e., who committed the crime) and naturally provides the answer when the perpetrator is revealed. We develop a new dataset based on CSI episodes, formalize perpetrator identification as a sequence labeling problem, and develop an LSTM-based model which learns from multi-modal data. Experimental results show that an incremental inference strategy is key to making accurate guesses as well as learning from representations fusing textual, visual, and acoustic input.