CL LG · Jun 16, 2021

Disentangling Online Chats with DAG-Structured LSTMs

Duccio Pappadopulo, Lisa Bauer, Marco Farina, Ozan İrsoy, Mohit Bansal

arXiv:2106.09024v1 · 31.4711 citations

Originality: Highly original

AI Analysis

This work addresses the challenge of recovering the structure of complex multi-party chats, a known bottleneck for downstream tasks such as summarization and question answering. It represents an incremental improvement that applies a novel method to that bottleneck.

The paper tackles the problem of disentangling interwoven sub-conversations in online chat logs, such as those in the Ubuntu IRC dataset. It proposes DAG-LSTMs for the task, a model that achieves state-of-the-art performance in recovering reply-to relations and is competitive on other disentanglement metrics.

Many modern messaging systems allow fast and synchronous textual communication among many users. The resulting sequence of messages hides a more complicated structure in which independent sub-conversations are interwoven with one another. This poses a challenge for any task aiming to understand the content of the chat logs or gather information from them. The ability to disentangle these conversations is then tantamount to the success of many downstream tasks such as summarization and question answering. Structured information accompanying the text such as user turn, user mentions, timestamps, is used as a cue by the participants themselves who need to follow the conversation and has been shown to be important for disentanglement. DAG-LSTMs, a generalization of Tree-LSTMs that can handle directed acyclic dependencies, are a natural way to incorporate such information and its non-sequential nature. In this paper, we apply DAG-LSTMs to the conversation disentanglement task. We perform our experiments on the Ubuntu IRC dataset. We show that the novel model we propose achieves state of the art status on the task of recovering reply-to relations and it is competitive on other disentanglement metrics.
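As a rough illustration of how structured cues such as user turns and user mentions can turn a flat chat log into a directed acyclic graph, the sketch below builds edges between messages. The abstract does not specify the edge definitions used in the paper, so the three edge types here (previous message, same user's previous message, most recent message of a mentioned user), the `Message` class, and the `build_dag_edges` helper are all assumptions made for illustration, not the authors' construction. A DAG-LSTM would then process the messages along such edges rather than along a single sequence.

```python
from dataclasses import dataclass


@dataclass
class Message:
    # Hypothetical minimal message record; real logs also carry timestamps.
    user: str
    text: str


def build_dag_edges(messages):
    """Return sorted (source, target) index pairs, always with source < target.

    Edge types are illustrative assumptions, not the paper's definition.
    """
    edges = set()
    last_by_user = {}  # user -> index of that user's most recent message
    for i, msg in enumerate(messages):
        if i > 0:
            edges.add((i - 1, i))  # sequential edge from the previous message
        if msg.user in last_by_user:
            edges.add((last_by_user[msg.user], i))  # user-turn edge
        # Naive mention detection: a token equal to a known user name.
        tokens = {tok.strip(",:;.!?@") for tok in msg.text.split()}
        for other, j in last_by_user.items():
            if other != msg.user and other in tokens:
                edges.add((j, i))  # mention edge
        last_by_user[msg.user] = i
    return sorted(edges)


chat = [
    Message("alice", "how do I mount a usb drive?"),
    Message("bob", "anyone know why apt is locked?"),
    Message("carol", "alice: try sudo mount /dev/sdb1 /mnt"),
    Message("alice", "carol: thanks, that worked"),
]
edges = build_dag_edges(chat)
print(edges)
```

Because every edge points from an earlier message to a later one, the resulting graph cannot contain a cycle, which is the property a DAG-structured recurrence relies on.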
