CLJun 2, 2021

John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs

Yova Kementchedjhieva, Mark Anderson, Anders Søgaard

arXiv:2106.01060v131.5711 citations

Originality Incremental advance

AI Analysis

This addresses how language models handle causal inference, which is incremental as it builds on known human processing biases.

The study investigated whether pre-trained language models encode implicit causality bias from verbs and found that they do, but prioritize lexical patterns over higher-order signals when explicit cues conflict, leading to higher error rates.

Some interpersonal verbs can implicitly attribute causality to either their subject or their object and are therefore said to carry an implicit causality (IC) bias. Through this bias, causal links can be inferred from a narrative, aiding language comprehension. We investigate whether pre-trained language models (PLMs) encode IC bias and use it at inference time. We find that to be the case, albeit to different degrees, for three distinct PLM architectures. However, causes do not always need to be implicit -- when a cause is explicitly stated in a subordinate clause, an incongruent IC bias associated with the verb in the main clause leads to a delay in human processing. We hypothesize that the temporary challenge humans face in integrating the two contradicting signals, one from the lexical semantics of the verb, one from the sentence-level semantics, would be reflected in higher error rates for models on tasks dependent on causal links. The results of our study lend support to this hypothesis, suggesting that PLMs tend to prioritize lexical patterns over higher-order signals.

View on arXiv PDF

Similar