CLJun 2, 2021

John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs

arXiv:2106.01060v1711 citations
Originality Incremental advance
AI Analysis

This addresses how language models handle causal inference, which is incremental as it builds on known human processing biases.

The study investigated whether pre-trained language models encode implicit causality bias from verbs and found that they do, but prioritize lexical patterns over higher-order signals when explicit cues conflict, leading to higher error rates.

Some interpersonal verbs can implicitly attribute causality to either their subject or their object and are therefore said to carry an implicit causality (IC) bias. Through this bias, causal links can be inferred from a narrative, aiding language comprehension. We investigate whether pre-trained language models (PLMs) encode IC bias and use it at inference time. We find that to be the case, albeit to different degrees, for three distinct PLM architectures. However, causes do not always need to be implicit -- when a cause is explicitly stated in a subordinate clause, an incongruent IC bias associated with the verb in the main clause leads to a delay in human processing. We hypothesize that the temporary challenge humans face in integrating the two contradicting signals, one from the lexical semantics of the verb, one from the sentence-level semantics, would be reflected in higher error rates for models on tasks dependent on causal links. The results of our study lend support to this hypothesis, suggesting that PLMs tend to prioritize lexical patterns over higher-order signals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes