CLAIAug 16, 2024

Chain of Thought Still Thinks Fast: APriCoT Helps with Thinking Slow

arXiv:2408.08651v36 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses bias mitigation for more robust and fair language models, though it is incremental as it builds on existing CoT methods.

The paper tackles the problem of language models absorbing biases from training data, which affects answer choices in MMLU tasks, and introduces APriCoT to reduce bias while improving accuracy.

Language models are known to absorb biases from their training data, leading to predictions driven by statistical regularities rather than semantic relevance. We investigate the impact of these biases on answer choice preferences in the Massive Multi-Task Language Understanding (MMLU) task. Our findings show that these biases are predictive of model preference and mirror human test-taking strategies even when chain of thought (CoT) reasoning is used. To address this issue, we introduce Counterfactual Prompting with Agnostically Primed CoT (APriCoT). We demonstrate that while Counterfactual Prompting with CoT alone is insufficient to mitigate bias, APriCoT effectively reduces the influence of base-rate probabilities while improving overall accuracy. Our results suggest that mitigating bias requires a slow thinking process which CoT alone may not provide as it tends to reinforce fast thinking model bias under some prompting methodologies. APriCoT is a step toward developing more robust and fair language models that can think slow.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes