CLMay 9

Fin-Bias: Comprehensive Evaluation for LLM Decision-Making under human bias in Finance Domain

arXiv:2605.0910690.3
Predicted impact top 31% in CL · last 90 daysOriginality Incremental advance
AI Analysis

For financial AI deployment, this work identifies a critical vulnerability (herding bias) and offers a mitigation, though the benchmark is domain-specific and the method is incremental.

Fin-Bias is a benchmark of 8,868 analyst reports that reveals LLMs herd explicit human biases in financial decision-making; a proposed detection method encourages independent thinking, with some models exceeding human performance in predicting future stock returns.

Large language models (LLMs) are increasingly deployed in financial contexts, raising critical concerns about reliability, alignment, and susceptibility to adversarial manipulation. While prior finance-related benchmarks assess LLMs' capabilities in stock trading, they are often restricted to small sample and fail to demonstrate LLM susceptibility to context with potential human bias. We introduce Fin-Bias (financial herding under long and uncertain financial context), a benchmark for evaluating LLM investment decision-making when faced with uncertainty and possible human-biased opinions. Fin-Bias includes 8868 long firm-specific analyst reports, including firm aspects summarized and analyzed by sophisticated analysts with investment ratings (Bullish/Neutral/Bearish) spanning from various industries. We present large language models with firm analyst reports with/without analyst investment ratings and even with 'fake' rating, to get investment ratings generated by LLMs. Our results reveal that LLMs tend to herd the explicit bias in context. We also develop a method to detect potential human opinions, which can encourage LLMs to think independently, some models even exceed human performance in predicting future stock return.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes