Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models

Narrative over Numbers asks whether large language models inherit one of the more stubborn quirks of human moral psychology: the tendency to care more about a vivid individual than an equivalent statistical group. The short answer is yes, but the interesting answer is that alignment and reasoning do not merely dampen the bias. Sometimes they reshape it, sometimes they amplify it, and occasionally they flip it.

Abstract

The Identifiable Victim Effect (IVE)--the tendency to allocate greater resources to a specific, narratively described victim than to a statistically characterized group facing equivalent hardship--is one of the most robust findings in moral psychology and behavioral economics. As large language models (LLMs) assume consequential roles in humanitarian triage, automated grant evaluation, and content moderation, a critical question arises: do these systems inherit the affective irrationalities present in human moral reasoning? We present the first systematic, large-scale empirical investigation of the IVE in LLMs, comprising $N=51{,}955$ validated API trials across 16 frontier models spanning nine organizational lineages. Using a suite of ten experiments--porting and extending canonical paradigms from Small et al. (2007) and Kogut and Ritov (2005)--we find that the IVE is prevalent but strongly modulated by alignment training. Instruction-tuned models exhibit extreme IVE ($d=1.56$), while reasoning-specialized models invert the effect (down to $d=-0.85$). The pooled effect ($d=0.223$, $p=2 \times 10^{-6}$) is approximately twice the single-victim human meta-analytic baseline ($d \approx 0.10$) reported by Lee and Feeley (2016)--and likely exceeds the overall human pooled effect by a larger margin, given that the group-victim human effect is near zero. Standard Chain-of-Thought (CoT) prompting--contrary to its role as a deliberative corrective--nearly triples the IVE effect size (from $d=0.15$ to $d=0.41$), while only utilitarian CoT reliably eliminates it. We further document psychophysical numbing, perfect quantity neglect, and marginal in-group/out-group cultural bias, with implications for AI deployment in humanitarian and ethical decision-making contexts. Our code and data are publicly available at this https URL.

A compact visual tour of the paper's central question: when moral choice is framed as a person versus a statistic, what do LLMs do?

At a Glance

51,955 validated API trials

16 frontier LLMs audited

10 IVE experiments

d = 0.223 pooled IVE effect

Core Question

The paper ports and extends classic identifiable-victim paradigms to modern LLMs. Each model is placed in allocation settings where a vivid individual and a statistical group face equivalent hardship. The experiment then asks whether narrative specificity changes the model's resource allocation.

The twist is not simply that LLMs show the bias. The more useful finding is that model family, alignment style, and prompting regime all change the direction and strength of the effect.

Key Findings

Alignment Matters

Instruction-tuned models show a strong identifiable-victim preference, while reasoning-specialized systems can invert the effect.

Reasoning Is Not Neutral

Standard Chain-of-Thought prompting nearly triples the IVE effect size instead of reliably correcting it.

Utility Framing Helps

Only utilitarian CoT consistently removes the bias, suggesting that the form of deliberation matters more than deliberation by itself.

Bias Has Texture

The study also documents psychophysical numbing, quantity neglect, and marginal cultural in-group/out-group effects.

Why It Matters

Humanitarian triage, grant review, and moderation pipelines often involve emotionally asymmetric cases: one story may be vivid, while another arrives as a number. If LLMs are used in such settings, their moral behavior cannot be audited only through accuracy, helpfulness, or general preference alignment. We also need to ask whether they preserve, amplify, or suppress human affective distortions when the stakes are not merely linguistic.

BibTeX

@article{raiyan2026narrative,
  title={Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models},
  author={Raiyan, Syed Rifat},
  journal={arXiv preprint arXiv:2604.12076},
  year={2026}
}