Documentation
Learn how KahneBench evaluates Large Language Models for cognitive biases using Kahneman-Tversky dual-process theory.
Categories at a Glance
16 cognitive bias categories from Kahneman-Tversky research
Representativeness
Judging probability by similarity to prototypes rather than base rates
Availability
Judging frequency or probability by ease of recall
Anchoring
Over-reliance on initial information when making estimates
Loss Aversion
Losses loom larger than equivalent gains
Framing
Decisions affected by how options are presented
Reference Dependence
Outcomes evaluated relative to reference points rather than absolutely
Probability Distortion
Systematic misweighting of probabilities
Uncertainty Judgment
Errors in assessing and responding to uncertainty
About KahneBench
KahneBench is a comprehensive cognitive bias benchmark for evaluating Large Language Models. It is grounded in the dual-process theory articulated by Nobel laureate Daniel Kahneman, which distinguishes between the fast, automatic operations of System 1 and the slow, deliberate operations of System 2.
The benchmark tests 69 cognitive biases across 5 ecological domains, using 6 advanced metrics to provide a multi-dimensional "cognitive fingerprint" for any LLM.