EIL
← All projects
LLM Behavioral Research · Active

LLM Bias Agents — Egocentric Bias & Naïve Realism

Behavioral interference experiments measuring egocentric bias, naïve realism, and cognitive inflexibility across frontier LLMs — with a procedural prompt debiasing library adapted from social-cognition research.

LLM evaluation Debiasing Behavioral experiments Multi-agent

Working hypothesis: LLM systems exhibit measurable egocentric bias, naïve realism, and cognitive inflexibility in comparative judgment. Procedural prompt interventions grounded in psychological debiasing theory can reduce these biases — measurably.

Measurement streams

ConstructMetricOperationalization
Naïve realismλ ∈ [0,1]; R = 1 − λBayesian belief updating via log-odds regression of model confidence across interference trials
Egocentric biasRCA-FDR; bias = RCA-FDR − 0.5Response Content Allocation — sentence counts in justification text addressing self-consistent vs counter-attitudinal evidence
Cognitive inflexibilityNLI; Ω = NLI_model / NLI_baselineResponse latency contrast: interference vs baseline conditions

Participants are models, not humans

The empirical backbone is a behavioral interference paradigm in which the participants are LLMs (Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.3, Grok 4.20). We measure baseline bias, apply procedural prompt interventions, then measure again — across multiple domains and models for validity.

The multi-agent paper pipeline

The research itself is produced by a four-agent workflow:

  • Lead Contributor — abstract, introduction, integration
  • Positioning Owner — related work, theoretical framing
  • Method Owner — paradigm, measurement streams, debiasing prompt library
  • Experiment Owner — design, stimuli, results narrative

Each agent has its own skill set (define structure, narrative synthesis, review & merge, consistency check, LaTeX final assembly, version control). The workflow is reusable across projects — it’s also what builds the DCN-V2 paper.