Papers
1–2 of 2Research Paper·Feb 10, 2026
Biases in the Blind Spot: Detecting What LLMs Fail to Mention
Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these *unverbalized biases*. Monitoring models via their...
6.0 viability
Research Paper·Jan 29, 2026
Investigating Associational Biases in Inter-Model Communication of Large Generative Models
Social bias in generative AI can manifest not only as performance disparities but also as associational bias, whereby models learn and reproduce stereotypical associations between concepts and demogra...
5.0 viability