Thought Anchors for Social Bias: Which Reasoning Steps Matter in Extended Thinking LLMs on Latin American Scenarios
Andres Felipe Mosquera Hernandez
We investigate which reasoning steps in extended-thinking LLMs are associated with pro-stereotypical outputs on Latin American social bias scenarios. Social bias benchmarks, including BBQ \citep{parrish2022bbq}, SESGO \citep{robles2024sesgo}, and EsBBQ/CaBBQ \citep{ruizfernandez2025esbbq}, measure final-answer distributions but provide no visibility into intermediate reasoning. We adapt the thought-anchor resampling framework \citep{bogdan2025thoughtanchors} to social bias measurement: for each sentence-level chunk in a model's chain-of-thought, we remove it, resample $K{=}3$ continuations, and measure the change in pro-stereotypical output probability via three importance metrics. We apply this pipeline to SESGO-small, an 80-item stratified pilot subset spanning four Latin American bias categories (\textit{género, clasismo, racismo, xenofobia}), two languages, and three recent models with extended thinking. Across 1,708 chunk-level attributions, disambiguated contexts show higher pro-stereotypical output rates than ambiguous ones (33–75\% vs.\ 0\%), and high-importance reasoning positions tend to be tagged as \texttt{logical\_reasoning} or \texttt{context\_recall} rather than \texttt{stereotype\_activation}. These pilot results suggest that bias-relevant moments may lie in general inference steps rather than overt stereotype invocations, offering a step-level diagnostic target that complements answer-level
benchmarks. More broadly, attributing bias to specific reasoning steps could help
localize where stereotyping enters a model's reasoning, though our findings remain preliminary.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Thought Anchors for Social Bias: Which Reasoning Steps Matter in Extended Thinking LLMs on Latin American Scenarios
},
author={
Andres Felipe Mosquera Hernandez
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


