The Shapes of Bias in Spanish-Prompted LLMs and the Debiasing Prompt Scaffolds
Ian Rios-Sialer
Largelanguagemodelsencodeandamplifyhumansocialbias,andtheharmfallshardestonminoritized
groups. Mitigating that harm needs interventions tuned to each social context, yet until recently there
was no Spanish-language data to even measure such bias. We characterize bias in Spanish-prompted,
open-weight LLMs across model scale, combining behavioral, distributional studies on the new SESGO
benchmark, where an item is ambiguous when its text names no specific person (the correct answer is
unknown,abstain)anddisambiguated whenitspecifiesagroup. Wereporttwopreliminaryfindings. First,
smaller models are more biased: across 12 models abstention accuracy rises with scale and the error split
towardthemarginalizedgroupnarrows. Second,scalelowersuncertainty: forkingthechainofthought,the
larger Qwen3-14B commitsitsanswer atasingledecisivetokenwhilethe smaller Qwen3-0.6B accretes
it gradually. These preliminary results give future LATAM-context interventions, including the debiasing
promptscaffoldsweleavetofuturework,amapofwherebiaslives,andwereleasetheevaluationharness.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) The Shapes of Bias in Spanish-Prompted LLMs and the Debiasing Prompt Scaffolds
},
author={
Ian Rios-Sialer
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


