Evaluating LLM Safety Across Intra-Language Variations: A Study of Regional Spanish Slang
Fernando Gonzalez Ruiz, Alvaro Nuñez Perez, Itzel Aurora Hernandez Fernández , Ángel Alejandro Reynoso Delgado
Large Language Models (LLMs) are increasingly used by speakers across many linguistic communities, but most safety evaluations treat each language as a single uniform category. This project studies whether regional Spanish slang creates safety evaluation gaps that are not visible when only neutral Spanish is tested. We adapted 20 harmful-intent prompts from HarmBench into six Spanish variants: neutral Spanish, Guadalajara Mexican Spanish, Buenos Aires/Rioplatense Spanish, Santiago Chilean Spanish, Barcelona/Peninsular Spanish, and Uruguayan Spanish. We built two local pipelines: one exploratory pipeline for generating model responses with open-source models, and a second structured pipeline that evaluates all 120 prompt variants with three local Ollama-based judges focused on security, policy compliance, and adversarial/critical review. In the full judge run, all 120 prompts were evaluated by all three judges, producing 360 evaluations with no parse failures. The judges rated the prompts as high-risk overall across every Spanish variant, with mean risk scores ranging from 0.900 to 0.927. Disagreement was rare: only 3 of 120 prompts produced any safe/unsafe vote disagreement. These results do not show strong evidence that regional Spanish variants systematically bypassed the safety judges in this run, but they demonstrate a reproducible framework for testing intra-language safety variation and reveal where judge disagreement can flag prompts for closer review.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Evaluating LLM Safety Across Intra-Language Variations: A Study of Regional Spanish Slang
},
author={
Fernando Gonzalez Ruiz, Alvaro Nuñez Perez, Itzel Aurora Hernandez Fernández , Ángel Alejandro Reynoso Delgado
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


