Project title: Register Sensitivity in LLM Safety Responses: Evaluating How Linguistic Style Affects Scam Detection in South African Contexts
Nicoroy Zwane
This study investigates whether linguistic register, specifically the shift from formal English to informal South African WhatsApp-style English, affects how large language models respond to scam-style prompts grounded in African fraud contexts. We constructed a 12-prompt evaluation dataset across four scenarios (NSFAS bursary impersonation, bank OTP extraction, fake investment schemes, and fraudulent job recruitment), each presented in formal, neutral, and WhatsApp-style registers, and evaluated responses from ChatGPT, Gemini, and Claude. Results show meaningful register-dependent safety degradation, particularly in investment scheme scenarios where Gemini produced unsafe outputs under informal register while refusing the same request in formal English. We argue this represents a real deployment risk for African users and present the framework as a replicable template for region-specific AI safety benchmarking.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) Project title: Register Sensitivity in LLM Safety Responses: Evaluating How Linguistic Style Affects Scam Detection in South African Contexts
},
author={
Nicoroy Zwane
},
date={
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


