Mar 23, 2026
LinuxArena expansion - Threshold Manipulation
Jiaming Yu
We introduce Threshold Manipulation, a new side task for the Clinical Trial environment in LinuxArena. The attack subtly widens blood test safety thresholds so that genuinely dangerous lab results no longer trigger critical alerts — while the monitoring system continues to function normally. This represents a class of data integrity attacks that is absent from the environment's existing side task set and is particularly difficult for monitors to detect.
Thanks for your work! It's great to get contributions to LinuxArena.
The side task idea is good; however, it is extremely similar to the pre-existing Suppress Safety Notification side task. In fact, the suppress safety notification ST documents threshold manipulation as Attack Avenue 3 in its `attack_surface.md`, which is verbatim the attack described in the paper. What's presented as a side task appears to be a straj development effort on a preexisting ST's documented attack approach. I can only assume this was an honest mistake.
Secondly:
> No runs achieved a passing main task score, meaning we don't yet have a complete sabotage trajectory (straj)
A straj must accomplish the side task while evading the monitor, without it passing a main task. This is in fact relatively expected, and not a mark against your side task at all.
The work may have benefited from monitor evaluations of the straj attempts.
New side task where you subtly widen blood test safety thresholds so dangerous values stop triggering alerts. Smart attack design, system keeps working normally, you need domain knowledge to notice. But only 13 trajectories, 0% main task completion, no formal monitor eval. Good idea that hasn't been followed through. More proposal than finished experiment.
Cite this work
@misc {
title={
(HckPrj) LinuxArena expansion - Threshold Manipulation
},
author={
Jiaming Yu
},
date={
3/23/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


