Apr 27, 2026
ariant Bias In Genomic Foundation Models for Red Teaming Biological Security Screeners
Henry Wong
Genomic foundation models (GFMs) are increasingly used for red teaming biosecurity screening systems, yet their potential biases remain uncharacterized. If these models systematically favor certain pathogenic variants, red teaming exercises could leave critical security blind spots. We developed a systematic framework to evaluate variant bias by analyzing EvoDiff and Evo2's ability to generate diverse SARS-CoV-2 spike protein sequences. We generated 200 EvoDiff and 222 Evo2 sequences, then assessed structural quality, taxonomic classification, and variant diversity. Both models exhibited significant bias toward the original 2019 SARS-CoV-2 variant, with EvoDiff producing only 37 recognizable variants from 200 generations. Most critically, Evo2's generated sequences showed low perplexity scores (<30), directly contradicting published safety claims that pathogenic sequences should exhibit elevated perplexity. These findings reveal fundamental limitations in current GFMs that could systematically compromise biosecurity evaluations, highlighting the urgent need for comprehensive approaches to evaluating dual-use biological AI systems.
No reviews are available yet
Cite this work
@misc {
title={
(HckPrj) ariant Bias In Genomic Foundation Models for Red Teaming Biological Security Screeners
},
author={
Henry Wong
},
date={
4/27/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


