Apr 26, 2026
Biosecurity Screening Layer for Biology-Oriented AI Interfaces
Adejumobi Joshua, Taiwo Togun
Wittmann et al. (2025) demonstrated that generative protein design tools can produce variants of dangerous toxins that evade existing DNA synthesis screening by preserving biological function while rewriting amino acid sequence — a paraphrase attack. Existing screening systems fail because they rely on sequence similarity, asking does this look like a known toxin? rather than can this do what a toxin does? This project addresses an upstream gap: biology-oriented large language models (LLMs) that can assist users in designing such attacks. We present a four-layer, model-agnostic biosecurity screening system that sits at the interface of biology LLMs and prevents them from being used for protein paraphrase attacks. Our system combines intent detection, LLM-based self-critique, functional toxicity motif analysis, and ESM-2 embedding-based functional similarity scoring. Evaluated using ProLLaMA as the underlying biology LLM, the system blocks 96% of attack prompts while maintaining a 0% false positive rate on legitimate biology queries. Without screening, ProLLaMA answered 100% of attack prompts freely, demonstrating the critical need for this intervention.
This project addresses an important problem with a commendably rigorous approach. However, the current results show only limited success. The intent‑detection layer demonstrates partial effectiveness, as it still struggles to identify evasive prompts that avoid explicit toxin names, which is acknowledged. The function‑based screening using motifs and token embeddings moves in a promising direction, but the present implementation lacks sufficient specificity. Given the constraints of a hackathon timeframe, this effort is appreciated.
Good work for creatively choosing a problem to solve. Models like BioGPT/ProLLaMA are neglected by the biosecurity community and some of your findings here could potentially contribute to safer interfaces with such models. Given their open source nature, though, I'm not sure how tractable solving the uplift-from-bioLLMs problem is.
This is a practical and well-scoped project. You identify a real failure mode in current biosecurity systems and place the defense at a high-leverage point, the LLM interface. The layered design is a strong engineering choice, and the evaluation clearly shows that the intervention changes model behavior in a meaningful way.
The main weakness is confidence in the results. A 96% block rate with 0% false positives suggests the evaluation setup may be too clean. Real users will not submit obvious “attack prompts”. They will obfuscate intent, split tasks across multiple queries, or mix benign and harmful goals.
Also consider system-level limitations. A model-agnostic wrapper is useful, but attackers may bypass the interface entirely or fine-tune their own models. Address where this fits in a broader defense-in-depth strategy. If you can demonstrate robustness under adversarial conditions and clarify the biological validation layer, this could move from a solid engineering solution to a deployable safety system.
Cite this work
@misc {
title={
(HckPrj) Biosecurity Screening Layer for Biology-Oriented AI Interfaces
},
author={
Adejumobi Joshua, Taiwo Togun
},
date={
4/26/26
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}


