Apr 27, 2026

OliGraph: graph-based screening of large oligopools

Ryan Teo, Christopher Kent, Samuel A.R. Richardson

🏆 2nd Place Winner + 🏆 Track 1 Winner: DNA Screening & Synthesis Controls

Existing synthesis screening tools cannot evaluate short oligonucleotide pools, whose overlapping fragments can be reassembled into regulated sequences via polymerase cycling assembly (PCA) yet fall below gene-length detection thresholds. We present OliGraph, an open-source tool that constructs a bi-directed overlap graph from an oligonucleotide pool and extracts contigs for downstream gene-length screening. An optional PCA mode retains only cross-strand overlaps consistent with PCA chemistry. We validated OliGraph in a blinded study across ten simulated pools (70–9,184 oligonucleotides, 30–300 bp) spanning four risk categories. BLAST screening of individual oligonucleotides failed to identify sequences of concern in most pools: three returned zero hits, and vector noise obscured true positives in the remainder. After OliGraph assembly, contig-level BLAST matched the longest assembled sequences (up to 1,905 bp) to sequences of concern at 97–100% identity. In one pool, assembly collapsed 1,634 individual BLAST results into 10 hits from a single contig, all assigned to the same source organism. PCA mode correctly distinguished assemblable from non-assemblable fragments within the same pool. Two pools with no assemblable structure yielded no contigs. OliGraph processed all pools in under 0.2 seconds, fast enough for real-time order screening and consistent with proposals to bring oligonucleotide orders within the scope of synthesis screening regulation.

Reviewer's Comments

Reviewer's Comments

Arrow
Arrow
Arrow

This is an important piece of work in the sequence screening space to address potential vulnerabilities in oligo synthesis. While there are many existing tools for in silico assembly from contigs (e.g., https://edinburgh-genome-foundry.github.io/DnaCauldron/), this is one of the few works that combines the approach with a practical implementation of screening. Since PCA was first developed, many other assembly methods have been developed that use different techniques and short overlaps or none at all. OliGraph would benefit from supporting a larger set of genetic engineering techniques for assembly.

This is a strong submission. The problem is real, the gap is well-documented, and delivering a working open-source tool that closes it is a legitimate contribution. The assembly logic is technically sound — contig reconstruction, threshold precision, and signal separation from background noise all perform as described. The paper is unusually clear for a hackathon submission, and the dual-use appendix reflects genuine biosecurity literacy.

The natural next step is validation on real commercial order data. Self-generated test sets are a reasonable starting point for a hackathon, but the path to regulatory relevance runs through synthesis providers. Getting one or two to run OliGraph against their actual order pipelines would transform this from a promising proof of concept into something policymakers can cite. The EU Biotech Act angle is well-positioned — following through on provider coordination would make that connection concrete rather than aspirational.

On deployment, the paper frames OliGraph as a tool for synthesis providers but doesn't address how it fits within existing screening infrastructure. The cross-provider evasion problem is acknowledged but quickly set aside as a shared limitation. It deserves more than a footnote. A more complete deployment picture would include integration with customer order logs to flag repeat fragmented purchases across time. Current frameworks require providers to conduct both sequence screening and customer verification — knowing who is ordering, not just what. Extending that to assembly-aware oligopool screening would require policy consideration around flagging criteria, escalation thresholds, and what triggers a hold on an order. That operational layer is absent and worth developing.

The more interesting tension is with cryptographic screening like SecureDNA. In cryptographic screening, sequences never leave the provider's premises, and the screening service never sees them in plaintext. OliGraph requires the opposite: you have to reconstruct the full assembly in the clear to know what the pool encodes. These approaches are complementary but in fundamental tension. Making assembly-aware screening privacy-preserving is an open problem; multi-party computation over graph structures is an active research area, but has not been applied in this context. Engaging seriously with this would be the most significant contribution a follow-on version of this work could make.

Two practical issues worth fixing before wider deployment. The web interface doesn't expose the PCA mode toggle — the most biosecurity-relevant feature requires CLI access. And when the tool returns zero results, it does so silently, with no indication of whether the pool is clean or whether a parameter needs adjusting. For the non-specialist operator this paper envisions, that's a dead end.

One framing clarification: OliGraph is an assembly engine. Detection happens downstream in BLAST. Being precise about where OliGraph ends and screening begins would sharpen the contribution and set more accurate expectations for anyone building on this.

Solid foundation. worth developing further

What an exciting project! I learned a bunch by reading it. I have very little to critique here - the write-up was easy to follow (probably could have been somewhat shorter), the results were impressive, the limitations were stated very clearly. Good work.

Cite this work

@misc {

title={

(HckPrj) OliGraph: graph-based screening of large oligopools

},

author={

Ryan Teo, Christopher Kent, Samuel A.R. Richardson

},

date={

4/27/26

},

organization={Apart Research},

note={Research submission to the research sprint hosted by Apart.},

howpublished={https://apartresearch.com}

}

Recent Projects

Apr 27, 2026

BioRT-Bench: A Multi-Attack Red-Teaming Benchmark for Bio-Misuse Safeguards in Frontier LLMs

Frontier AI laboratories are expected to maintain safeguards against biological misuse, but whether deployed models actually refuse bio-misuse queries under adversarial pressure is largely unmeasured in the public literature. We introduce BioRT-Bench, a benchmark that runs four attack methods (direct request, PAIR, Crescendo, and base64 encoding) against four frontier models (Claude Sonnet 4.6, GPT-5.4, DeepSeek V4-flash, Kimi K2.5) across 40 prompts spanning five biosecurity-relevant categories. Responses are scored by a calibrated judge extending StrongREJECT with two bio-specific dimensions: specificity and actionability. We measure Attack Success Rate (ASR), where 0 means the model fully refused and 1 means it provided specific, actionable bio-misuse content. Our results reveal a sharp robustness divide: Chinese frontier models (DeepSeek, Kimi) have under 5% refusal rates even under direct request (ASR 0.88 and 0.79), while Western models (Claude, GPT) maintain substantially stronger safeguards (ASR 0.15 and 0.16). Crescendo is the most effective attack across all models, both in bypassing refusal and in eliciting actionable content. Claude Sonnet 4.6 is the most robust model tested, achieving 100% refusal against base64-encoded prompts.

Read More

Apr 27, 2026

PROTEUS (PROTein Evaluation for Unusual Sequences): Structure-Informed Safety Screening for de novo and Evasion-Prone Protein-Coding Sequences

AI protein design tools like RFdiffusion, ProteinMPNN, and Bindcraft make it trivial to produce low-homology sequences that fold into active, potentially hazardous architectures. However, sequence homology-based biosafety screening tools cannot detect proteins that pose functional risk through structurally novel mechanisms with no sequence precedent. We present a tiered computational pipeline that addresses this gap by combining MMseqs2 sequence alignment with structure-based comparison via FoldSeek and DALI against curated toxin databases totaling ~34,000 entries. AlphaFold2-predicted structures are screened for both global fold similarity (FoldSeek) and local active/allosteric site geometry (DALI), capturing convergent functional hazards that sequence screening misses. The pipeline was validated against a panel of toxins, benign proteins, structural mimics, and de novo-designed Munc13 binders, as well as modified ricin variants with residue substitutions. We additionally tested robustness to partial-synthesis evasion, where a bad actor submits multiple shorter coding sequences intended for downstream reassembly into a full toxin-coding gene. We found that while sequence-based screening did not identify any de novo ricin analogues with high certainty, the combined pipeline with FoldSeek and DALI identified all 24 tested de novo ricins as toxic.

Read More

Apr 27, 2026

OmnyraCloud

OmnyraCloud is protocol level biosecurity screening for cloud lab workflows. Today's tools screen DNA sequences at order time — but cloud labs run workflows, and a chain of individually benign steps (serial passage, split orders, surface obfuscation) can pursue a dangerous objective without a single flagged sequence. That's the gap we close.

OmnyraCloud ingests any lab protocol (Autoprotocol, Opentrons, JSON, or free text) and runs a 5-stage pipeline: decompose the workflow → score 5 risk dimensions → ground every flag in retrieved biosecurity literature → audit with LLM-as-judge → cross-check sequences via IBBIS commec. Output: an auditable risk report with citations, not a black box.

IBBIS flagged 1/3 dangerous protocols. Two sequences were screened but returned no HMM matches, but protocol level reasoning caught all three.

Protocol level screening isn't just complementary to sequence screening. It's essential.

Live at https://omnyra-cloud.vercel.app/

Read More

Apr 27, 2026

BioRT-Bench: A Multi-Attack Red-Teaming Benchmark for Bio-Misuse Safeguards in Frontier LLMs

Frontier AI laboratories are expected to maintain safeguards against biological misuse, but whether deployed models actually refuse bio-misuse queries under adversarial pressure is largely unmeasured in the public literature. We introduce BioRT-Bench, a benchmark that runs four attack methods (direct request, PAIR, Crescendo, and base64 encoding) against four frontier models (Claude Sonnet 4.6, GPT-5.4, DeepSeek V4-flash, Kimi K2.5) across 40 prompts spanning five biosecurity-relevant categories. Responses are scored by a calibrated judge extending StrongREJECT with two bio-specific dimensions: specificity and actionability. We measure Attack Success Rate (ASR), where 0 means the model fully refused and 1 means it provided specific, actionable bio-misuse content. Our results reveal a sharp robustness divide: Chinese frontier models (DeepSeek, Kimi) have under 5% refusal rates even under direct request (ASR 0.88 and 0.79), while Western models (Claude, GPT) maintain substantially stronger safeguards (ASR 0.15 and 0.16). Crescendo is the most effective attack across all models, both in bypassing refusal and in eliciting actionable content. Claude Sonnet 4.6 is the most robust model tested, achieving 100% refusal against base64-encoded prompts.

Read More

Apr 27, 2026

PROTEUS (PROTein Evaluation for Unusual Sequences): Structure-Informed Safety Screening for de novo and Evasion-Prone Protein-Coding Sequences

AI protein design tools like RFdiffusion, ProteinMPNN, and Bindcraft make it trivial to produce low-homology sequences that fold into active, potentially hazardous architectures. However, sequence homology-based biosafety screening tools cannot detect proteins that pose functional risk through structurally novel mechanisms with no sequence precedent. We present a tiered computational pipeline that addresses this gap by combining MMseqs2 sequence alignment with structure-based comparison via FoldSeek and DALI against curated toxin databases totaling ~34,000 entries. AlphaFold2-predicted structures are screened for both global fold similarity (FoldSeek) and local active/allosteric site geometry (DALI), capturing convergent functional hazards that sequence screening misses. The pipeline was validated against a panel of toxins, benign proteins, structural mimics, and de novo-designed Munc13 binders, as well as modified ricin variants with residue substitutions. We additionally tested robustness to partial-synthesis evasion, where a bad actor submits multiple shorter coding sequences intended for downstream reassembly into a full toxin-coding gene. We found that while sequence-based screening did not identify any de novo ricin analogues with high certainty, the combined pipeline with FoldSeek and DALI identified all 24 tested de novo ricins as toxic.

Read More

This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.