Nov 23, 2025

GUARDIAN: Guarded Universal Architecture for Defensive Interpretation And traNslation

Aditya Thalang, Josh Brown

GUARDIAN is a multi-stage, LLM-driven system to automate the translation of C codebases to memory-safe Rust. GUARDIAN promotes defense acceleration at-scale by guiding an LLM transpiler with dependency graph strongly-connected-components, static-analysis-guided rule hints, examples from the demonstration corpora and iterative, compiler-guided refinement.

In evaluation on 27 C functions, including 20 with adversarial vulnerabilities, GUARDIAN achieves 100% compilation success and 92.6% fully safe outputs, outperforming a baseline LLM by 22.2pp. GUARDIAN demonstrates that safety-oriented constraints can significantly improve automated translation quality at scale. Limitations include evaluation on a small test set and a lack of functional-equivalence guarantees; future work will target repository-scale evaluation, expanding the classes of vulnerabilities covered by static analyses, adding functional-equivalence guarantees and robust evaluation sandboxing.

Review Project

See Code

Reviewer's Comments

Well-motivated project with a clear defensive use case. The report explains the problem and deployment path well, and the prototype results are encouraging. My main concerns are robustness and real-world reliability. I'd like to see a better argument for the broader qualitative impact of this project.

Cite this work

@misc {

title={

(HckPrj) GUARDIAN: Guarded Universal Architecture for Defensive Interpretation And traNslation

author={

Aditya Thalang, Josh Brown

date={

11/23/25

organization={Apart Research},

note={Research submission to the research sprint hosted by Apart.},

howpublished={https://apartresearch.com}

}

Recent Projects

View All

Feb 2, 2026

Markov Chain Lock Watermarking: Provably Secure Authentication for LLM Outputs

We present Markov Chain Lock (MCL) watermarking, a cryptographically secure framework for authenticating LLM outputs. MCL constrains token generation to follow a secret Markov chain over SHA-256 vocabulary partitions. Using doubly stochastic transition matrices, we prove four theoretical guarantees: (1) exponentially decaying false positive rates via Hoeffding bounds, (2) graceful degradation under adversarial modification with closed-form expected scores, (3) information-theoretic security without key access, and (4) bounded quality loss via KL divergence. Experiments on 173 Wikipedia prompts using Llama-3.2-3B demonstrate that the optimal 7-state soft cycle configuration achieves 100\% detection, 0\% FPR, and perplexity 4.20. Robustness testing confirms detection above 96\% even with 30\% word replacement. The framework enables $O(n)$ model-free detection, addressing EU AI Act Article 50 requirements. Code available at \url{https://github.com/ChenghengLi/MCLW}

Feb 2, 2026

Prototyping an Embedded Off-Switch for AI Compute

This project prototypes an embedded off-switch for AI accelerators. The security block requires periodic cryptographic authorization to operate: the chip generates a nonce, an external authority signs it, and the chip verifies the signature before granting time-limited permission. Without valid authorization, outputs are gated to zero. The design was implemented in HardCaml and validated in simulation.

Feb 2, 2026

Fingerprinting All AI Cluster I/O Without Mutually Trusted Processors

We design and simulate a "border patrol" device for generating cryptographic evidence of data traffic entering and leaving an AI cluster, while eliminating the specific analog and steganographic side-channels that post-hoc verification can not close. The device eliminates the need for any mutually trusted logic, while still meeting the security needs of the prover and verifier.

Feb 2, 2026