Jan 24, 2025

Safe AI

Aryan Thakar, Liam Ho

The rapid adoption of AI in critical industries like healthcare and legal services has highlighted the urgent need for robust risk mitigation mechanisms. While domain-specific AI agents offer efficiency, they often lack transparency and accountability, raising concerns about safety, reliability, and compliance. The stakes are high, as AI failures in these sectors can lead to catastrophic outcomes, including loss of life, legal repercussions, and significant financial and reputational damage. Current solutions, such as regulatory frameworks and quality assurance protocols, provide only partial protection against the multifaceted risks associated with AI deployment. This situation underscores the necessity for an innovative approach that combines comprehensive risk assessment with financial safeguards to ensure the responsible and secure implementation of AI technologies across high-stakes industries.

Reviewer's Comments


Really like the concept. I am very bullish on the use case; I believe AI insurance will become really big within the next few years. Open questions/comments on my end: Insurance is a very special and highly regulated industry, so you would need someone on the founding team who knows it, as winning is most likely about execution speed and capital. You would also really want someone aligned doing this, as this new insurance player would have vast resources to hire statisticians and scientists to effectively do AI safety and AI risk work; the more technical and aligned the management is, the more deeply technical (and open/cooperative) the company would likely be. It would be great if you could work through an example insurance premium and risk calculation, e.g. for the cancer case, and put it on the slide (see the sketch after this comment). Open question on competition: why won't normal insurance companies be able to implement this (or how much time do you have to implement it before they do)?
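To make that request concrete, here is a minimal sketch of how such a premium could be priced from expected loss, assuming a hypothetical AI cancer-diagnosis agent. The case volume, error probability, claim cost, and loading factor below are illustrative assumptions, not figures from the submission.

# Hypothetical expected-loss pricing for insuring an AI diagnostic agent.
# Every number here is an illustrative assumption, not data from the project.

n_cases_per_year = 10_000      # assumed annual case volume handled by the agent
p_harmful_error = 1e-4         # assumed probability a case yields a compensable error
mean_claim_cost = 250_000.0    # assumed average payout per claim, in USD
loading_factor = 1.5           # assumed markup for expenses, capital, and profit

# Pure premium: expected annual loss = frequency x severity.
expected_annual_loss = n_cases_per_year * p_harmful_error * mean_claim_cost

# Gross premium: pure premium scaled by the loading factor.
annual_premium = expected_annual_loss * loading_factor

print(f"Expected annual loss: ${expected_annual_loss:,.0f}")  # -> $250,000
print(f"Annual premium:       ${annual_premium:,.0f}")        # -> $375,000

A real pricing model would replace these point estimates with full loss distributions and price to a tail quantile of annual losses (e.g. the 99th percentile) rather than to the mean alone.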

I appreciated the team's thorough exploration of the business case, beyond the technology.

I would have appreciated a deeper exploration of which legal provisions are currently employed when deploying AI agents into production (which is already happening).

I encourage the team to focus the MVP even further by choosing one particular segment or vertical and mapping all the existing commercially available AI agents that could be analyzed for safety and insurance policies.

The project shows some solid groundwork, especially in their methodology and pilot experiment planning - they've clearly thought through the implementation details and timeline, which is great. The demo looks promising too!

However, there are a couple of areas where it could be stronger. While they're targeting healthcare and legal AI, these fields are already pretty saturated with AI solutions - it would've been super helpful to see how they plan to differentiate from existing players in the market. This makes it a bit tricky to see how their research would scale into something truly impactful.

Also, from an AI safety perspective, the focus seems pretty narrow: they haven't really connected the dots between their technical approach and broader AI risk challenges, particularly x-risk. If they could expand their scope beyond just healthcare/legal and show how their risk assessment platform addresses larger AI safety concerns, that would make the project much more compelling.

The project focuses on managing financial downsides rather than making progress on AI safety; insurance payouts don't prevent catastrophic outcomes. The timeline seems too optimistic and overly focused on profitability. It's also unclear how they estimate the risk of the agents (one illustrative approach is sketched below).
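On that last point, one standard (and purely illustrative) way to estimate an agent's failure rate is to audit a sample of its decisions and put a credible interval on the per-case error probability. The audit counts below are hypothetical, and nothing in the submission indicates this is the team's actual method.

from scipy import stats

audited_cases = 2_000   # hypothetical number of audited agent decisions
harmful_errors = 1      # hypothetical number of compensable errors found in the audit

# With a uniform Beta(1, 1) prior, the posterior over the per-case error
# rate after observing k errors in n cases is Beta(k + 1, n - k + 1).
posterior = stats.beta(harmful_errors + 1, audited_cases - harmful_errors + 1)
lower, upper = posterior.ppf(0.05), posterior.ppf(0.95)

print(f"Point estimate of error rate: {harmful_errors / audited_cases:.1e}")
print(f"90% credible interval: [{lower:.1e}, {upper:.1e}]")

A cautious underwriter would feed the upper end of such an interval, rather than the point estimate, into a premium calculation like the one sketched above.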

Cite this work

@misc{thakar2025safeai,
  title={Safe AI},
  author={Aryan Thakar and Liam Ho},
  date={2025-01-24},
  organization={Apart Research},
  note={Research submission to the research sprint hosted by Apart.},
  howpublished={https://apartresearch.com}
}


This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.