Aug 25, 2024
Web App for Interacting with Refusal-Ablated Language Model Agents
Simon Lermen
Summary
While many people, including policymakers, have had contact with
language models, they often hold outdated assumptions about them. A
significant fraction are unaware of agentic capabilities.
Furthermore, most models that are available online have safety
guardrails, so users rarely see how an unrestricted model behaves. We
want to demonstrate refusal-ablated agents to people to make them aware
of the potential for misuse. Giving people hands-on experience with
agentic AI, and perhaps having the agent operate against them, could
provide a better intuition about agency in
AI systems. We present a simple web app that allows users to
instruct and experiment with an unrestricted agent.
Cite this work:
@misc {
title={Web App for Interacting with Refusal-Ablated Language Model Agents},
author={Simon Lermen},
date={8/25/24},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}