Keep Apart Research Going: Donate Today
Jul 1, 2024
An Exploration of Current Theory of Mind Evals
John Henderson, Alan Fung, Bachar Moustapha
Summary
We evaluated the performance of a prominent large language model from Anthropic, on the Theory of Mind evaluation developed by the AI Safety Institute (AISI). Our investigation revealed issues with the dataset used by AISI for this evaluation.
Cite this work:
@misc {
title={
An Exploration of Current Theory of Mind Evals
},
author={
John Henderson, Alan Fung, Bachar Moustapha
},
date={
7/1/24
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}