May 27, 2024
LLM Benchmarking with Single-Agent Stochastic Dynamic Simulations
Sai Joseph, Anita Beroza, Eleni Angelou, Sofia Mendez, Evelyn Ciara
Summary
A benchmark for evaluating the performance of SOTA LLMs in dynamic real-world scenarios.
Cite this work:
@misc{joseph2024llm,
title={LLM Benchmarking with Single-Agent Stochastic Dynamic Simulations},
author={Sai Joseph and Anita Beroza and Eleni Angelou and Sofia Mendez and Evelyn Ciara},
date={2024-05-27},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}