#cmu
3 notes.
Frank F. Xu
BPM/wiki/entities/frank-xu
Graham Neubig
BPM/wiki/entities/graham-neubig
TheAgentCompany: Benchmarking LLM Agents on Consequential Real-World Tasks
BPM/wiki/sources/2024-xu-the-agent-company-benchmark