github com

#RESEARCH #AI AGENT #BENCHMARK #TOOL LIBRARIES #DESKTOP USE #DATA ANALYSIS #SOFTWARE TESTING #MOBILE USE #AI AGENT MEMORY #SALES #TRANSLATION

Website

https://github.com/TheAgentCompany/TheAgentCompany

12.0

Last Updated: 2025-02-09

TheAgentCompany measures the progress of these LLM agents' performance on performing real-world professional tasks, by providing an extensible benchmark for evaluating AI agents that interact with the world in similar ways to those of a digital worker: by browsing the Web, writing code, running programs, and communicating with other coworkers.