
Agent-Eval-Refine Berkeley-NLP

Bing Rank
Average position in Bing search results for related queries such as 'Sales AI Agent' and 'Coding AI Agent'.
GitHub Star
Number of stars on the GitHub repository.

Last Updated: 2025-04-16

Information

Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
