Information
Code for Benchmarking Language Model Agents for Data-Driven Science
Detailed Ratings