Last Updated: 2025-02-09
Code for Benchmarking Language Model Agents for Data-Driven Science
Detailed Ratings