MATH-lighteval

Rating

ALL

Similar

DATASET

Webscale-RL

Agent Reinforcement Learning Open Dataset

Agent Function Calling Open Dataset Example

Agent Reinforcement Learning Open Dataset Example

Agent Tool Use Dialogue Open Dataset Example

Agent Tool Use Dialogue Open Dataset

Information

# Dataset Card for Mathematics Aptitude Test of Heuristics (MATH) dataset in lighteval format ## Table of Contents - [Table of Contents](#table-of-contents) - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Builder configs](#builder-configs) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ## Dataset Description - **Homepage:** https://github.com/hendrycks/math - **Repository:** https://github.com/hendrycks/math - **Paper:** https://arxiv.org/pdf/2103.03874.pdf - **Leaderboard:** N/A - **Point of Contact:** Dan Hendrycks ### Dataset Summary The Mathematics Aptitude Test of Heuristics (MATH) dataset consists of problems from mathematics competitions, including the AMC 10, AMC 12, AIME, and more. Each problem in MATH has a full step-by-step solution, which can be used to teach models to generate answer derivations and explanations. This version of the dataset contains appropriate builder configs s.t. it can be used as a drop-in replacement for the inexplicably missing \`lighteval/MATH\` dataset. ## Dataset Structure ### Data Instances A data instance consists of a competition math problem and its step-by-step solution written in LaTeX and natural language. The step-by-step solution contains the final answer enclosed in LaTeX's \`\boxed\` tag. An example from the dataset is: \`\`\` \{'problem': 'A board game spinner is divided into three parts labeled $A$, $B$ and $C$. The probability of the spinner landing on $A$ is $\\frac\{1\}\{3\}$ and the probability of the spinner landing on $B$ is $\\frac\{5\}\{12\}$. What is the probability of the spinner landing on $C$? Express your answer as a common fraction.', 'level': 'Level 1', 'type': 'Counting & Probability', 'solution': 'The spinner is guaranteed to land on exactly one of the three regions, so we know that the sum of the probabilities of it landing in each region will be 1. If we let the probability of it landing in region $C$ be $x$, we then have the equation $1 = \\frac\{5\}\{12\}+\\frac\{1\}\{3\}+x$, from which we have $x=\\boxed\{\\frac\{1\}\{4\}\}$.'\} \`\`\` ### Data Fields * \`problem\`: The competition math problem. * \`solution\`: The step-by-step solution. * \`level\`: The problem's difficulty level from 'Level 1' to 'Level 5', where a subject's easiest problems for humans are assigned to 'Level 1' and a subject's hardest problems are assigned to 'Level 5'. * \`type\`: The subject of the problem: Algebra, Counting & Probability, Geometry, Intermediate Algebra, Number Theory, Prealgebra and Precalculus. ### Data Splits * train: 7,500 examples * test: 5,000 examples ### Builder Configs * default: 7,500 train and 5,000 test examples (full dataset) * algebra: 1,744 train and 1,187 test examples * counting_and_probability: 771 train and 474 test examples * geometry: 870 train 479 test examples * intermediate_algebra: 1,295 train and 903 test examples * number_theory: 869 train and 540 test examples * prealgebra: 1,205 train and 871 test examples * precalculus: 746 train and 546 test examples ## Additional Information ### Licensing Information https://github.com/hendrycks/math/blob/main/LICENSE This repository was created from the [hendrycks/competition_math](https://huggingface.co/datasets/hendrycks/competition_math) dataset. All credit goes to the original authors. ### Citation Information \`\`\`bibtex @article\{hendrycksmath2021, title=\{Measuring Mathematical Problem Solving With the MATH Dataset\}, author=\{Dan Hendrycks and Collin Burns and Saurav Kadavath and Akul Arora and Steven Basart and Eric Tang and Dawn Song and Jacob Steinhardt\}, journal=\{arXiv preprint arXiv:2103.03874\}, year=\{2021\} \} \`\`\` ### Contributions Thanks to [@hacobe](https://github.com/hacobe) for adding this dataset.

Prompts

Reviews

Write Your Review

Detailed Ratings

ALL

Correctness

Helpfulness

Interesting

Upload Pictures and Videos

Name

Size

Type

Download

Last Modified

Community

Add Discussion

Upload Pictures and Videos

Chatbot close

Bot
Hi there
How can I help you today?

Send