Math Reasoning Dataset

APTO Releases Training Dataset to Enhance the Mathematical Reasoning Capabilities of Large Language Models (LLMs)

TOKYO, Sept. 30, 2025 /PRNewswire/ -- As generative AI use continues to increase, accuracy has become the most important metric and a key factor in decisions around adoption and utilization. APTO is ...

Discover Magazine

How Leaky Datasets Undermine AI Math Reasoning Claims

Back in 2019, a group of computer scientists performed a now-famous experiment with far-reaching consequences for artificial intelligence research. At the time, machine vision algorithms were becoming ...

16d

World's Largest Dataset Of Olympiad-Level Maths Problems Created By MIT

You would be amazed to know that the countries competing in the International Mathematical Olympiad arrive with a booklet of their best, most original problems every year. Normally, these booklets get ...

VentureBeat

Meet LLEMMA, the math-focused open source AI that outperforms rivals

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more In a new paper, researchers from various ...

VentureBeat

New open-source math model Light-R1-32B surpasses equivalent DeepSeek performance with only $1000 in training costs

Researchers have introduced Light-R1-32B, a new open-source AI model optimized to solve advanced math problems. It is now available on Hugging Face under a permissive Apache 2.0 license — free for ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

EurekAlert!

Achieving >97% on GSM8K: Deeply understanding the problems makes LLMs better solvers for math word problems

Chain-of-Thought (CoT) prompting has enhanced the performance of Large Language Models (LLMs) across various reasoning tasks. However, CoT still falls short in dealing with complex math word problems, ...

Hosted on MSN

How AI is changing math competitions forever

From high school math modeling challenges to formal theorem-proving competitions, large language models (LLMs) are stepping into the competitive math arena. New datasets, benchmarks, and governance ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results