Augmenting Math Word Problems via Iterative Question Composing. (arXiv:2401.09003v3 [cs.CL] UPDATED)
cs.CL updates on arXiv.org
Despite the advancements in large language models (LLMs) for mathematical
reasoning, solving competition-level math problems remains a significant
challenge, especially for open-source LLMs without external tools. We introduce
the MMIQC dataset, comprising a mixture of processed web data and synthetic
question-response pairs, aimed at enhancing the mathematical reasoning
capabilities of base language models. Models fine-tuned on MMIQC consistently
surpass their counterparts in performance on the MATH benchmark across various
model sizes. Notably, Qwen-72B-MMIQC achieves 45.0% accuracy, exceeding the
previous …