OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | allainews.com

Nov. 28, 2023, 11:03 p.m. | Allen Institute for AI

Allen Institute for AI www.youtube.com

Abstract:

There is growing evidence that pretraining on high quality, carefully thought-out tokens such as code or mathematics plays an important role in improving the reasoning abilities of large language models. For example, Minerva, a PaLM model finetuned on billions of tokens of mathematical documents from arXiv and the web, reported dramatically improved performance on problems that require quantitative reasoning. However, because all known open source web datasets employ preprocessing that does not faithfully preserve mathematical notation, the benefits of …

abstract arxiv code dataset documents evidence example language language models large language large language models mathematics minerva palm quality reasoning role text thought tokens web

More from www.youtube.com / Allen Institute for AI

Robot Learning by Understanding Egocentric Videos 4 days, 6 hours ago | www.youtube.com

abstract and natural language processing computer computer vision +24

Project Sidewalk: Crowd+AI Techniques to Map and Assess Every Sidewalk in the World 1 week, 1 day ago | www.youtube.com

ai techniques every jon map +4

LMQL Programming Large Language Models 2 weeks ago | www.youtube.com

berlin computer computer science eth +19

Does Generative AI Infringe Copyright? 2 weeks, 2 days ago | www.youtube.com

copyright digital family generative +5

Figuring out how the world works: causality in a world full of real people 1 month, 4 weeks ago | www.youtube.com

abstract ai systems alignment build +13

Machine-Checked Proofs, and the Rise of Formal Methods in Mathematics 2 months, 1 week ago | www.youtube.com

abstract artificial artificial intelligence assistant +16

Beyond Test Accuracies for Studying Deep Neural Networks 2 months, 2 weeks ago | www.youtube.com

abstract accuracy beyond community +12

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking 2 months, 3 weeks ago | www.youtube.com

abstract alignment discuss distribution +12

Integrated Systems for Computational Scientific Discovery 3 months ago | www.youtube.com

abstract ai research astronomy biology +13

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)

@ Palo Alto Networks | Santa Clara, CA, United States

View on ai-jobs.net

Consultant Senior Data Engineer F/H

@ Devoteam | Nantes, France

View on ai-jobs.net