Improving Mathematical Reasoning with Process Supervision | allainews.com

May 31, 2023, 7 a.m. |

OpenAI Blog openai.com

We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervision”) instead of simply rewarding the correct final answer (“outcome supervision”). In addition to boosting performance relative to outcome supervision, process supervision also has an important alignment benefit: it directly trains the model to produce a chain-of-thought that is endorsed by humans.

alignment art benefit boosting performance process reasoning state supervision trains

More from openai.com / OpenAI Blog

We’re bringing the Financial Times’ world-class journalism to ChatGPT 1 week, 1 day ago | openai.com

chatgpt class financial financial times +4

Introducing more enterprise-grade features for API customers 2 weeks ago | openai.com

api assistants costs customers +6

OpenAI’s commitment to child safety: adopting safety by design principles 2 weeks ago | openai.com

child children commitment companies +7

Introducing OpenAI Japan 3 weeks, 2 days ago | openai.com

asia gpt gpt-4 japan +4

Introducing improvements to the fine-tuning API and expanding our custom models program 1 month ago | openai.com

api build control custom models +6

Start using ChatGPT instantly 1 month ago | openai.com

benefits benefits of ai chatgpt experience +2

Navigating the Challenges and Opportunities of Synthetic Voices 1 month, 1 week ago | openai.com

challenges opportunities scale small +4

Sora: First Impressions 1 month, 1 week ago | openai.com

community creative feedback impressions +1

Global news partnerships: Le Monde and Prisa Media 1 month, 3 weeks ago | openai.com

chatgpt french global international +4

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead Data Engineer

@ WorkMoney | New York City, United States - Remote

View on ai-jobs.net