March 8, 2024, 3:52 a.m. | /u/H2O3N4

Machine Learning www.reddit.com

Autoregressive prediction has a problem: whether you're asking what color the sky is or to prove the Riemann hypothesis, the amount of compute to generate the next token is the exact same, but it seems obvious which of the two questions requires more compute to answer. So, engineers toil on how to extend an autoregressive model's capabilities to be able to think, for a variable amount of time, before speaking. Here is the solution (exercise left to the reader).

 

A …

autoregressive models color compute engineers generate giving hypothesis machinelearning next prediction prove questions space think token

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Reporting & Data Analytics Lead (Sizewell C)

@ EDF | London, GB

Data Analyst

@ Notable | San Mateo, CA