Feb. 3, 2024, 8:50 p.m. | /u/uwashingtongold

Machine Learning www.reddit.com

Ever since \[Are emergent LLM abilities a mirage?\]([https://arxiv.org/pdf/2304.15004.pdf](https://arxiv.org/pdf/2304.15004.pdf)), it seems like people have been awfully quiet about emergence. But the big \[emergent abilities\]([https://openreview.net/pdf?id=yzkSU5zdwD](https://openreview.net/pdf?id=yzkSU5zdwD)) paper has this paragraph (page 7):

\> It is also important to consider the evaluation metrics used to measure emergent abilities (BIG-Bench, 2022). For instance, using exact string match as the evaluation metric for long-sequence targets may disguise compounding incremental improvements as emergence. Similar logic may apply for multi-step or arithmetic reasoning problems, where models are only …

apply big emergence evaluation evaluation metrics improvements incremental instance logic machinelearning match metrics reasoning string targets

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote