[R] Do people still believe in LLM emergent abilities? | allainews.com

Feb. 3, 2024, 8:50 p.m. | /u/uwashingtongold

Machine Learning www.reddit.com

Ever since \[Are emergent LLM abilities a mirage?\]([https://arxiv.org/pdf/2304.15004.pdf](https://arxiv.org/pdf/2304.15004.pdf)), it seems like people have been awfully quiet about emergence. But the big \[emergent abilities\]([https://openreview.net/pdf?id=yzkSU5zdwD](https://openreview.net/pdf?id=yzkSU5zdwD)) paper has this paragraph (page 7):

\> It is also important to consider the evaluation metrics used to measure emergent abilities (BIG-Bench, 2022). For instance, using exact string match as the evaluation metric for long-sequence targets may disguise compounding incremental improvements as emergence. Similar logic may apply for multi-step or arithmetic reasoning problems, where models are only …

apply big emergence evaluation evaluation metrics improvements incremental instance logic machinelearning match metrics reasoning string targets

More from www.reddit.com / Machine Learning

[P] Skyrim - Open-source model zoo for Large Weather Models an hour ago | www.reddit.com

ai models building capabilities fine-tuning +7

[P] Identify toxic underwater air bubbles lurking in the substrate with aquatic ultrasonic scans via … 3 hours ago | www.reddit.com

arduino classification color identify +11

[P] YARI - Yet Another RAG Implementation. Hybrid context retrieval 4 hours ago | www.reddit.com

api context cosine embedding +14

[D] Is EOS token crucial during pre-training? 8 hours ago | www.reddit.com

documents eos flow information +7

[D] Stack Overflow partnership with OPEN AI 10 hours ago | www.reddit.com

access chart chat chat gpt +16

[D] How does fast inference work with state of the art LLMs? 12 hours ago | www.reddit.com

70b art gpt gpt-4 +11

[D] Llama 3 Monstrosities 1 day, 3 hours ago | www.reddit.com

create easy life llama +4

[D] Get paid for peer reviews on ResearchHub 1 day, 6 hours ago | www.reddit.com

cryptocurrency editor machinelearning mind +6

[D] NER for large text data 1 day, 7 hours ago | www.reddit.com

billion data data scientist hello +8

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net