Feb. 8, 2024, 10:10 p.m. | Harpreet Sahota

Artificialis - Medium medium.com

Assessing the Impact of Decoding Strategies on the Instruction Following Evaluation for Large Language Models Benchmark

Photo by Sean D on Unsplash

Recently I’ve been intellectually obsessed with two things:

  1. How do models generate text? (Trying to grok how various LLM decoding strategies impact the resulting generations)
  2. And how do we gauge how good they are at it? (The minefield known as LLM evaluation)

It’s not just idle curiosity. It’s my job.

I’ve been handed this cool yet daunting task: …

evaluation hugging face large language models llm

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South