Jan. 5, 2022, 1:39 p.m. | Michael Berk

Towards Data Science - Medium towardsdatascience.com

A look into the SQuAD dataset, its top NLP models, and whether they are overfitting.

Have you ever wanted to build an algorithm that does your homework? While the tech isn’t there yet, we’re getting close.

Figure 1: models with top exact match (EM) scores on the SQuAD 2.0 dataset —src. Image by author.

In 2016, researchers at Stanford released a question answering dataset to train NLP models. Since then, there have been hundreds of models submitted, each …

data science learning machine machine learning nlp overfitting xlnet

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analytics & Insight Specialist, Customer Success

@ Fortinet | Ottawa, ON, Canada

Account Director, ChatGPT Enterprise - Majors

@ OpenAI | Remote - Paris