Aug. 19, 2022, 1:26 a.m. | /u/Vietname

Natural Language Processing www.reddit.com

I'm working with a dataset of about 200,000 jeopardy questions, and I'd like to use NLP to categorize them into broad categories (history, math, sports, etc).

From doing a cursory check of how many history questions I have (just checking if 'histor' appears in the category) I have about 450 history questions. Obviously there are more that don't have that string in the category, but it would take a fair bit of manual work to identify those.

Given that, and …

dataset languagetechnology small

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst (Commercial Excellence)

@ Allegro | Poznan, Warsaw, Poland

Senior Machine Learning Engineer

@ Motive | Pakistan - Remote

Summernaut Customer Facing Data Engineer

@ Celonis | Raleigh, US, North Carolina

Data Engineer Mumbai

@ Nielsen | Mumbai, India