Web: https://www.reddit.com/r/LanguageTechnology/comments/sdurp5/benchmarking_nlp_datasets/

Jan. 27, 2022, 9:47 a.m. | /u/DarthVader9396

Natural Language Processing reddit.com

Hello Everyone,

I am a newbie in NLP research. My question is - How should we benchmark a new Language dataset/corpus (ex. dialogue dataset, q/a dataset) when there is no publicly available dataset for that particular language? Also what are the possible directions to perform evaluation on the newly prepared dataset. Need suggestions, please.

submitted by /u/DarthVader9396
[link] [comments]

benchmarking datasets languagetechnology nlp

Data Analytics and Technical support Lead

@ Coupa Software, Inc. | Bogota, Colombia

Data Science Manager

@ Vectra | San Jose, CA

Data Analyst Sr

@ Capco | Brazil - Sao Paulo

Data Scientist (NLP)

@ Builder.ai | London, England, United Kingdom - Remote

Senior Data Analyst

@ BuildZoom | Scottsdale, AZ/ San Francisco, CA/ Remote

Senior Research Scientist, Speech Recognition

@ SoundHound Inc. | Toronto, Canada