Jan. 5, 2022, 9 a.m. | /u/hughjonesd

Natural Language Processing www.reddit.com

So I'm working on some early English text. For example, sometimes "up" is spelled "vp", or "himself" might be "himselfe"... or it might not. Is there any advice or good practice for how to handle stemming/lemmata etc.? Has anyone got experience doing word embeddings with this kind of data?

submitted by /u/hughjonesd
[link] [comments]

advice languagetechnology stemming text

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Senior Product Manager - Real-Time Payments Risk AI & Analytics

@ Visa | London, United Kingdom

Business Analyst (AI Industry)

@ SmartDev | Cầu Giấy, Vietnam

Computer Vision Engineer

@ Sportradar | Mont-Saint-Guibert, Belgium

Data Analyst

@ Unissant | Alexandria, VA, USA

Senior Applied Scientist

@ Zillow | Remote-USA