Jan. 23, 2024, 4:20 a.m. | /u/ToughDecision5516

Natural Language Processing www.reddit.com

I need to normalize medical terms within a large corpus, where variations in spelling exist for words with the same meaning (e.g., x-ray, xray, xr). Manual normalization is impractical due to the size of the dataset. I'm looking for a solution, such as a specialized lemmatization or stemming method tailored for medical terms, as conventional methods are not effective in this context. Any suggestions or solutions are appreciated.

dataset keywords languagetechnology lemmatization meaning medical normalization ray solution stemming terms words x-ray

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Data Scientist (Database Development)

@ Nasdaq | Bengaluru-Affluence