July 30, 2022, 2:10 p.m. | /u/Certain-Ad827

Natural Language Processing www.reddit.com

I am trying to find a faster way to lemmatize the words in a list using NLTK's WordNetLemmatizer. It appears to be the most time-consuming step in my whole program: using the lemmatizer makes it about 20x slower.

Here is a [link](https://docs.google.com/document/d/1OdMjbQCBc0W-Nr8aTMphl94aCVorLM-A-LHijVVC7fQ/edit?usp=sharing) to my code, or you can read it below:

    import nltk
    from nltk.corpus import wordnet

    nltk.download('punkt')
    nltk.download('averaged_perceptron_tagger')
    nltk.download('wordnet')
    nltk.download('omw-1.4')

    lmtzr = nltk.WordNetLemmatizer().lemmatize

    def Get_wordnet_position_func(treebank_tag):
        if treebank_tag.startswith('J'):
            return wordnet.ADJ
        elif …
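One common way to cut this kind of cost is to memoize the lemmatizer, since real text repeats the same (word, tag) pairs many times. The sketch below is only an illustration under assumptions, not the poster's code: `treebank_to_wordnet`, `make_cached_lemmatizer`, and `lemmatize_tagged` are hypothetical helper names, and the mapping falls back to noun for any tag it does not recognize. It relies on the fact that WordNet's part-of-speech constants are the single letters `"a"`, `"v"`, `"n"`, and `"r"`. How much it helps depends on how often words repeat in the input.

```python
from functools import lru_cache
from typing import Callable, Iterable, List, Tuple

# WordNet POS constants are plain letters (wordnet.ADJ == "a", wordnet.VERB == "v",
# wordnet.NOUN == "n", wordnet.ADV == "r"), keyed here by the first letter
# of the Penn Treebank tag.
_TREEBANK_TO_WORDNET = {"J": "a", "V": "v", "N": "n", "R": "r"}


def treebank_to_wordnet(tag: str) -> str:
    """Map a Treebank tag to a WordNet POS letter (assumption: default to noun)."""
    return _TREEBANK_TO_WORDNET.get(tag[:1], "n")


def make_cached_lemmatizer(
    lemmatize: Callable[[str, str], str], maxsize: int = 100_000
) -> Callable[[str, str], str]:
    """Wrap any lemmatize(word, pos) callable in an LRU cache."""

    @lru_cache(maxsize=maxsize)
    def cached(word: str, pos: str) -> str:
        return lemmatize(word, pos)

    return cached


def lemmatize_tagged(
    tagged: Iterable[Tuple[str, str]], cached: Callable[[str, str], str]
) -> List[str]:
    """Lemmatize (word, treebank_tag) pairs, reusing cached results."""
    return [cached(word, treebank_to_wordnet(tag)) for word, tag in tagged]


# With NLTK installed and its data downloaded, the wiring would look like:
#   import nltk
#   cached = make_cached_lemmatizer(nltk.WordNetLemmatizer().lemmatize)
#   lemmas = lemmatize_tagged(nltk.pos_tag(tokens), cached)
```

Calling `nltk.pos_tag` once on the whole token list, rather than once per word, avoids repeated per-call overhead in the tagger and gives it the sentence context it expects.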

