Sept. 19, 2022, 12:01 a.m. | /u/milliondollarhaircut

Natural Language Processing www.reddit.com

Inspired by [this post](https://www.reddit.com/r/LanguageTechnology/comments/xgy617/any_easy_tool_to_cherry_pick_rare_words_from_text/) from yesterday, I wrote a script that can either rank the words in an input string by rarity or return a list of only the rare words in that string.

[Here's the repo](https://github.com/cmwxyz/word-rarity), which contains a more in-depth description of how it works.
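
To give a rough idea of what "ranking by rarity" and "returning only the rare words" can look like, here is a minimal sketch. It is not the code from the repo: the `FREQ_PER_MILLION` table, its values, the function names, and the `threshold` parameter are all made up for illustration, and a real version would load frequencies from an actual corpus or word list.

```python
import re

# Hypothetical frequency table (occurrences per million words).
# The numbers are illustrative only, not taken from any real corpus.
FREQ_PER_MILLION = {
    "the": 50000.0,
    "on": 8000.0,
    "cat": 60.0,
    "sat": 40.0,
    "ottoman": 2.0,
}


def tokenize(text):
    """Lowercase the text and pull out runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def rank_by_rarity(text, freq=FREQ_PER_MILLION):
    """Return the unique words in text, rarest first.

    Words missing from the table are assumed to have frequency 0,
    so they sort as the rarest. Ties are broken alphabetically.
    """
    words = set(tokenize(text))
    return sorted(words, key=lambda w: (freq.get(w, 0.0), w))


def rare_words(text, threshold=10.0, freq=FREQ_PER_MILLION):
    """Return only the words whose frequency is below threshold, rarest first."""
    return [w for w in rank_by_rarity(text, freq) if freq.get(w, 0.0) < threshold]


if __name__ == "__main__":
    sentence = "The cat sat on the ottoman"
    print(rank_by_rarity(sentence))
    print(rare_words(sentence))
```

Treating unknown words as the rarest is a design choice in this sketch; it also means typos and names will show up as "rare", so a real tool might want to handle them separately.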

I wanted to turn this into a module, but that process is more involved than I realized, so it may be a few more days before I figure all that …
