Cleaning Twitter tweets
Sept. 28, 2022, 12:25 a.m. | /u/Ok-Vermicelli9298
Natural Language Processing www.reddit.com
I need to clean over 30 million tweets to train my ML model. I have tried regex and clean-text for basic cleaning, such as removing punctuation and emojis. But my script takes over an hour to run, which is not optimal. Is this usual? If not, what alternative libraries can I use?
I am quite new to NLP, so I have no clue about other libraries.
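For reference, here is a minimal sketch of the kind of regex cleaning described above. It compiles the patterns once and spreads the work across CPU cores with the standard library's `multiprocessing.Pool`. The specific patterns, the function names `clean_tweet` and `clean_all`, and the `workers` and `chunksize` values are assumptions chosen for illustration, not the poster's actual script, and the speedup will depend on the machine and the data.

```python
import re
from multiprocessing import Pool

# Compile each pattern once at import time instead of once per tweet.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
# Anything that is neither a word character nor whitespace:
# punctuation, symbols, and most emojis.
NON_WORD_RE = re.compile(r"[^\w\s]")
SPACE_RE = re.compile(r"\s+")


def clean_tweet(text: str) -> str:
    """Strip URLs, @mentions, punctuation and emojis, then lowercase."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = NON_WORD_RE.sub(" ", text)
    # Collapse the runs of spaces left behind by the substitutions.
    return SPACE_RE.sub(" ", text).strip().lower()


def clean_all(tweets, workers=4, chunksize=10_000):
    """Clean a list of tweets in parallel (illustrative defaults).

    A large chunksize keeps inter-process overhead low, because each
    worker receives thousands of tweets per task instead of one.
    """
    with Pool(workers) as pool:
        return pool.map(clean_tweet, tweets, chunksize=chunksize)
```

Calling `clean_tweet("Check this out!!! https://t.co/abc123 @someone #NLP 😀")` returns `"check this out nlp"`. When using `clean_all`, call it from under an `if __name__ == "__main__":` guard so that worker processes can import the script safely on platforms that spawn new interpreters.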