[D] Character-level vs. word-level tokenization
May 21, 2022, 11:10 a.m. | /u/CodeAllDay1337
Machine Learning (www.reddit.com)
I'm relatively new to NLP. While reading Andrej Karpathy's 2015 blog post [The Unreasonable Effectiveness of Recurrent Neural Networks](http://karpathy.github.io/2015/05/21/rnn-effectiveness/), I started wondering about this part of the "Further Reading" section:
>Currently it seems that word-level models work better than character-level models, but this is surely a temporary thing.
Aren't most state-of-the-art models these days using some kind of vocabulary, i.e. whole words or at least sub-words? Text in the wild …
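To make the trade-off in the question concrete, here is a minimal sketch (plain Python, no NLP libraries; the sample sentence is my own) contrasting the two granularities: character-level tokenization gives a tiny vocabulary but long sequences, while word-level gives short sequences but a large vocabulary that can't cover unseen words. Sub-word schemes like BPE sit between the two.

```python
# Contrast character-level and word-level tokenization of the same string.
text = "tokenization splits text into units"

# Character-level: every character is a token.
# Vocabulary is tiny (letters + space), but the sequence is long.
char_tokens = list(text)

# Word-level: whitespace-delimited words are tokens.
# Sequence is short, but the vocabulary must contain every word,
# and any unseen word at inference time becomes out-of-vocabulary.
word_tokens = text.split()

print("char:", len(char_tokens), "tokens,", len(set(char_tokens)), "vocab items")
print("word:", len(word_tokens), "tokens,", len(set(word_tokens)), "vocab items")
```

Sub-word tokenizers (BPE, WordPiece, SentencePiece) resolve the tension by keeping a fixed-size vocabulary of frequent fragments, so rare words decompose into known pieces instead of becoming out-of-vocabulary.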