Dec. 1, 2023, 4:51 p.m. | /u/ebursztein

Machine Learning www.reddit.com

Happy Friday,

Really happy to share that the code and model for RETVec our new SOTA robust text tokenizer for classification is available on Github [here](https://github.com/google-research/retvec/) and the NeurIPS paper [here](https://arxiv.org/abs/2302.09207). We also provide native support for TFLite and for the web via a TFJS. Hope you will find it useful for your research. If you would like to give it a try we have a get started [notebook](https://github.com/google-research/retvec/blob/main/notebooks/train_retvec_model_tf.ipynb).

Let us know if you have any questions.

machinelearning questions resilient text

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Senior Data Scientist

@ Highmark Health | PA, Working at Home - Pennsylvania

Principal Data Scientist

@ Warner Bros. Discovery | CA San Francisco 153 Kearny Street