Dec. 1, 2023, 4:51 p.m. | /u/ebursztein

Machine Learning www.reddit.com

Happy Friday,

Really happy to share that the code and model for RETVec, our new SOTA robust text tokenizer for classification, are available on GitHub [here](https://github.com/google-research/retvec/), and the NeurIPS paper is [here](https://arxiv.org/abs/2302.09207). We also provide native support for TFLite and for the web via TensorFlow.js (TFJS). We hope you find it useful for your research. If you would like to give it a try, we have a getting-started [notebook](https://github.com/google-research/retvec/blob/main/notebooks/train_retvec_model_tf.ipynb).

Let us know if you have any questions.
