March 17, 2022, 7:06 a.m. | /u/Yura52

Machine Learning www.reddit.com

Hi! We introduce our new paper "On Embeddings for Numerical Features in Tabular Deep Learning".

Paper: [https://arxiv.org/abs/2203.05556](https://arxiv.org/abs/2203.05556)

Code: [https://github.com/Yura52/tabular-dl-num-embeddings](https://github.com/Yura52/tabular-dl-num-embeddings)

TL;DR: using embeddings for numerical features (i.e. using vector representations instead of scalar values) can lead to significant profit for tabular DL models.

Let's consider the vanilla MLP taking two numerical inputs.

https://preview.redd.it/yb55tdw27wn81.png?width=330&format=png&auto=webp&s=a6fc53e8611baee6993aab47480f0a6a6b85e46c

Now, here is the same MLP, but now with embeddings for numerical features:

https://preview.redd.it/zebl8tld7wn81.png?width=368&format=png&auto=webp&s=3d20652075d0543c7d6c70f34d67140bc2c6346b

The main contributions:

* we show that using vector representations instead of scalar representations …

deep learning dl features learning machinelearning numerical paper tabular

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US