Sept. 6, 2023, 9:15 p.m. | /u/_puhsu

Machine Learning www.reddit.com

I've been hearing lately that NNs are better than GBDTs when scaled up alot:

* Uber [https://www.uber.com/en-CA/blog/deepeta-how-uber-predicts-arrival-times/](https://www.uber.com/en-CA/blog/deepeta-how-uber-predicts-arrival-times/)
* Stripe [https://stripe.com/blog/how-we-built-it-stripe-radar](https://stripe.com/blog/how-we-built-it-stripe-radar)
* Most CTR papers coming from google are also NN based (like [https://arxiv.org/abs/2209.05310](https://arxiv.org/abs/2209.05310))
* Meta mentions NNs in their recommender system (also kind of a large scale tabular problem there) [https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagram-explore-recommendations-system](https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagram-explore-recommendations-system)
* Lyft forecasting [https://medium.com/this-week-in-machine-learning-ai/causal-models-in-practice-at-lyft-with-sean-taylor-1e62efd62385](https://medium.com/this-week-in-machine-learning-ai/causal-models-in-practice-at-lyft-with-sean-taylor-1e62efd62385)

What's your intuition on DL vs GBDT on (very)large-scale tabular datasets? Have you heard of other such examples (or the reverse)?

Are there any particularly …

data datasets examples hearing intuition machinelearning nns scale tabular tabular data test

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South