[D] Tabular Data: DL vs GBDTs on large scale datasets | allainews.com

Sept. 6, 2023, 9:15 p.m. | /u/_puhsu

Machine Learning www.reddit.com

I've been hearing lately that NNs are better than GBDTs when scaled up alot:

* Uber [https://www.uber.com/en-CA/blog/deepeta-how-uber-predicts-arrival-times/](https://www.uber.com/en-CA/blog/deepeta-how-uber-predicts-arrival-times/)
* Stripe [https://stripe.com/blog/how-we-built-it-stripe-radar](https://stripe.com/blog/how-we-built-it-stripe-radar)
* Most CTR papers coming from google are also NN based (like [https://arxiv.org/abs/2209.05310](https://arxiv.org/abs/2209.05310))
* Meta mentions NNs in their recommender system (also kind of a large scale tabular problem there) [https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagram-explore-recommendations-system](https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagram-explore-recommendations-system)
* Lyft forecasting [https://medium.com/this-week-in-machine-learning-ai/causal-models-in-practice-at-lyft-with-sean-taylor-1e62efd62385](https://medium.com/this-week-in-machine-learning-ai/causal-models-in-practice-at-lyft-with-sean-taylor-1e62efd62385)

What's your intuition on DL vs GBDT on (very)large-scale tabular datasets? Have you heard of other such examples (or the reverse)?

Are there any particularly …

data datasets examples hearing intuition machinelearning nns scale tabular tabular data test

More from www.reddit.com / Machine Learning

[D] Why do juniors (undergraduates or first- to second-year PhD students) have so many papers … 5 hours ago | www.reddit.com

academic conferences etc hello +12

[D] How can I detect the text orientation using MMOCR or MMDET models? 9 hours ago | www.reddit.com

example image images issue +5

[D] Current state of Chatbot pipelines in Commercial settings? 13 hours ago | www.reddit.com

build chatbot commercial current +12

[R] Training-free Graph Neural Networks and the Power of Labels as Features 17 hours ago | www.reddit.com

features free graph graph neural networks +6

[D] Modern best coding practices for Pytorch (for research)? 20 hours ago | www.reddit.com

coding config example good +14

[R] Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic … 22 hours ago | www.reddit.com

breaking data machinelearning model collapse +3

[P] I reproduced Anthropic's recent interpretability research 23 hours ago | www.reddit.com

anthropic attention basic capabilities +8

[R] KAN: Kolmogorov-Arnold Networks 1 day ago | www.reddit.com

abstract every function functions +11

[D] Looking for a recent study/paper/article that showed that an alternate model with a similar … 1 day ago | www.reddit.com

article conversation machinelearning nothing +4

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

View on ai-jobs.net

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South

View on ai-jobs.net