How do you deal with predicting purchases where the purchases are extremely imbalanced and the data is extremely sparse. | allainews.com

April 12, 2024, 12:47 a.m. | /u/Terrible-Hamster-342

Data Science www.reddit.com

Dataset has 300 million rows. Only 1 million have purchases. So the dataset is extremely sparse.

I’m taking the one million purchases and taking a random sample of one million non purchases and training my model on that.

Is this approach feasible? Are there any other approaches people would recommend. Any papers on this?

Trying to predict conversions on an ads platform.

data datascience dataset deal random sample training

More from www.reddit.com / Data Science

Imposter Colleagues Taking My Work 5 hours ago | www.reddit.com

analysts analytics colleagues datascience +10

Rshiny is dog shit 8 hours ago | www.reddit.com

code datascience debug debugging +5

When the word is all about LLMs and GenAI and you are still using linear … 9 hours ago | www.reddit.com

algorithm basic current datascience +11

suggestions for a new DS team leader 12 hours ago | www.reddit.com

boss dashboards datascience leader +7

Are any companies good at onboarding data people? Are there any effective data analytics bosses/leaders? 13 hours ago | www.reddit.com

analytics boss bosses companies +11

What's the most important technical skill for an ML Engineer? 14 hours ago | www.reddit.com

datascience engineer ml engineer skill +1

What is Spark demand currently? 23 hours ago | www.reddit.com

databricks datascience demand language +7

Multivariate multi-output time series forecasting 1 day, 13 hours ago | www.reddit.com

car confidence datascience forecast +14

What field or scope are you working on and how often is there a "regime … 1 day, 15 hours ago | www.reddit.com

change datascience mean model retraining +3

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net