How do you deal with predicting purchases where the purchases are extremely imbalanced and the data is extremely sparse. | allainews.com

April 12, 2024, 12:47 a.m. | /u/Terrible-Hamster-342

Data Science www.reddit.com

Dataset has 300 million rows. Only 1 million have purchases. So the dataset is extremely sparse.

I’m taking the one million purchases and taking a random sample of one million non purchases and training my model on that.

Is this approach feasible? Are there any other approaches people would recommend. Any papers on this?

Trying to predict conversions on an ads platform.

data datascience dataset deal random sample training

More from www.reddit.com / Data Science

A lot of post here discuss switching careers INTO data science. But what about the … 4 hours ago | www.reddit.com

careers data data science datascience +5

Is it true most ML/AI projects fail? Why is this? 4 hours ago | www.reddit.com

ai projects datascience ml projects multiple +2

Starting my data science career, how should I plan my summer? 5 hours ago | www.reddit.com

analyst career data data analyst +8

Am I really a Data Analyst? 11 hours ago | www.reddit.com

analyst data data analyst datascience +6

How good is Capital One for a first job out of grad school? 14 hours ago | www.reddit.com

brand capital capital one context +11

AI startup debuts “hallucination-free” and causal AI for enterprise data analysis and decision support 15 hours ago | www.reddit.com

ai system ai technologies artificial artificial intelligence +16

Reccomendations for blogs to follow 1 day, 2 hours ago | www.reddit.com

big blogs concepts datascience +7

Networking easier to get a job? 1 day, 8 hours ago | www.reddit.com

conversation datascience every hiring +10

How many companies out there are truly experimentation focused like Netflix? 1 day, 13 hours ago | www.reddit.com

articles check datascience every +9

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Machine Learning Research Scientist

@ d-Matrix | San Diego, Ca

View on ai-jobs.net