all AI news
[Discussion] How to deal with duplicate IDs in K means clustering
June 8, 2022, 5:33 p.m. | /u/AbdulWahabAbrar
Machine Learning www.reddit.com
One customer could have multiple records in the dataset, a single customer could visit the same page or different pages in a month so it can not be removed.
If I run K means on this dataset, will the output/predictions be inaccurate and can have bias?
Or is there any approach or clustering method that I could use? …
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior Data Scientist
@ ITE Management | New York City, United States