June 8, 2022, 5:33 p.m. | /u/AbdulWahabAbrar

Machine Learning www.reddit.com

I have a dataset of a website pulled out from Google Analytics that contains user’s subscription and demographic details for three months.

One customer could have multiple records in the dataset, a single customer could visit the same page or different pages in a month so it can not be removed.

If I run K means on this dataset, will the output/predictions be inaccurate and can have bias?

Or is there any approach or clustering method that I could use? …

clustering deal duplicate machinelearning

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Scientist

@ ITE Management | New York City, United States