April 4, 2024, 6:42 p.m. | /u/o-rka

Data Science www.reddit.com

For instance say I have 1000 features that I cluster with algorithm A. I obtain another 500 features, I would like to use the existing cluster information without reclustering everything from the start.

Is there a clustering algorithm (ideally in sklearn and not k-means) that can handle this type of usage?

In one use case, the distance metric I plan on using will be jaccard since my data will be binary.

algorithm cluster clustering clustering algorithm datascience everything features information instance k-means sklearn update

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US