Clustering with categorical data

Hello. I work with datasets that have categorical data like gender, state, industry, etc. I normally use k-means for clustering, but k-means can only handle numerical data.

Any suggestions for how I can cluster for categorical data?

Thanks!

1 Like

This general non-health-related question is best asked at stats.stackexchange.com