Hi All, Does anyone know what algorithm for clustering categorical variables? R packages? Which is the best?
If a data has both numeric and categorical data, what is the best clustering algorithm to use and R package? I tried numeric transformation of all categorical fields and doing clustering afterwards. But the transformed fields have values from 1...10, and my other fields is in a bigger scale: 10000-...This will make the categorical fields has less effect on the distance calculation... Thank you! Yan [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.