Hi All,

What formula can I use to determine the right sample size for a clustering analysis with 100-300 variables?
What sampling methodology can be used for k-means or hierarchical clustering on categorical fields so that every value of each categorical field is included in the sample?

Thanks a lot!

Regards,
Yan

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.