Hi All,

What formula can I use to determine an appropriate sample size for cluster 
analysis with 100-300 variables?

Which sampling method can I use before running k-means or hierarchical 
clustering on data with categorical fields, so that every value of each 
categorical field appears in the sample?

Thanks a lot!

Regards,
Yan

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
