Hello all, I've been comparing results from kmeans() in R to PROC FASTCLUS in SAS, and I'm getting drastically different results with a real-life data set. Even with a simulated data set that has very well separated clusters, and starting from the same seeds, the resulting cluster means are still different. I was hoping to look at the source code of kmeans(), but it's in C and FORTRAN and I'm not quite sure how to get at it. Has anybody looked into the differences between the implementations, or does anyone have any thoughts on the matter? Below is the code I'm using in each case.

    fit = kmeans(obs[,-1], centers, nstart=25)

    proc fastclus data=std maxclusters=2 maxiter=100 outiter drift converge=0.01
                  outseed=centers out=cluster;
       var x y z;
    run;

Thanks,
Andy

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.