[R] kmeans() compared to PROC FASTCLUS

Andrew Agrimson Thu, 02 Dec 2010 15:50:35 -0800

Hello all,

I've been comparing results from kmeans() in R to PROC FASTCLUS in SAS and
I'm getting drastically different results with a real-life data set. Even
with a simulated data set with very well separated clusters, and starting
from the same seeds, the resulting cluster means are still different. I was
hoping to look at the source code of kmeans(), but it's in C and Fortran and
I'm not quite sure how to get at it. Has anybody looked into the differences
between the two implementations, or does anyone have any thoughts on the
matter? Below is the code I'm using in each case.



fit <- kmeans(obs[, -1], centers, nstart = 25)

proc fastclus data=std maxclusters=2 maxiter=100 outiter drift
              converge=0.01 outseed=centers out=cluster;
  var x y z;
run;
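
The simulated comparison described above could be set up along the following lines. This is only a sketch: the cluster locations, sample sizes, random seed, and the `seeds` matrix are assumptions for illustration, not the data actually used, and the simulated `obs` has no leading ID column, so nothing is dropped with `[, -1]`.

```r
# Sketch only: cluster locations, sizes, and the seed matrix are assumptions.
set.seed(1)

# Two well-separated Gaussian clusters in three variables (x, y, z)
n <- 100
obs <- rbind(
  matrix(rnorm(3 * n, mean = 0,  sd = 1), ncol = 3),
  matrix(rnorm(3 * n, mean = 10, sd = 1), ncol = 3)
)
colnames(obs) <- c("x", "y", "z")

# Explicit initial centers (the "seeds"), one row per cluster
seeds <- rbind(c(0, 0, 0), c(10, 10, 10))

# Passing a matrix as `centers` fixes the starting configuration
fit <- kmeans(obs, centers = seeds, iter.max = 100)

# Cluster means and sizes to set beside the FASTCLUS output
print(fit$centers)
print(fit$size)
```

The same `obs` and `seeds` could be exported and given to PROC FASTCLUS so that both procedures start from identical data and initial centers. kmeans() also takes an `algorithm` argument (the default is "Hartigan-Wong"), which may be worth varying when comparing implementations.
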
Thanks,
Andy

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
