[R] parallel clustering, amap, hcluster

Ziqi Zhang Sat, 08 Aug 2015 04:18:07 -0700

Hi

I am looking for parallel implementation of hierarchical clustering, theequivalent to "hclust" in the "fpc" package.


I found "hcluster" from "amap" package:

hcluster(x, method = "euclidean", diag = FALSE, upper = FALSE,
         link = "complete", members = NULL, nbproc = 2,
         doubleprecision = TRUE)

It takes a data matrix, computes distance matrix then do clustering.

However in my application, /i have to compute the distance matrix anduse it later anyway. So hcluster is re-computing the distance which is awaste of time, as my data is very large scale.

Is there anyway hcluster could just use a pre-computed distance object,or obtain the distance object from hcluster, so I can avoiddouble-computing the distane object?

Or more general question is, if there is a parallel implementation ofhierarchical clustering that takes input a distance matrix, rather thanthe raw data matrix?

Many thanks!

---
This email has been checked for viruses by Avast antivirus software.

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] parallel clustering, amap, hcluster

Reply via email to