Dear All,
Here is a small example:
library(skmeans)
library(tm)
data("crude")
#Examine the first document
inspect(crude[[1]])
dtm <- DocumentTermMatrix(crude, control =
list(removePunctuation = TRUE,
removeNumbers = TRUE,
stopwords = TRUE))
clus <- skmeans(dtm,3)
names(clus)
Is there any way I can get the document number of the prototypes ? Also
can I get the 3 closest documents to each prototype ? By prototype I mean
the cluster centers.
I know can compare each row of the DocumentTermMatrix with the prototypes
to test for equality and I can manually compute the distance of each
Document from a prototypes,but I was wondering if such a tool already
exists.
Best Regards,
Ashim
[[alternative HTML version deleted]]
______________________________________________
[email protected] mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.