[R] interpretation of MDS plot in random forest

Massimo Bressan Mon, 02 Dec 2013 03:35:32 -0800

Given this general example:

set.seed(1)


data(iris)

iris.rf <- randomForest(Species ~ ., iris, proximity=TRUE, keep.forest=TRUE)

#varImpPlot(iris.rf)

#varUsed(iris.rf)

MDSplot(iris.rf, iris$Species)

I’ve been reading the documentation about random forest (at best of my -poor - knowledge) but I’m in trouble with the correct interpretation ofthe MDS plot and I hope someone can give me some clues


What is intended for “the scaling coordinates of the proximity matrix”?

I think to understand that the objective is here to present the distanceamong species in a parsimonious and visual way (of lower dimensionality)

Is therefore a parallelism to what are intended the principal componentsin a classical PCA?

Are the scaling coordinates DIM 1 and DIM2 the eigenvectors of theproximity matrix?

If that is correct, how would you find the eigenvalues for thateigenvectors? And what are the eigenvalues repreenting?

What are saying these two dimensions in the plot about the differentiris species? Their relative distance in terms of proximity within thespace DIM1 and DIM2?

How to choose for the k parameter (number of dimensions for the scalingcoordinates)?


And finally how would you explain the plot in simple terms?

Thank you for any feedback
Best regards

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] interpretation of MDS plot in random forest

Reply via email to