Greetings R Users! I am posting to inquire about the proximity matrix in the randomForest R-package. I am having difficulty pushing very large data through the algorithm and it appears to hang on the building of the prox matrix. I have read on Dr. Breiman's website that in the original code a choice can be made between using an N x N matrix OR to increase the ability to compute large datasets an N x T matrix can be created. The N refers to the number of samples and the T refers to the number of trees in the forest. It is a sentence in the FORTRAN documentation and nothing else is stated about it... My question is, does the randomForest module in R allow for this choice in proximity matrices generated by the algorithm? If so, can someone please point me in the direction of how to implement it? That would be great!
Many thanks in advance and best wishes from Alaska! Michael -- Michael Lindgren GIS Technician / Programmer EWHALE Lab - Institute of Arctic Biology University of Alaska 419 IRVING I Fairbanks, AK 99775-7000 Email: malindg...@alaska.edu Phone: 907 474 7959 [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.