Hi, while working with decision trees on unbalanced data, I came across the use of the Hellinger distance as an alternative splitting criterion to information gain for skewed class distributions [1,2]. Does anybody know of an R implementation of this approach to decision trees?
Thanks,

[1] http://www.cse.nd.edu/Reports/2008/TR-2008-06.pdf
[2] http://csmr.ca.sandia.gov/~wpk/slides/wdmda-sem.pdf

--
Rajarshi Guha
NIH Chemical Genomics Center

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.