Hi, while working with decision trees and unbalanced data, I came across the
use of the Hellinger distance as an alternative to information gain [1,2],
when dealing with skewed data. Does anybody know of R implementations of
this approach to decision trees?

Thanks,

[1] http://www.cse.nd.edu/Reports/2008/TR-2008-06.pdf
[2] http://csmr.ca.sandia.gov/~wpk/slides/wdmda-sem.pdf
-- 
Rajarshi Guha
NIH Chemical Genomics Center

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to