Re: [R] different randomForest performance for same data

2009-12-15 Thread Liaw, Andy
You need to be _extremely_ careful when assigning levels of factors. Look at this example:

R> x1 = factor(c("a", "b", "c"))
R> x2 = factor(c("a", "c", "c"))
R> x3 = x2
R> levels(x3) <- levels(x1)
R> x3
[1] a b b
Levels: a b c

I'll try to add more proofing in the code...

Andy

> -Origi
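For comparison with the example in Andy's message, here is a minimal sketch of one way to align the levels without changing the values. It assumes the goal is for the second factor to carry the same level set as the first, which is not stated in the snippet. The name x4 is made up for illustration. Assigning to levels() relabels by position, while rebuilding with factor(..., levels = ) matches by label.

```r
x1 <- factor(c("a", "b", "c"))
x2 <- factor(c("a", "c", "c"))

# Overwriting levels() relabels by position: the second level of x2 ("c")
# is renamed to the second level of x1 ("b"), so the data silently change.
x3 <- x2
levels(x3) <- levels(x1)
as.character(x3)   # "a" "b" "b"

# Rebuilding the factor matches values by label, so the data stay intact
# and only the level set is extended. x4 is a hypothetical name.
x4 <- factor(x2, levels = levels(x1))
as.character(x4)   # "a" "c" "c"
levels(x4)         # "a" "b" "c"
```

One caveat with this approach: any value of x2 that is not among levels(x1) becomes NA, so it is worth checking anyNA(x4) afterwards.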

Re: [R] different randomForest performance for same data

2009-12-13 Thread Uwe Ligges
Häring wrote:

Hello, I came across a problem when building a randomForest model. Maybe someone can help me. I have a training and a test dataset with a discrete response and ten predictors (numeric and factor variables). The two datasets are similar in terms of number of predictors, name of