While trying to train a randomForest model on my dataset, I get the
following error:
Error in randomForest.default(datatrain, classtrain) :
  length of response must be the same as predictors

My data looks like:
A,B,C,D,Class
1,2,1,2,cl1
1,2,1,2,cl1
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,12,3,cl2
3,2,1,2,cl2
The actual dataset has around 4000 features and two classes. The number of
instances is also around 4000.

The steps followed are:
trainfile <- read.csv("TrainFile",head=TRUE)
datatrain <- subset(trainfile,select=c(-Class))
classtrain <- (subset(trainfile,select=Class))
rf <- randomForest(datatrain, classtrain)
Error in randomForest.default(datatrain, classtrain) :
  length of response must be the same as predictors
In addition: Warning message:
In randomForest.default(datatrain, classtrain) :
  The response has five or fewer unique values.  Are you sure you want to do
regression?

Where am I going wrong?
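
One thing that may be worth checking, as a sketch only: subset(..., select=Class) returns a one-column data frame rather than a vector, and length() of a data frame is its number of columns. The snippet below uses a few of the sample rows above read inline (instead of "TrainFile"); the name classvec is introduced here purely for illustration.

```r
# A few sample rows from the post, read inline instead of from "TrainFile".
trainfile <- read.csv(text = "A,B,C,D,Class
1,2,1,2,cl1
1,2,1,2,cl1
3,2,1,2,cl2
3,2,1,2,cl2
3,2,12,3,cl2", header = TRUE)

datatrain  <- subset(trainfile, select = -Class)
classtrain <- subset(trainfile, select = Class)  # still a data frame

class(classtrain)    # data frame, not a factor
length(classtrain)   # number of columns, not number of rows
nrow(datatrain)      # number of rows in the predictors

# Hypothetical alternative: pull the column out as a factor,
# which gives one label per row.
classvec <- factor(trainfile$Class)
length(classvec) == nrow(datatrain)
```

If the lengths disagree in this way, that would be consistent with both the "length of response" error and the regression warning, since the response is not a factor.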

If I follow the example provided in the documentation (Classification and
Regression with Random Forest):
rf <- randomForest(classtrain, data=datatrain)
I don't get a randomForest of type classification.
Instead I get:

Call:
 randomForest(x = classtrain, data = datatrain)
               Type of random forest: unsupervised
                     Number of trees: 500
No. of variables tried at each split: 1
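
For comparison, a minimal sketch of two calls that appear to produce a classification forest on the sample rows above. It assumes the randomForest package is installed, that Class is converted to a factor first, and that the response is passed either as y or on the left-hand side of a formula. The names x, y, rf_xy and rf_formula, the seed, and ntree = 50 are illustrative choices, not taken from the original steps.

```r
library(randomForest)  # assumes the randomForest package is installed

# Sample rows from the post, read inline instead of from "TrainFile".
trainfile <- read.csv(text = "A,B,C,D,Class
1,2,1,2,cl1
1,2,1,2,cl1
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,1,2,cl2
3,2,12,3,cl2
3,2,1,2,cl2", header = TRUE)

# The response needs to be a factor for classification.
trainfile$Class <- factor(trainfile$Class)

set.seed(1)  # arbitrary seed, only for repeatability

# x/y interface: predictors as a data frame, response as a factor vector.
x <- trainfile[, setdiff(names(trainfile), "Class")]
y <- trainfile$Class
rf_xy <- randomForest(x = x, y = y, ntree = 50)

# Formula interface: the response is named on the left of the formula.
rf_formula <- randomForest(Class ~ ., data = trainfile, ntree = 50)

rf_xy$type
rf_formula$type
```

With around 4000 predictors, the x/y form may be the more practical of the two, since the formula interface has to build a model frame first.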

Any suggestion would be appreciated.

Thanks


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
