[R] non-english text mining with tm package

2009-06-15 Thread eitan lavi
Hello, I'm having issues working with the tm package on non-english languages. are there any extensions that enable the package to work with Hebrew (and other non-roman letter languages for that matter) ? (e.g. although I can construct a Corpus that shows the hebrew documents alright, I cannot c

[R] non-english text mining with tm package

2009-06-15 Thread eitan lavi
Hello, I'm having issues working with the tm package on non-english languages. are there any extensions that enable the package to work with Hebrew (and other non-roman letter languages for that matter) ? (e.g. although I can construct a Corpus that shows the hebrew documents alright, I cannot c

[R] - help - predicting with glmnet/lars for dataframes with different nrow then the train set

2009-03-16 Thread eitan lavi
Hello I'm having trouble using lars and glmnet functions to predict on a new data set with different nrow then the original : for instance: = log.1 = glm(temp.data$TL~(.),temp.data,family = binomial,x=TRUE,y=TRUE) nrow(test.data) != nrow(temp.data # == TRUE Val.frame = m

[R] - help - LARS & LASSO based on glm

2009-03-16 Thread eitan lavi
Hello I'm trying to use LARS on a glm: log.1 = glm(Train.Data$TL~(.),Train.Data,family = binomial) LARS.1 = lars(model.matrix(log.1),Train.Data$TL,type = "lar",max.steps = ncol(model.matrix(log.1))-1) self.fit = predict.lars(LARS.1,model.matrix(log.1),type="fit",mode="step")$fit