Hey everybody!
I have to use R's tm package to do some text analysis, first thing would be to
create a term frequency matrix.
Digging in tm's source code it seems like it uses some logic like this to
create term frequencies:
data("crude")
(txt <- Content(crude[[1]]))
(tokTxt <- unlist(strsplit
I have a csv file that has approximately 50k rows. In the first value of
each row, a file name is listed, and there are 162 different file names. At
the end of each row, there is a number value. What I would like to be able
to do is for the 162 different files (or we could call them categories)
2 matches
Mail list logo