[R] Analyzing texts with tm

2011-01-19 Thread Michael Weller
Hey everybody! I have to use R's tm package to do some text analysis, first thing would be to create a term frequency matrix. Digging in tm's source code it seems like it uses some logic like this to create term frequencies: data("crude") (txt <- Content(crude[[1]])) (tokTxt <- unlist(strsplit

[R] Multiple ecdf plots?

2010-06-17 Thread weller
I have a csv file that has approximately 50k rows. In the first value of each row, a file name is listed, and there are 162 different file names. At the end of each row, there is a number value. What I would like to be able to do is for the 162 different files (or we could call them categories)