On 03/26/2010 08:41 PM, Christoffer Karlsson wrote:
Hi,
I have a column in a data frame looking something like:
$sex $language $count
male english 0
male english 0
female english 32
male spanish 154
female english 11
female norweigan 7
and so on.
What I want to do is to order these in to categories, for instance one
category where count>=0& count<10 and so on..
I want my data to turn out looking something like:
male english 0-10 1324
male english 11-20 756
.....
male spanish 0-10 354
...
female english 0-10 1557
...
and so on, where the right hand is the count of the number of people in each
category.
Up until now I've been subsetting the data frame into each category, and
then counting number of rows in each subset. However I now have a large
amount of different factor combinations which makes this process tedious.
Any help would be appreciated!
Hi Chris,
As luck would have it, I have been working on a very similar problem,
that of graphically representing multi-level summaries. What you could
do is to create a new factor variable with the "cut" function (say,
"countcut"), then call the "by" function like this:
by(mydf$sex,list(mydf$language,mydf$countcut),sum)
You will not get the format you have specified, but you will get the
numbers that can be reformatted.
Jim
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.