Re: [R] aggregate function output

2013-11-01 Thread arun
You could also try: library(plyr)  newdf <- function(.data, ...) {    eval(substitute(data.frame(...)), .data, parent.frame())  } x1 <- ddply(mtcars,.(cyl,gear), newdf, mgp=t(quantile(mpg)),hp=t(quantile(hp))) #(found in one of the google group discussions) #or library(data.table) dt1 <- data.

Re: [R] aggregate function output

2013-11-01 Thread arun
Hi, Try: do.call(data.frame,c(x,check.names=FALSE)) A.K. Hello,   I´m using function aggregate in R 3.0.2.  If I run the instruction x<-aggregate(cbind(mpg,hp)~cyl+gear,data=mtcars,quantile) I get the result the following data.frame:   cyl gear mpg.0% mpg.25% mpg.50% mpg.75%

Re: [R] aggregate function output

2013-11-01 Thread Adams, Jean
Daniel, You can see better what is going on if you look at as.list(x) There you can see that cyl and gear are vectors but mpg and hp are matrices. You can rearrange them using the do.call() function x2 <- do.call(cbind, x) dim(x2) Jean On Fri, Nov 1, 2013 at 7:08 AM, Daniel Fernandes wrote:

[R] aggregate function output

2013-11-01 Thread Daniel Fernandes
Hello, I´m using function aggregate in R 3.0.2. If I run the instruction x<-aggregate(cbind(mpg,hp)~cyl+gear,data=mtcars,quantile) I get the result the following data.frame: cyl gear mpg.0% mpg.25% mpg.50% mpg.75% mpg.100% hp.0% hp.25% hp.50% hp.75% hp.100% 4 3 21.5 21.5

Re: [R] Aggregate function Bagging

2013-04-14 Thread Rui Barradas
Hello, If you run the example in ?bag you can type data(BloodBrain) ctreeBag$aggregate at an R prompt to see an example aggregate function. Note that it dos _not_ have the parenthesis. Hope this helps, Rui Barradas Em 14-04-2013 11:31, Nicolás Sánchez escreveu: Good morning all. I am doi

[R] Aggregate function Bagging

2013-04-14 Thread Nicolás Sánchez
Good morning all. I am doing bagging with package caret. I need bagging for a classification problem. I am working with " bag". bag(x, y, B = 10, vars = NULL, bagControl = bagControl(), ...) bagControl(fit = NULL, predict = NULL, aggregate = NULL, downSample = FA

Re: [R] aggregate function not working?

2012-10-17 Thread arun
10 A.K. - Original Message - From: jcrosbie To: r-help@r-project.org Cc: Sent: Wednesday, October 17, 2012 4:45 PM Subject: [R] aggregate function not working? The aggregate function for some reason will now work for me. The error I'm getting is: "Error in sort.list(y) : 'x

Re: [R] aggregate function not working?

2012-10-17 Thread David Winsemius
On Oct 17, 2012, at 1:45 PM, jcrosbie wrote: The aggregate function for some reason will now work for me. The error I'm getting is: "Error in sort.list(y) : 'x' must be atomic for 'sort.list' Have you called 'sort' on a list?" You have managed to create a slightly pathological dataframe: >

Re: [R] aggregate function not working?

2012-10-17 Thread Sarah Goslee
If you read the error message carefully and look at the data you included with dput() (for which I thank you!), you'll see that bandNum is a list, not a vector, just as the error message told you. I'm not sure how you created or imported your data frame, but something appears to have not worked th

[R] aggregate function not working?

2012-10-17 Thread jcrosbie
The aggregate function for some reason will now work for me. The error I'm getting is: "Error in sort.list(y) : 'x' must be atomic for 'sort.list' Have you called 'sort' on a list?" agPriceList=aggregate(PriceList$Size, list(PriceList$bandNum),sum) *Price list dataframe:* dput(PriceList) stru

Re: [R] Aggregate function for comparison stats

2012-04-26 Thread David Winsemius
On Apr 26, 2012, at 9:29 AM, Neil Davis wrote: Hi, I have a data.frame which contains timeseries from several different locations, which I want to compare against each other for example calculating RMSE, or normalized mean bias of each location against the others. An example of this is t

[R] Aggregate function for comparison stats

2012-04-26 Thread Neil Davis
Hi, I have a data.frame which contains timeseries from several different locations, which I want to compare against each other for example calculating RMSE, or normalized mean bias of each location against the others. An example of this is the cor function where I can put in a data.frame and

Re: [R] aggregate function

2011-12-21 Thread peter dalgaard
On Dec 21, 2011, at 18:22 , Mary Kindall wrote: > Hi Jim > > Thanks for reply but this is not working. I think I am missing something > over here. Yes, the data.table() bit. It's not going to work with data frames. > > 1> x <- cbind(c(1,2,2,2,3,4), c('a','b', 'c','d','e','f')) > 1> colnames(x

Re: [R] aggregate function

2011-12-21 Thread Mary Kindall
Hi Jim Thanks for reply but this is not working. I think I am missing something over here. 1> x <- cbind(c(1,2,2,2,3,4), c('a','b', 'c','d','e','f')) 1> colnames(x) = c('param', 'case1') 1> x = as.data.frame(x) 1> x param case1 1 1 a 2 2 b 3 2 c 4 2 d 5 3

Re: [R] aggregate function

2011-12-21 Thread jim holtman
You were using the wrong syntax; it should be: x[ , list(case1 = paste(case1, collapse = ',')) , by = param ] Notice that you do not use the "x$" on the names within the data.table statement. On Wed, Dec 21, 2011 at 12:22 PM, Mary Kindall wrote: > Hi Jim > > Thanks for reply but t

Re: [R] aggregate function

2011-12-21 Thread David Winsemius
On Dec 21, 2011, at 11:31 AM, jim holtman wrote: Here is an example using 'data.table'" x <- read.table(text = "param case1 + 1 a + 2 b + 2 c + 2 d + 3 e + 4 f", header = TRUE, as.is = TRUE) And the a

Re: [R] aggregate function

2011-12-21 Thread jim holtman
Here is an example using 'data.table'" > x <- read.table(text = "param case1 + 1 a + 2 b + 2 c + 2 d + 3 e + 4 f", header = TRUE, as.is = TRUE) > require(data.table) > x <- data.table(x) > x[ + , list( ca

[R] aggregate function

2011-12-21 Thread Mary Kindall
Hi I have a data frame with values in following format. param case1 1 a 2 b 2 c 2 d 3 e 4 f how to use aggregate so that it I only one row for each 'param' value. the output for the above input should be

Re: [R] aggregate function with a dataframe for both "x" and "by"

2011-10-05 Thread David Winsemius
On Oct 5, 2011, at 7:45 PM, Eva Powers wrote: I have 2 dataframes. "mydata" contains numerical data. "mybys" contains information on the "group" each row of the data is in. I wish to aggregate each column in mydata using the corresponding column in mybys. corresponding? Please see th

Re: [R] aggregate function with a dataframe for both "x" and "by"

2011-10-05 Thread Dennis Murphy
Hi: It's a little tricky to read in a data frame 'by hand' without making NA a default missing value; you've got to trick it a bit. I'm doing this inefficiently, but if you have the two 'real' data sets stored in separate files, read.table() is the way to go since it provides an option for definin

[R] aggregate function with a dataframe for both "x" and "by"

2011-10-05 Thread Eva Powers
I have 2 dataframes. "mydata" contains numerical data. "mybys" contains information on the "group" each row of the data is in. I wish to aggregate each column in mydata using the corresponding column in mybys. Please see the example below. What is a more elegant or "better" way to accomplish

Re: [R] aggregate function - na.action

2011-02-07 Thread Matthew Dowle
Hadley, That's fine; please do. I'm happy to explain it offline where the documentation or comments in the code aren't sufficient. It's GPL code so you can take it and improve it, or depend on it. Whatever works for you. As long as (of course) you don't stand on it's shoulders and then restric

Re: [R] aggregate function - na.action/ performance issues re structs and algorithms

2011-02-07 Thread Mike Marchywka
> From: had...@rice.edu > Date: Mon, 7 Feb 2011 11:00:59 -0600 > To: mdo...@mdowle.plus.com > CC: r-h...@stat.math.ethz.ch > Subject: Re: [R] aggregate function - na.action > > > Does FAQ 1.8 answer that ok ? > >

Re: [R] aggregate function - na.action

2011-02-07 Thread Hadley Wickham
> Does FAQ 1.8 answer that ok ? >   "Ok, I'm starting to see what data.table is about, but why didn't you > enhance data.frame in R? Why does it have to be a new package?" >   http://datatable.r-forge.r-project.org/datatable-faq.pdf Kind of. I think there are two sets of features data.table provi

Re: [R] aggregate function - na.action

2011-02-07 Thread Matthew Dowle
Hi Hadley, Does FAQ 1.8 answer that ok ? "Ok, I'm starting to see what data.table is about, but why didn't you enhance data.frame in R? Why does it have to be a new package?" http://datatable.r-forge.r-project.org/datatable-faq.pdf Matthew "Hadley Wickham" wrote in message news:AANLkT

Re: [R] aggregate function - na.action

2011-02-07 Thread Hadley Wickham
On Mon, Feb 7, 2011 at 5:54 AM, Matthew Dowle wrote: > Looking at the timings by each stage may help : > >>   system.time(dt <- data.table(dat)) >   user  system elapsed >   1.20    0.28    1.48 >>   system.time(setkey(dt, x1, x2, x3, x4, x5, x6, x7, x8))   # sort by the >> 8 columns (one-off) >  

Re: [R] aggregate function - na.action

2011-02-07 Thread Matthew Dowle
Looking at the timings by each stage may help : > system.time(dt <- data.table(dat)) user system elapsed 1.200.281.48 > system.time(setkey(dt, x1, x2, x3, x4, x5, x6, x7, x8)) # sort by the > 8 columns (one-off) user system elapsed 4.720.945.67 > system.time(

Re: [R] aggregate function - na.action

2011-02-06 Thread David Winsemius
On Feb 6, 2011, at 7:41 PM, Hadley Wickham wrote: There's definitely something amiss with aggregate() here since similar functions from other packages can reproduce your 'control' sum. I expect ddply() will have some timing issues because of all the subgrouping in your data frame, but data

Re: [R] aggregate function - na.action

2011-02-06 Thread Hadley Wickham
> There's definitely something amiss with aggregate() here since similar > functions from other packages can reproduce your 'control' sum. I expect > ddply() will have some timing issues because of all the subgrouping in your > data frame, but data.table did very well and the summaryBy() function i

Re: [R] aggregate function - na.action

2011-02-06 Thread Dennis Murphy
Hi: There's definitely something amiss with aggregate() here since similar functions from other packages can reproduce your 'control' sum. I expect ddply() will have some timing issues because of all the subgrouping in your data frame, but data.table did very well and the summaryBy() function in t

Re: [R] aggregate function - na.action

2011-02-06 Thread jim holtman
Try 'data.table' package. It took 3 seconds to aggregate the 500K levels: Is this what you were after? > # note the characters are converted to factors that 'data.table' likes > dat=data.frame( +x1=sample(c(NA,'m','f'), 2e6, replace=TRUE), +x2=sample(c(NA, 1:10), 2e6, replace=TRU

Re: [R] aggregate function - na.action

2011-02-06 Thread Gene Leynes
By the way, thanks for sending that formula, it's quite thoughtful of you to send an answer with an actual working line of code! When I experimented with ddply earlier last week I couldn't figure out the syntax for a single line aggregation, so it's good to have this example. I will likely use it

Re: [R] aggregate function - na.action

2011-02-06 Thread Denis Kazakiewicz
Try to use formula notation and use na.action=na.pass It is all described in the help(aggregate) У Няд, 06/02/2011 у 14:54 -0600, Gene Leynes піша: > On Fri, Feb 4, 2011 at 6:54 PM, Ista Zahn wrote: > > > > > > > However, I don't think you've told us what you're actually trying to > > > accompl

Re: [R] aggregate function - na.action

2011-02-06 Thread Gene Leynes
On Fri, Feb 4, 2011 at 6:54 PM, Ista Zahn wrote: > > > > However, I don't think you've told us what you're actually trying to > > accomplish... > > > I'm trying to aggregate the y value of a big data set which has several x's and a y. I'm using an abstracted example for many reasons. Partially,

Re: [R] aggregate function - na.action

2011-02-04 Thread Gene Leynes
Ista, Thank you again. I had figured that out... and was crafting another message when you replied. The NAs do come though on the variable that is being aggregated, However, they do not come through on the categorical variable(s). The aggregate function must be converting the data frame variabl

Re: [R] aggregate function - na.action

2011-02-04 Thread Gene Leynes
Just to be clear: This works: > set.seed(100) > dat=data.frame( + x1=sample(c(NA,'m','f'), 100, replace=TRUE), + x2=sample(c(NA, 1:10), 100, replace=TRUE), + x3=sample(c(NA,letters[1:5]), 100, replace=TRUE), + x4=sample(c(NA,T,F), 100, replace=TRUE), + y=sam

Re: [R] aggregate function - na.action

2011-02-04 Thread Ista Zahn
oops. For clarity, that should have been sum(ddply(dat, .(x1,x2,x3,x4), function(x){data.frame(y.sum=sum(x$y, na.rm=TRUE))})$y.sum) -Ista On Fri, Feb 4, 2011 at 7:52 PM, Ista Zahn wrote: > Hi again, > > On Fri, Feb 4, 2011 at 7:18 PM, Gene Leynes wrote: >> Ista, >> >> Thank you again. >> >> I

Re: [R] aggregate function - na.action

2011-02-04 Thread Ista Zahn
Hi again, On Fri, Feb 4, 2011 at 7:18 PM, Gene Leynes wrote: > Ista, > > Thank you again. > > I had figured that out... and was crafting another message when you replied. > > The NAs do come though on the variable that is being aggregated, > However, they do not come through on the categorical va

Re: [R] aggregate function - na.action

2011-02-04 Thread Ista Zahn
Hi, On Fri, Feb 4, 2011 at 6:33 PM, Gene Leynes wrote: > Thank you both for the thoughtful (and funny) replies. > > I agree with both of you that sum is the one picking up aggregate.  Although > I didn't mention it, I did realize that in the first place. > Also, thank you Phil for pointing out th

Re: [R] aggregate function - na.action

2011-02-04 Thread Gene Leynes
Thank you both for the thoughtful (and funny) replies. I agree with both of you that sum is the one picking up aggregate. Although I didn't mention it, I did realize that in the first place. Also, thank you Phil for pointing out that aggregate only accepts a formula value in more recent versions!

Re: [R] aggregate function - na.action

2011-02-04 Thread Ista Zahn
Sorry, I didn't see Phil's reply, which is better than mine anyway. -Ista On Fri, Feb 4, 2011 at 5:16 PM, Ista Zahn wrote: > Hi, > > Please see ?na.action > > (just kidding!) > > So it seems to me the problem is that you are passing na.rm to the sum > function. So there is no missing data for th

Re: [R] aggregate function - na.action

2011-02-04 Thread Ista Zahn
Hi, Please see ?na.action (just kidding!) So it seems to me the problem is that you are passing na.rm to the sum function. So there is no missing data for the na.action argument to operate on! Compare sum(aggregate(y~x1+x2+x3+x4, data=dat, sum, na.action=na.fail)$y) sum(aggregate(y~x1+x2+x3+x4

Re: [R] aggregate function - na.action

2011-02-04 Thread Phil Spector
Gene - Let me try to address your concerns one at a time: Since the formula interface to aggregate was introduced pretty recently (I think R-2.11.1, but I might be wrong) so when you try to use it in an R-2.10.1 it won't work. Now let's take a close look at the help page for aggregate. The

[R] aggregate function - na.action

2011-02-04 Thread Gene Leynes
Can someone please tell me what is up with na.action in aggregate? My (somewhat) reproducible example: (I say somewhat because some lines wouldn't run in a separate session, more below) set.seed(100) dat=data.frame( x1=sample(c(NA,'m','f'), 100, replace=TRUE), x2=sample(c(NA, 1:10

Re: [R] aggregate function / custom column names?

2010-02-11 Thread Gabor Grothendieck
Try this: > aggregate(list(Max = df$value), df['id'], max) id Max 1 11 2.610491 2 22 3.796836 3 33 6.562515 or if using value rather than Max is ok then just: > aggregate(df['value'], df['id'], max) idvalue 1 11 2.610491 2 22 3.796836 3 33 6.562515 On Thu, Feb 11, 2010 at 12:18 PM

[R] aggregate function / custom column names?

2010-02-11 Thread Chuck White
This question is about column names returned by the aggregate function. Consider the following example df <- data.frame( id = c(rep('11',30),rep('22',30),rep('33',30)), value = c(rnorm(30,2,0.5), rnorm(30,3,0.5), rnorm(30,6,0.5)) ) aggregate(df[,c("value"),drop=FALSE], by=list(id=df$id), m

Re: [R] Aggregate Function

2009-04-23 Thread Bronagh Grimes
Thanks a mil, will try that. -Original Message- From: Petr PIKAL [mailto:petr.pi...@precheza.cz] Sent: 23 April 2009 12:18 To: Bronagh Grimes Cc: r-help@r-project.org Subject: Odp: [R] Aggregate Function Try to set scipen in options. ?options e.g. options(scipen=12) Regards Petr r

[R] Aggregate Function

2009-04-23 Thread Bronagh Grimes
Hi all, I am currently carrying out the following aggregate function: D2 <- with(D1,aggregate(COST, list(FRUIT, VEG),FUN="sum")) The function is working fine but I am getting "sum" output in the following format: 1.750623e+09 How can I re-format the output to look like 1750622640.7? Many than

Re: [R] aggregate() function and na.rm = TRUE

2008-07-08 Thread Daniel Malter
uftrag von David Afshartous Gesendet: Tuesday, July 08, 2008 4:57 PM An: r-help@r-project.org Betreff: [R] aggregate() function and na.rm = TRUE All, I've been using aggregate() to compute means and standard deviations at time/treatment combinations for a longitudinal dataset, using na.rm = T

[R] aggregate() function and na.rm = TRUE

2008-07-08 Thread David Afshartous
All, I've been using aggregate() to compute means and standard deviations at time/treatment combinations for a longitudinal dataset, using na.rm = TRUE for missing data. This was working fine before, but now when I re-run some old code it isn't. I've backtracked my steps and can't seem to find

Re: [R] aggregate() function, strange behavior for augmented data

2008-06-16 Thread David Afshartous
Everything was read in the same way, and str(junk1) confirms that they are the same structure. This is very strange. ## original data: > str(junk1) 'data.frame': 96 obs. of 3 variables: $ Hour: int 0 3 5 0 3 5 0 3 5 0 ... $ Drug: Factor w/ 2 levels "D","P": 2 2 2 1 1 1 2 2 2 1 ... $ Aldo:

[R] aggregate() function, strange behavior for augmented data

2008-06-16 Thread David Afshartous
All, I'm re-running some analysis that has been augmented with additional data. When I use the exact same code for the augmented data, the behavior of the aggregate function is very strange, viz., one of the resulting variables is now coded as a factor while it was coded as numeric for the origi

[R] aggregate function oddity

2007-09-17 Thread Mihalicza Péter
Dear All, I tried to aggregate the rows according to some factors in a data frame. I got the "Error in Summary.factor(..., na.rm = na.rm) : sum not meaningful for factors" message. This problem was once already discussed in 2003 on this list, where the following solution was given: inclu

[R] aggregate function oddity

2007-09-17 Thread Mihalicza Péter
__ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.