Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Matthew Dowle
Antonio Piccolboni piccolboni.info> writes: > Hi, > I was wondering if there is anything more efficient than split to do the > kind of conversion in the subject. If I create a data frame as in > > system.time({fd = data.frame(x=1:2000, y = rnorm(2000), id = paste("x", > 1:2000, sep =""))}) >

Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Prof Brian Ripley
On 01/05/2012 00:28, Antonio Piccolboni wrote: Hi, I was wondering if there is anything more efficient than split to do the kind of conversion in the subject. If I create a data frame as in system.time({fd = data.frame(x=1:2000, y = rnorm(2000), id = paste("x", 1:2000, sep =""))}) user syst

Re: [Rd] The constant part of the log-likelihood in StructTS

2012-05-01 Thread Ravi Varadhan
This is not a problem at all. The log likelihood function is a function of the model parameters and the data, but it is defined up to an additive arbitrary constant, i.e. L(\theta) and L(\theta) + k are completely equivalent, for any k. This does not affect model comparisons or hypothesis tests

Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Antonio Piccolboni
It seems like people need to hear more context, happy to provide it. I am implementing a serialization format (typedbytes, HADOOP-1722 if people want the gory details) to make R and Hadoop interoperate better (RHadoop project, package rmr). It is a row first format and it's already implemented as a

Re: [Rd] The constant part of the log-likelihood in StructTS

2012-05-01 Thread Jouni Helske
Ok, it seems that R's AIC and BIC functions warn about different constants, so that's probably enough. The constants are not irrelevant though, if you compute the log-likelihood of one model using StructTS, and then fit alternative model using other functions such as arima(), which do take account

Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Simon Urbanek
On May 1, 2012, at 1:26 PM, Antonio Piccolboni wrote: > It seems like people need to hear more context, happy to provide it. I am > implementing a serialization format (typedbytes, HADOOP-1722 if people want > the gory details) to make R and Hadoop interoperate better (RHadoop > project, package

Re: [Rd] A doubt about substitute() after delayedAssign()

2012-05-01 Thread Philippe Grosjean
On 29/04/12 13:50, Duncan Murdoch wrote: On 12-04-29 3:30 AM, Philippe Grosjean wrote: > Hello, > > ?delayedAssign presents substitute() as a way to look at the expression > in the promise. However, > > msg<- "old" > delayedAssign("x", msg) > msg<- "new!" > x #- new! > substitute(x) #-

Re: [Rd] A doubt about substitute() after delayedAssign()

2012-05-01 Thread Duncan Murdoch
On 12-05-01 4:21 PM, Philippe Grosjean wrote: On 29/04/12 13:50, Duncan Murdoch wrote: On 12-04-29 3:30 AM, Philippe Grosjean wrote: > Hello, > > ?delayedAssign presents substitute() as a way to look at the expression > in the promise. However, > > msg<- "old" > delayedAssign(

Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Antonio Piccolboni
On Tue, May 1, 2012 at 11:29 AM, Simon Urbanek wrote: > > On May 1, 2012, at 1:26 PM, Antonio Piccolboni > wrote: > > > It seems like people need to hear more context, happy to provide it. I am > > implementing a serialization format (typedbytes, HADOOP-1722 if people > want > > the gory details)