Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Antonio Piccolboni
On Tue, May 1, 2012 at 11:29 AM, Simon Urbanek wrote: > > On May 1, 2012, at 1:26 PM, Antonio Piccolboni > wrote: > > > It seems like people need to hear more context, happy to provide it. I am > > implementing a serialization format (typedbytes, HADOOP-1722 if people >

Re: [Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-05-01 Thread Antonio Piccolboni
hat's helpful too in a way because it restricts the options. I thought I may be missing a simple primitive, like a t() for data frames (that doesn't coerce to matrix). Thanks Antonio On Tue, May 1, 2012 at 5:46 AM, Prof Brian Ripley wrote: > On 01/05/2012 00:28, Antonio Piccolboni wr

[Rd] fast version of split.data.frame or conversion from data.frame to list of its rows

2012-04-30 Thread Antonio Piccolboni
Hi, I was wondering if there is anything more efficient than split to do the kind of conversion in the subject. If I create a data frame as in system.time({fd = data.frame(x=1:2000, y = rnorm(2000), id = paste("x", 1:2000, sep =""))}) user system elapsed 0.004 0.000 0.004 and then I try

[Rd] Serializing many small objects efficiently

2012-03-22 Thread Antonio Piccolboni
Hi, sorry if this question is trivial or unclear, this is my first venture into mixed C/R programming (I am reasonably experienced in each separately). I am trying to write a serialization function for a format called typedbytes, which is used as an interchange format in Hadoop circles. Since I wou

Re: [Rd] On R performance

2012-03-11 Thread Antonio Piccolboni
On Fri, Mar 9, 2012 at 8:39 AM, Justin Talbot wrote: > > > Another possibility is to implement R/S on top of an optimized virtual > > machine like the JVM, LLVM, etc. > > > > > Somebody is pursuing that already: https://code.google.com/p/renjin/ Antonio The RHadoop project, https://github.com/R