Re: [R] Using R with Hadoop/Hive for Big Data

2009-08-01 Thread Ajay ohri
Hi, the document helps a lot, thanks. I need to know how to work with Hadoop and R in a parallel cluster environment. Hive is a new system on top of Hadoop that uses a SQL derivative to query it. http://hadoop.apache.org/hive/ Regards, Ajay On Fri, Jul 31, 2009 at 7:23 PM, Avram Aelony wrot

[R] Using R with Hadoop/Hive for Big Data

2009-07-31 Thread Ajay ohri
Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, ad hoc querying, and analysis of large datasets stored in Hadoop files. It provides a mechanism to put structure on this data and it also provide