Re: [Tutor] Processing CSV files

2013-10-09 Thread Sivaram Neelakantan
On Wed, Oct 09 2013, Leena Gupta wrote:
> Hello,
>
> Looking for some inputs on Python's csv processing feature.
>
> I need to process a large csv file every 5-10 minutes. The file could
> contain 3mill to 10 mill rows and size could be 6MB to 10MB(+). As part of
> the processing, I need to sum up

Re: [Tutor] Processing CSV files

2013-10-08 Thread Mark Lawrence
On 08/10/2013 21:46, Leena Gupta wrote: Hello, Looking for some inputs on Python's csv processing feature. I need to process a large csv file every 5-10 minutes. The file could contain 3mill to 10 mill rows and size could be 6MB to 10MB(+). As part of the processing, I need to sum up a number v

Re: [Tutor] Processing CSV files

2013-10-08 Thread Martin A. Brown
Hello there,
: We have a box with 16GB RAM so RAM should not be an issue
: hopefully.
:
: The datastore is Cassandra and I'm hoping to use the pycassa
: library for interaction.
:
: I do have an additional question related to Cassandra & Python.
: As part of data processing, I need t

Re: [Tutor] Processing CSV files

2013-10-08 Thread Alan Gauld
On 09/10/13 00:26, Leena Gupta wrote:
> I do have an additional question related to Cassandra & Python. As part of data processing, I need to fetch slices of data from Cassandra and run computations like sum and percentile calculation on it.
Sorry, I've never even heard of Cassandra before So
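The sum and percentile step mentioned above does not depend on where the numbers come from. As a rough sketch only, assuming the slice has already been fetched into a plain Python list (the fetch itself is not shown, and `percentile` and `slice_values` are hypothetical names, not part of pycassa), a nearest-rank percentile could look like this:

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile: the smallest value such that at least
    pct percent of the data is less than or equal to it."""
    if not values:
        raise ValueError("percentile() needs at least one value")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(values)
    # 1-based rank of the wanted element in the sorted list.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


# Hypothetical numbers standing in for one slice of fetched data.
slice_values = [15, 20, 35, 40, 50]
total = sum(slice_values)
median = percentile(slice_values, 50)
highest = percentile(slice_values, 100)
```

Nearest-rank is only one of several percentile definitions; an interpolating definition would give different answers on small samples, so it is worth checking which one the reports actually need.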

Re: [Tutor] Processing CSV files

2013-10-08 Thread Leena Gupta
Dave, Alan - thanks for replying. We have a box with 16GB RAM so RAM should not be an issue hopefully. The datastore is Cassandra and I'm hoping to use the pycassa library for interaction. I do have an additional question related to Cassandra & Python. As part of data processing, I need to fetch

Re: [Tutor] Processing CSV files

2013-10-08 Thread Dave Angel
On 8/10/2013 16:46, Leena Gupta wrote:
> Hello,
>
> Looking for some inputs on Python's csv processing feature.
>
> I need to process a large csv file every 5-10 minutes. The file could
> contain 3mill to 10 mill rows and size could be 6MB to 10MB(+). As part of
> the processing, I need to sum up

Re: [Tutor] Processing CSV files

2013-10-08 Thread Alan Gauld
On 08/10/13 21:46, Leena Gupta wrote:
> I need to process a large csv file every 5-10 minutes. The file could contain 3mill to 10 mill rows and size could be 6MB to 10MB(+).
Assuming you have a modern multi-core PC with >2G RAM then those sizes should not be a problem. If you have an older PC t

[Tutor] Processing CSV files

2013-10-08 Thread Leena Gupta
Hello, Looking for some inputs on Python's csv processing feature. I need to process a large csv file every 5-10 minutes. The file could contain 3mill to 10 mill rows and size could be 6MB to 10MB(+). As part of the processing, I need to sum up a number value by grouping on certain attributes and
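For readers following the thread, here is a minimal sketch of the kind of grouped sum the question describes, using only the standard library `csv` module and reading one row at a time so the whole file never has to sit in memory. The column names (`region`, `product`, `amount`) and the helper `sum_by_group` are assumptions made up for illustration; the original post does not say what the file contains.

```python
import csv
import io
from collections import defaultdict


def sum_by_group(lines, group_cols, value_col):
    """Stream CSV rows and total value_col for each combination of group_cols."""
    totals = defaultdict(float)
    for row in csv.DictReader(lines):
        # The grouping key is the tuple of the chosen attribute values.
        key = tuple(row[col] for col in group_cols)
        totals[key] += float(row[value_col])
    return dict(totals)


# Hypothetical sample data; a real run would pass an open file instead,
# e.g. open(path, newline="").
sample = io.StringIO(
    "region,product,amount\n"
    "east,widget,10.5\n"
    "west,widget,4\n"
    "east,widget,2.5\n"
    "east,gadget,1\n"
)
totals = sum_by_group(sample, ["region", "product"], "amount")
```

Memory use here grows with the number of distinct groups rather than the number of rows, which is usually what matters for a file of a few million lines.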