Re: DIH - stream file with solrEntityProcessor

2013-10-15 Thread Josh Lincoln
ultimately I just temporarily increased the memory to handle this data set, but that won't always be practical. I did try the csv export/import and it worked well in this case. I hadn't considered it at first. I am wary that the escaping and splitting may be problematic with some data sets, so I'l

Re: DIH - stream file with solrEntityProcessor

2013-10-14 Thread Lance Norskog
Can you do this data in CSV format? There is a CSV reader in the DIH. The SEP was not intended to read from files, since there are already better tools that do that. Lance On 10/14/2013 04:44 PM, Josh Lincoln wrote: Shawn, I'm able to read in a 4mb file using SEP, so I think that rules out th

Re: DIH - stream file with solrEntityProcessor

2013-10-14 Thread Josh Lincoln
Shawn, I'm able to read in a 4mb file using SEP, so I think that rules out the POST buffer being the issue. Thanks for suggesting I test this. The full file is over a gig. Lance, I'm actually pointing SEP at a static file (I simply named the file "select" and put it on a Web server). SEP thinks it

Re: DIH - stream file with solrEntityProcessor

2013-10-14 Thread Lance Norskog
On 10/13/2013 10:02 AM, Shawn Heisey wrote: On 10/13/2013 10:16 AM, Josh Lincoln wrote: I have a large solr response in xml format and would like to import it into a new solr collection. I'm able to use DIH with solrEntityProcessor, but only if I first truncate the file to a small subset of the

Re: DIH - stream file with solrEntityProcessor

2013-10-13 Thread Shawn Heisey
On 10/13/2013 10:16 AM, Josh Lincoln wrote: > I have a large solr response in xml format and would like to import it into > a new solr collection. I'm able to use DIH with solrEntityProcessor, but > only if I first truncate the file to a small subset of the records. I was > hoping to set stream="tr