RE: Using Solr to index zip files

2016-06-09 Thread anupama . gangadhar
zips as fields (search criteria) for this index. Thank you, Regards, Anupama -Original Message- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Tuesday, June 07, 2016 7:44 PM To: solr-user Subject: Re: Using Solr to index zip files I _think_ DataImportHandler could handle zip

Re: Using Solr to index zip files

2016-06-07 Thread Alexandre Rafalovitch
I _think_ DataImportHandler could handle zip files with a fixed level of nesting, but not read them from HDFS. I don't think anything else in Solr will. So doing it outside of Solr is probably best, especially since you would need to decide how you actually want to map these files (e.g. do you keep the

RE: Using Solr to index zip files

2016-06-07 Thread BURN, James
Hi, I think you'll need to unzip your zip files with an unzip application before you post to Solr. If you do this via an OS-level batch script, you can apply logic there to deal with nested zips. Then post your unzipped files to Solr via curl. James -Original Message- From
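
For what it's worth, the unzip-then-post approach described above could be sketched roughly as follows. This is only an illustration in Python rather than a batch script with curl, not something taken from the thread. The function names, the "_unzipped" directory suffix, and the core URL in the comments are all made up for the example, and it assumes the extracting request handler (/update/extract) is enabled on the target core.

```python
import os
import urllib.parse
import urllib.request
import zipfile


def extract_nested(zip_path, dest_dir):
    """Extract zip_path into dest_dir, recursing into nested .zip members.

    Returns the paths of all extracted files that are not zip archives.
    """
    with zipfile.ZipFile(zip_path) as zf:
        # extract() returns the sanitized path it actually wrote to.
        members = [
            zf.extract(info, dest_dir)
            for info in zf.infolist()
            if not info.is_dir()
        ]

    extracted = []
    for path in members:
        if path.lower().endswith(".zip") and zipfile.is_zipfile(path):
            # Nested archive: unpack it next to itself (hypothetical naming).
            extracted.extend(extract_nested(path, path + "_unzipped"))
        else:
            extracted.append(path)
    return extracted


def post_to_solr(file_path, core_url, doc_id):
    """POST one file to the core's /update/extract handler.

    Assumes the extracting handler is configured on that core; core_url is
    something like http://localhost:8983/solr/mycore (hypothetical).
    """
    query = urllib.parse.urlencode({"literal.id": doc_id, "commit": "true"})
    with open(file_path, "rb") as fh:
        body = fh.read()
    request = urllib.request.Request(
        f"{core_url}/update/extract?{query}",
        data=body,
        headers={"Content-Type": "application/octet-stream"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


# Example use (paths and URL are placeholders):
# for path in extract_nested("archive.zip", "unpacked"):
#     post_to_solr(path, "http://localhost:8983/solr/mycore", doc_id=path)
```

Using the extracted path as the document id is just one possible mapping. Whether to also record the name of the enclosing archive as a field is a decision you would have to make for your own index.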