Re: SimplePostTool with extracted Outlook messages

2015-01-27 Thread Mark
In the end I didn't find a way to add a new file/ mime type for recursing a folder. So I added msg to the static dtring and Mime map. private static final String DEFAULT_FILE_TYPES = "xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xlsx,odt,odp,ods,ott,otp,ots,rtf,htm,html,txt,log,msg"; mimeMap.put("msg"

Re: SimplePostTool with extracted Outlook messages

2015-01-27 Thread Mark
Hi Alex, On an individual file basis that would work, since you could set the ID on an individual basis. However recuring a folder it doesn't work, and worse still the server complains, unless on the server side you can use the UpdateRequestProcessor chains with UUID generator as you suggested.

Re: SimplePostTool with extracted Outlook messages

2015-01-27 Thread Alexandre Rafalovitch
Your IDs seem to be the file names, which you are probably also getting from your parsing the file. Can't you just set (or copyField) that as an ID on the Solr side? Alternatively, if you don't actually have good IDs, you could look into UpdateRequestProcessor chains with UUID generator. Regards

Re: SimplePostTool with extracted Outlook messages

2015-01-27 Thread Mark
Thanks Eric However java -classpath dist/solr-core-4.10.3.jar -Dauto=true org.apache.solr.util.SimplePostTool C:/temp/samplemsg/*.msg Fails with: osting files to base url http://localhost:8983/solr/update.. ntering auto mode. File endings considered are xml,json,csv,pdf,doc,docx,ppt,pptx,xls,xl

Re: SimplePostTool with extracted Outlook messages

2015-01-27 Thread Erik Hatcher
Try adding -Dauto=true and take away setting url. The type probably isn't needed then either. With the new Solr 5 bin/post it sets auto=true implicitly. Erik > On Jan 26, 2015, at 17:29, Mark wrote: > > Fantastic - that explians it > > Adding -Durl=" > http://localhost:8983/solr/upd

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Mark
Fantastic - that explians it Adding -Durl=" http://localhost:8983/solr/update/extract?commit=true&overwrite=true"; Get's me a little further POSTing file 6252671B765A1748992DF1A6403BDF81A4A22E00.msg SimplePostTool: WARNING: Solr returned an error #400 (Bad Request) for url: http://localh

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Alexandre Rafalovitch
Well, you are NOT posting to the same URL. On 26 January 2015 at 17:00, Mark wrote: > http://localhost:8983/solr/update Sign up for my Solr resources newsletter at http://www.solr-start.com/

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Mark
A little further This fails java -classpath dist/solr-core-4.10.3.jar -Dtype=application/vnd.ms-outlook org.apache.solr.util.SimplePostTool C:/temp/samplemsg/*.msg With: SimplePostTool: WARNING: IOException while reading response: java.io.IOException: Server returned HTTP response code: 415 fo

Re: SimplePostTool with extracted Outlook messages

2015-01-26 Thread Alexandre Rafalovitch
Seems like apple to oranges comparison here. I would try giving an explicit end point (.../extract), a single message, and a literal id for the SimplePostTool and seeing whether that works. Not providing an ID could definitely be an issue. I would also specifically look on the server side in the