Re: Making tika process mail attachments eludes me

2013-04-01 Thread Chris Hostetter
: I believe that the handling of the multipart MIME lacks some error checking, and : it is probably related to the content outside the MIME boundaries (in my : example, the text "This is a multi-part message in MIME format."): : : I really hope that some SOLR developer can have a look, we canno

Re: Making tika process mail attachments eludes me

2013-03-18 Thread Marcos Garcia
Hi Leif I've had the same problem. I tried with 4.2.0 as well, in both fedora 17 and centos6, using java-6 and java-7 (openjdk and oracel/sun as well). I could NEVER use example-DIH against a mailbox having mails attachments. Only mails without them, even if they were HTML, but as long as I in

Making tika process mail attachments eludes me

2013-03-03 Thread Leif Hetlesæther
Been trying for a while to create an index of a mailbox. I have downloaded solr-4.1.0.tgz, configured example/example-DIH/solr/mail/conf/data-config.xml and emails are indexed, but the attachmens eludes me. The config says: "Note - In order to index attachments, set processAttachement="true" an