Yes, this looks like a Tika feature.  If you run the tika-app.jar [1] on your file 
and you get the same output, then that's Tika's doing.

Drop a note on the u...@tika.apache.org list if Tika isn't meeting your needs.
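
A minimal sketch of that check, assuming tika-app.jar sits in the current directory (the TIKA_JAR variable, the DRY_RUN switch and the tika_recursive helper are made up for illustration; they are not part of Tika). The -J (--jsonRecursive) option of tika-app prints metadata and content for the container and for each embedded document, so you can see whether the individual messages in the PST come out separately when Solr is not involved:

```shell
#!/bin/sh
# Assumption: path to the tika-app jar you downloaded; adjust as needed.
TIKA_JAR="${TIKA_JAR:-tika-app.jar}"

# Hypothetical helper: run tika-app in recursive JSON mode on one file.
# With DRY_RUN=1 it only prints the command it would run.
tika_recursive() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "java -jar $TIKA_JAR -J $1"
    else
        # -J / --jsonRecursive: metadata and content for all embedded files
        java -jar "$TIKA_JAR" -J "$1"
    fi
}
```

For example, `tika_recursive /home/ec2-user/sateamc_0006.pst > pst.json` would write the recursive output to a file you can compare with what ends up in Solr.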

-----Original Message-----
From: Sreenivasa Kallu [mailto:sreenivasaka...@gmail.com] 
Sent: Thursday, February 11, 2016 1:43 PM
To: solr-user@lucene.apache.org
Subject: outlook email file pst extraction problem

Hi,
       I am currently indexing individual Outlook messages, and searching is 
working fine.
I created the Solr core using the following command:
 ./solr create -c sreenimsg1 -d data_driven_schema_configs

I am using the following command to index individual messages:
curl "http://localhost:8983/solr/sreenimsg/update/extract?literal.id=msg9&uprefix=attr_&fmap.content=attr_content&commit=true" \
  -F "myfile=@/home/ec2-user/msg9.msg"

This setup is working fine.

But the new requirement is to extract messages from an Outlook PST file.
I tried the following command to extract messages from the PST file:

curl "http://localhost:8983/solr/sreenimsg1/update/extract?literal.id=msg7&uprefix=attr_&fmap.content=attr_content&commit=true" \
  -F "myfile=@/home/ec2-user/sateamc_0006.pst"

This command extracts only the high-level tags and puts all the messages into 
one message. I am not getting all the tags that I get when I extract individual 
messages. Is the above command correct? Is the problem that I am not using 
recursion? How do I add recursion to the above command? Or is it a problem in 
the Tika library?

Please help me solve the above problem.

Thanks in advance.

--sreenivasa kallu
