The metadata fields can be all sorts of strange, including spaces and
other strange characters. So, often, there is some issue on mapping.
But yes, please, add the howto to Wiki. You will need to get your
account whitelisted first (due to spammers), so send a separate email
with your Apache wiki i
Thanks again. I already had the Tika jars, but not the commandline one,
so I downloaded 1.5 and ran it against the docx and found:
So the name is prefixed, does that mean that I should add it prefixed
in the conf files as well? Ie:
Yes. Did that and now it works. Guess I should take the time
You can just download Tika from Apache site, it's a separate product
and has command line interface.
Or to use Solr extract handler: go through Solr tutorial, it explains
it. https://lucene.apache.org/solr/4_7_0/tutorial.html
Specifically, http://wiki.apache.org/solr/ExtractingRequestHandler and