Re: Re: Re: Need help importing OOXML custom properties into Solr

2014-03-18 Thread Alexandre Rafalovitch
The metadata fields can be all sorts of strange, including spaces and other strange characters. So, often, there is some issue on mapping. But yes, please, add the howto to Wiki. You will need to get your account whitelisted first (due to spammers), so send a separate email with your Apache wiki i

Sv: Re: Re: Need help importing OOXML custom properties into Solr

2014-03-18 Thread Anders Gustafsson
Thanks again. I already had the Tika jars, but not the commandline one, so I downloaded 1.5 and ran it against the docx and found: So the name is prefixed, does that mean that I should add it prefixed in the conf files as well? Ie: Yes. Did that and now it works. Guess I should take the time

Re: Re: Need help importing OOXML custom properties into Solr

2014-03-18 Thread Alexandre Rafalovitch
You can just download Tika from Apache site, it's a separate product and has command line interface. Or to use Solr extract handler: go through Solr tutorial, it explains it. https://lucene.apache.org/solr/4_7_0/tutorial.html Specifically, http://wiki.apache.org/solr/ExtractingRequestHandler and

Sv: Re: Need help importing OOXML custom properties into Solr

2014-03-18 Thread Anders Gustafsson
Thanks for the quick reply. I am a bit of a newb when it comes to Solr, Lux and Tika so I would appreciate if you could give me some quick pointers how to use/call Tika directly and/or how to send one file directly and storing the dynamic field? -- Anders Gustafsson Engineer, CNI, CNE6, ASE

Re: Need help importing OOXML custom properties into Solr

2014-03-18 Thread Alexandre Rafalovitch
Have you tried just using Tika directly and seeing what gets output? Maybe it is all prefixed somehow. Or sending one file as a sample directly to the extract handler and temporarily storing the ignored_* dynamicField to see what actually happens? Basically, check what is there before trying to fi