Re: PDF keyword searches not accurate

2013-03-18 Thread JDJ
Does this make a difference? - JDJ "There are two kinds of people in the world; those who understand binary, and those who don't. -- View this message in context: http://lucene.472066.n3.nabble.com/PDF-keyword-searches-not-accurate-tp4046741p4048596.html Sent from the Solr -

Re: PDF keyword searches not accurate

2013-03-13 Thread JDJ
I don't know if this is going to make a difference, or not, but I just discovered that the version of Solr that ships with CF Server 9 is v1.4.0. - JDJ "There are two kinds of people in the world; those who understand binary, and those who don't. -- View this message in conte

Re: PDF keyword searches not accurate

2013-03-13 Thread JDJ
Unfortunately, I am not in control of the development environment, so installing a stand-alone Solr is not an option. Well, let me correct that.. I do have my own instance of ColdFusion Server on my local machine (sometimes I develop locally, sometimes I develop on the network), but if I installed

Re: PDF keyword searches not accurate

2013-03-12 Thread Jack Krupansky
PM To: solr-user@lucene.apache.org Subject: Re: PDF keyword searches not accurate Hello, Jack. (My name, as well.) Thank you for the advice and link. I'll have to see if there is a ColdFusion equivalent, as I'm not using HTTP to search the collection (CF has CFSEARCH that will work

Re: PDF keyword searches not accurate

2013-03-12 Thread JDJ
Hello, Michael. Thank you for your suggestion. I'm unfamiliar with analysis handler. Do you have a link, for that? Much appreciated, - JDJ "There are two kinds of people in the world; those who understand binary, and those who don't. -- View this message in context: http

Re: PDF keyword searches not accurate

2013-03-12 Thread Michael Della Bitta
You could also use the analysis handler to see if your field definition strips numeric input. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Mar 12, 20

Re: PDF keyword searches not accurate

2013-03-12 Thread Jack Krupansky
Use the "extract only" option for Solr Cell to get the text stream that was extracted by Solr Cell/Tika/PDFBox, then manually search through the response for some text that is near the "1386", and see what text is output in the vicinity of the "1386". See: http://wiki.apache.org/solr/Extractin