I think I agree with Justin here, I think the way analysis tool highlights 'matches' is extremely misleading, especially considering it completely ignores queryparsing.
it would be better if it put your text in a memoryindex and actually parsed the query w/ queryparser, ran it, and used the highlighter to try to show any matches. On Wed, Aug 4, 2010 at 10:14 AM, Justin Lolofie <jta...@gmail.com> wrote: > Erik: Yes, I did re-index if that means adding the document again. > Here are the exact steps I took: > > 1. analysis.jsp "ABC12" does NOT match title "ABC12" (however, ABC or 12 > does) > 2. changed schema.xml WordDelimeterFilterFactory catenate-all > 3. restarted tomcat > 4. deleted the document with title "ABC12" > 5. added the document with title "ABC12" > 6. query "ABC12" does NOT result in the document with title "ABC12" > 7. analysis.jsp "ABC12" DOES match that document now > > Is there any way to see, given an ID, how something is indexed internally? > > Lance: I understand the index/query sections of analysis.jsp. However, > it operates on text that you enter into the form, not on actual index > data. Since all my documents have a unique ID, I'd like to supply an > ID and a query, and get back the same index/query sections- using > whats actually in the index. > > > ---------- Forwarded message ---------- > From: Erik Hatcher <erik.hatc...@gmail.com> > To: solr-user@lucene.apache.org > Date: Tue, 3 Aug 2010 22:43:17 -0400 > Subject: Re: analysis tool vs. reality > Did you reindex after changing the schema? > > > On Aug 3, 2010, at 7:35 PM, Justin Lolofie wrote: > > Hi Erik, thank you for replying. So, turning on debugQuery shows > information about how the query is processed- is there a way to see > how things are stored internally in the index? > > My query is "ABC12". There is a document who's "title" field is > "ABC12". However, I can only get it to match if I search for "ABC" or > "12". This was also true in the analysis tool up until recently. > However, I changed schema.xml and turned on catenate-all in > WordDelimterFilterFactory for title fieldtype. Now, in the analysis > tool "ABC12" matches "ABC12". However, when doing an actual query, it > does not match. > > Thank you for any help, > Justin > > > ---------- Forwarded message ---------- > From: Erik Hatcher <erik.hatc...@gmail.com> > To: solr-user@lucene.apache.org > Date: Tue, 3 Aug 2010 16:50:06 -0400 > Subject: Re: analysis tool vs. reality > The analysis tool is merely that, but during querying there is also a > query parser involved. Adding debugQuery=true to your request will > give you the parsed query in the response offering insight into what > might be going on. Could be lots of things, like not querying the > fields you think you are to a misunderstanding about some text not > being analyzed (like wildcard clauses). > > Erik > > On Aug 3, 2010, at 4:43 PM, Justin Lolofie wrote: > > Hello, > > I have found the analysis tool in the admin page to be very useful in > understanding my schema. I've made changes to my schema so that a > particular case I'm looking at matches properly. I restarted solr, > deleted the document from the index, and added it again. But still, > when I do a query, the document does not get returned in the results. > > Does anyone have any tips for debugging this sort of issue? What is > different between what I see in analysis tool and new documents added > to the index? > > Thanks, > Justin > -- Robert Muir rcm...@gmail.com