Screen captures generally get filtered out by the Apache mailing list, so yours didn't come through.
But this makes no sense. The text_en field type you pasted should not
be having the problems you're talking about. So if you add debug=true,
you should be seeing your "Wednesday" query going against your field.
If you just type q=Wednesday, then the default field is used, i.e. the
"df" parameter in the request handler you're using.

Best,
Erick

On Mon, Sep 21, 2015 at 12:57 PM, Mark Fenbers <mark.fenb...@noaa.gov> wrote:
> You were right about finding only the Wednesday occurrences at the beginning
> of the line. But attached (if it works) is a screen capture of my admin UI.
> But unlike your suspicion, the index text is being parsed properly, it
> appears. So I'm uncertain where this leads me.
>
> Also attached is the pertinent schema.xml snippet you asked for.
>
> The logtext column in my table contains merely keyboarded text, with the
> infrequent exception that I add a \uFFFC as a placeholder for images. So,
> should I be using something besides text_en as the fieldType?
>
> Thanks,
> Mark
>
> On 9/21/2015 12:12 PM, Erick Erickson wrote:
>>
>> bq: However, I discovered that if I search on "Wednesday*" (trailing
>> asterisk), then I get all the results containing Wednesday that I'm
>> looking for!
>>
>> This almost always means you're not searching on the field you think
>> you're searching on and/or the field isn't being analyzed as you think
>> (i.e. the fieldType isn't what you expect). If you're really searching
>> on a fieldType of text_en (and you haven't changed the definition),
>> then there's something very weird here. FieldTypes are totally
>> mutable, they are composed of various analysis chains that you (or
>> someone else) can freely alter, so seeing the <field> definition that
>> references a type="text_en" is suggestive but not definitive.
>>
>> I'm going to further guess that when you search on "Wednesday*", all
>> the matches are at the beginning of the line, and you find docs where
>> the field has "Wednesday, September...." but not "The party was on
>> Wednesday".
>>
>> So let's see the <fieldType> associated with the logtext field. Plus,
>> the results of adding &debug=true to the query.
>>
>> But you can get a lot of info a lot faster if you go to the admin UI
>> screen, select the proper core from the drop-down on the left side and
>> go to the "analysis" section. Pick the field (or field type), enter
>> some text and hit analyze (or uncheck the "verbose" box, that's
>> largely uninteresting info at this level). That'll show you exactly
>> how the input document is parsed, exactly how the query is parsed etc.
>> And be sure to enter something like
>> "september first was a Wednesday" in the left-hand (index) box, then
>> just "Wednesday" in the right hand (query) side. My bet: You'll see on
>> the index side that the input is not broken up, not transformed, etc.
>>
>> Best,
>> Erick
>>
>
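
To make the debug=true suggestion above concrete, here is a minimal Python sketch that builds the two query URLs worth comparing: one with a bare term (which falls back to the handler's "df" default field) and one that names the logtext field explicitly. The host, port, and the core name "mycore" are assumptions for illustration only, and debug_query_url is a hypothetical helper, not part of Solr; substitute your own core and request handler.

```python
from urllib.parse import urlencode

# Assumption: a local Solr on the default port with a core named "mycore"
# (hypothetical; replace with your own base URL).
SOLR_BASE = "http://localhost:8983/solr/mycore"


def debug_query_url(term, field=None, base=SOLR_BASE):
    """Build a /select URL with debug=true (hypothetical helper).

    With field=None the bare term is sent, so Solr uses the request
    handler's default field ("df"). With a field name, the query is
    sent as field:term so it targets that field explicitly.
    """
    q = term if field is None else f"{field}:{term}"
    params = {"q": q, "debug": "true", "wt": "json"}
    return f"{base}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Bare term: searched against whatever "df" is configured to.
    print(debug_query_url("Wednesday"))
    # Explicit field: searched against logtext.
    print(debug_query_url("Wednesday", field="logtext"))
```

Fetching both URLs and comparing the parsed query in the debug section of each response should show which field each query actually ran against.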