That’s a good place to start. The idea was to make sure titles that started 
with a date would not always be at the forefront and the actual title of the 
doc would be sorted. 

> On Jul 15, 2020, at 4:58 PM, Erick Erickson <erickerick...@gmail.com> wrote:
> 
> Yeah, it’s always a question “how much is enough/too much”.
> 
> That looks reasonable for alphatitle, but what about title? Your original
> question was that the sorting changes depending on which field you 
> sort on. If your title field uses something that tokenizes or doesn’t
> include the same analysis chain (particularly the lowercasing
> and patternreplace) then I’d expect the order to change.
> 
> Best,
> Erick
> 
>> On Jul 15, 2020, at 4:49 PM, David Hastings <hastings.recurs...@gmail.com> 
>> wrote:
>> 
>> thanks, ill check the admin, didnt want to send a big clock of text but:
>> 
>> 
>>  -
>>     -
>> 
>>     Tokenizer:
>>     org.apache.lucene.analysis.core.KeywordTokenizerFactoryclass:
>>     solr.KeywordTokenizerFactoryluceneMatchVersion: 7.1.0
>>     -
>> 
>>     Token Filters:
>>     org.apache.lucene.analysis.core.LowerCaseFilterFactoryclass:
>>     solr.LowerCaseFilterFactoryluceneMatchVersion: 7.1.0
>>     org.apache.lucene.analysis.miscellaneous.TrimFilterFactoryclass:
>>     solr.TrimFilterFactoryluceneMatchVersion: 7.1.0
>>     org.apache.lucene.analysis.pattern.PatternReplaceFilterFactorypattern:
>>     ([^a-z])replace: allclass: solr.PatternReplaceFilterFactoryreplacement
>>     luceneMatchVersion: 7.1.0
>>  -
>> 
>>  Query Analyzer:
>>  
>> <http://192.168.1.33:7300/solr/#/mega/analysis?analysis.fieldname=alphatitle>
>>  org.apache.solr.analysis.TokenizerChain
>>     -
>> 
>>     Tokenizer:
>>     org.apache.lucene.analysis.core.KeywordTokenizerFactoryclass:
>>     solr.KeywordTokenizerFactoryluceneMatchVersion: 7.1.0
>>     -
>> 
>>     Token Filters:
>>     org.apache.lucene.analysis.core.LowerCaseFilterFactoryclass:
>>     solr.LowerCaseFilterFactoryluceneMatchVersion: 7.1.0
>>     org.apache.lucene.analysis.miscellaneous.TrimFilterFactoryclass:
>>     solr.TrimFilterFactoryluceneMatchVersion: 7.1.0
>>     org.apache.lucene.analysis.pattern.PatternReplaceFilterFactorypattern:
>>     ([^a-z])replace: allclass: solr.PatternReplaceFilterFactoryreplacement
>>     luceneMatchVersion: 7.1.0
>> 
>> 
>>> On Wed, Jul 15, 2020 at 4:47 PM Erick Erickson <erickerick...@gmail.com>
>>> wrote:
>>> 
>>> I’d look two places:
>>> 
>>> 1> try the admin/analysis page from the admin UI. In particular, look at
>>> what tokens actually get in the index.
>>> 
>>> 2> again, the admin UI will let you choose the field (alphatitle and
>>> title) and see what the actual indexed tokens are.
>>> 
>>> Both have the issue that I don’t know what tokenizer you are using. For
>>> sorting it better be something
>>> like KeywordTokenizer. Anything that breaks up the input into separate
>>> tokens will produce surprises.
>>> 
>>> And unless you have lowercaseFilter in front of your patternreplace,
>>> you’re removing uppercase characters.
>>> 
>>> Best,
>>> Erick
>>> 
>>>> On Jul 15, 2020, at 3:06 PM, David Hastings <
>>> hastings.recurs...@gmail.com> wrote:
>>>> 
>>>> howdy,
>>>> i have a field that sorts fine all other content, and i cant seem to
>>> debug
>>>> why it wont sort for me on this one chunk of it.
>>>> "sort":"alphatitle asc", "debugQuery":"on", "_":"1594733127740"}},
>>> "response
>>>> ":{"numFound":3,"start":0,"docs":[ { "title":"Money orders", {
>>>> "title":"Finance,
>>>> consolidation and rescheduling of debts", { "title":"Rights in former
>>>> German Islands in Pacific", },
>>>> 
>>>> its using a copyfield from "title" to "alphatitle" that replaces all
>>>> punctuation
>>>> pattern: ([^a-z])replace: allclass: solr.PatternReplaceFilterFactory
>>>> 
>>>> and if i use just title it flips:
>>>> 
>>>> "title":"Finance, consolidation and rescheduling of debts"}, {
>>> "title":"Rights
>>>> in former German Islands in Pacific"}, { "title":"Money orders"}]
>>>> 
>>>> and im banging my head trying to figure out what it is about this
>>>> content in particular that is not sorting the way I would expect.
>>>> don't suppose someone would be able to lead me to a good place to look?
>>> 
>>> 
> 

Reply via email to