On Tue, Nov 12, 2019 at 12:18 PM Chris Hostetter <hossman_luc...@fucit.org> wrote:
> > : > a) What is the fieldType of the uniqueKey field in use? > : > > : > : It is a textField > > whoa... that's not normal .. what *exactly* does the fieldType declaration > (with all analyzers) look like, and what does the <field/> declaration > look like? > > <field name="debtor_id" type="text_general" multiValued="false" indexed="true" required="true" stored="true"/> <fieldType name="text_gen_sort" class="solr.SortableTextField" positionIncrementGap="100" multiValued="true"> <analyzer type="index"> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.StopFilterFactory" words="stopwords.txt" ignoreCase="true"/> <filter class="solr.LowerCaseFilterFactory"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.StopFilterFactory" words="stopwords.txt" ignoreCase="true"/> <filter class="solr.SynonymGraphFilterFactory" expand="true" ignoreCase="true" synonyms="synonyms.txt"/> <filter class="solr.LowerCaseFilterFactory"/> </analyzer> </fieldType> > you should really never use TextField for a uniqueKey ... it's possible, > but incredibly tricky to get "right". > > I am going to adjust my schema, re-index, and try again. See if that doesn't fix this problem. I didn't know that having the uniqueKey be a textField was a bad idea. > Independent from that, "sorting" on a TextField doesn't always do what you > might think (again: depending on the analysis in use) > > With a cursorMark you have other factors to consider: i bet what's > happening is that the post-analysis terms for your docs result it > duplicate values, so the cursorMark is skipping all docs that have hte > same (post analysis) sort value ... this could also manifest itself in > other weird ways, like trying to deleteById. > > Step #1: switch to using a simple StrField for your uniqueKey field and > see if htat solves all your problems. > > Thanks, doing this now. Rhys