It looks like there were changes in Lucene 7.0 that limited the size of the automaton to prevent overflowing the stack.
https://issues.apache.org/jira/browse/LUCENE-7914 The commit being: https://github.com/apache/lucene-solr/commit/7dde798473d1a8640edafb41f28ad25d17f25a2d Kevin Risden On Tue, Apr 3, 2018 at 1:45 PM, David Hastings <hastings.recurs...@gmail.com > wrote: > For data, its primarily a lot of garbage, around 200k titles, varying > length. im actually looking through my application now to see if I even > still use it or if it was an early experiment. I am just finding it odd > thats its failing in 7 but does fine on 5 > > On Tue, Apr 3, 2018 at 2:41 PM, Erick Erickson <erickerick...@gmail.com> > wrote: > > > What kinds of things go into your title field? On first blush that's a > > bit odd for a multi-word title field since it treats the entire input > > as a single string. The code is trying to build a large FST to hold > > all of this data. Would AnalyzingInfixLookupFactory or similar make > > more sense? > > > > buildOnStartup and buildOnOptimize are other red flags. This means > > that every time you start up, the data for the title field is read > > from disk and the FST is built (or index if you use a different impl). > > On a large corpus this may take many minutes. > > > > Best, > > Erick > > > > On Tue, Apr 3, 2018 at 11:28 AM, David Hastings > > <hastings.recurs...@gmail.com> wrote: > > > Hey all, I recently got a 7.2 instance up and running, and it seems to > be > > > going well however, I have ran into this when creating one of my > indexes, > > > and was wondering if anyone had a quick idea right off the top of their > > > head. > > > > > > solrconfig: > > > > > > <searchComponent name="suggest" class="solr.SuggestComponent"> > > > <lst name="suggester"> > > > <str name="name">fixspell</str> > > > <str name="lookupImpl">FuzzyLookupFactory</str> > > > > > > <str name="suggestAnalyzerFieldType">string</str> > > > > > > <str name="dictionaryImpl">DocumentDictionaryFactory</str> > > > <str name="field">title</str> > > > <str name="buildOnStartup">true</str> > > > <str name="buildOnOptimize">true</str> > > > </lst> > > > > > > > > > received error: > > > > > > > > > ERROR true > > > SuggestComponent > > > Exception in building suggester index for: fixspell > > > java.lang.IllegalArgumentException: input automaton is too large: 1001 > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse( > > Operations.java:1298) > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse( > > Operations.java:1306) > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse( > > Operations.java:1306) > > > > > > ..... > > > > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse( > > Operations.java:1306) > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStatesRecurse( > > Operations.java:1306) > > > at > > > org.apache.lucene.util.automaton.Operations.topoSortStates(Operations. > > java:1275) > > > at > > > org.apache.lucene.search.suggest.analyzing. > > AnalyzingSuggester.replaceSep(AnalyzingSuggester.java:292) > > > at > > > org.apache.lucene.search.suggest.analyzing.AnalyzingSuggester. > > toAutomaton(AnalyzingSuggester.java:854) > > > at > > > org.apache.lucene.search.suggest.analyzing.AnalyzingSuggester.build( > > AnalyzingSuggester.java:430) > > > at org.apache.lucene.search.suggest.Lookup.build(Lookup.java:190) > > > at > > > org.apache.solr.spelling.suggest.SolrSuggester.build( > > SolrSuggester.java:181) > > > at > > > org.apache.solr.handler.component.SuggestComponent$SuggesterListener. > > buildSuggesterIndex(SuggestComponent.java:529) > > > at > > > org.apache.solr.handler.component.SuggestComponent$ > > SuggesterListener.newSearcher(SuggestComponent.java:511) > > > at org.apache.solr.core.SolrCore.lambda$getSearcher$17( > > SolrCore.java:2275) > > >