Compare timings in the following cases: - Without the wildcard - With suffix wild card only - test* - With reverse wild card filter factory and two separate terms - *test OR test*
On Thu, Aug 8, 2013 at 8:15 PM, meena.sri...@mathworks.com < meena.sri...@mathworks.com> wrote: > Index size is around 150 GB and there are around 6.5 million documents in > the > index. Search on a specific text field is very slow, it takes 1 minute to 2 > minute for wildcard queries like *test* with no highlighting and no facets > This field contributes to 90% of index size. > This is my shema.xml > > <fieldType name="text_pl" class="solr.TextField"> > <analyzer> > <tokenizer class="solr.StandardTokenizerFactory"/> > <filter class="solr.LowerCaseFilterFactory"/> > <filter class="solr.StopFilterFactory" > words="stopwords.txt" > ignoreCase="true"/> > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="0" generateNumberParts="0" catenateWords="0" > catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/> > </analyzer> > </fieldType> > > </types> > > > <fields> > > <field name="syndrome" type="text_pl" indexed="true" stored="true" > required="true" multivalued="false" omitNorms="false"/> > <field name="test_file_result_id" type="long" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > <field name="start_date" type="date" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > <field name="job_id" type="long" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > <field name="test_run_id" type="long" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > <field name="cluster" type="string" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > <field name="logfile" type="text_ws" indexed="true" stored="true" > omitNorms="true" multivalued="false"/> > > > I am using DIH for indexing data from Database. Largest "syndrome" field > size is 5MB it can range from 5MB to 1KB > > I tried using whitespacetokeniser with not much luck, > I am using solr3.6.0 > Indexing takes 1.5 hours. > > Please let me know , if I need to add anything to improve the search speed. > > Thanks > Meena > > > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-search-on-a-large-text-field-is-very-slow-tp4083310.html > Sent from the Solr - User mailing list archive at Nabble.com. >