It's not clear what your question or problem is. Try explaining it in simple English first. Autocomplete is fairly simple - no need for the complexity of an ngram filter.

Here's an example of a suggester component and request handler based on a simple text field:

<searchComponent class="solr.SpellCheckComponent" name="suggest">
 <lst name="spellchecker">
   <str name="name">suggest</str>
   <str name="classname">org.apache.solr.spelling.suggest.Suggester</str>
<str name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookup</str>
   <str name="field">name</str>
   <str name="buildOnCommit">true</str>
 </lst>
</searchComponent>

<requestHandler class="org.apache.solr.handler.component.SearchHandler" name="/suggest">
 <lst name="defaults">
   <str name="spellcheck">true</str>
   <str name="spellcheck.dictionary">suggest</str>
   <str name="spellcheck.onlyMorePopular">true</str>
   <str name="spellcheck.count">5</str>
   <str name="spellcheck.collate">true</str>
 </lst>
 <arr name="components">
   <str>suggest</str>
 </arr>
</requestHandler>

-- Jack Krupansky

-----Original Message----- From: Sujatha Arun
Sent: Tuesday, January 22, 2013 12:59 AM
To: solr-user@lucene.apache.org
Subject: AutoComplete with FiterQuery for Full content

Hi,

I need suggestion  on solr Autocomplete for Full content with Filter query.

I have currently implemented this as below


  1. Solr version 3.6.1
  2. solr.StandardTokenizerFactory
  3. EdgeNGramFilterFactory with maxGramSize="25" minGramSize="1"
  4. Stored the content field
  5. Use the Fastvectorhighter and breakiterator on WORD to return results
  based on  standard analyzer with a fragsize of 20 &using the fq param as
  required

This seems to provide snippets ,but they seem like junk at times and not
really relevant as they are pieces of sentence with search term in them .It
could be like
the <keyword> and ...eg: on searching river  suggestion is  - the river and
...which does not really make sense as a suggestion...

So other options of


  - facets support fq but cannot be used for fullcontent tokenized text
  due to performance issue


  1. Can we use a tool that can just extract keywords/phrases from the
Full content and that can either be indexed or updated to Db and same can
  be used to serve the autocomplete?
  2. Any other methods?
  3. Are there any opensource tools for keyword extraction? Sematext has a
  commercial tool for the same.
  4. Which would be better for Autocomplete  - DB / Index in terms of
  speed /performance?

Any pointers?

Regards,
Sujatha

Reply via email to