Pradeep,

As indicated on the wiki 
<http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory>,
 the minShingleSize option is not available in Solr versions prior to 3.1.

What version of Solr are you using?

(By the way, I am only replying on solr-user@lucene.apache.org mailing list - 
the d...@lucene.apache.org mailing list is for the development of Lucene/Solr, 
not for questions about using the products; please ask first on 
solr-user@lucene.apache.org, if you think you have found a bug.  If you don't 
get an answer in a day or two, then it makes sense to escalate to 
d...@lucene.apache.org.)

Steve


> -----Original Message-----
> From: Pradeep Pujari [mailto:prade...@rocketmail.com]
> Sent: Thursday, July 28, 2011 1:43 PM
> To: solr-user@lucene.apache.org
> Subject: ShingleFilterFactory class error
> 
> Hi,
> 
> I am trying to create shingles with minShingleSize = 10, but it also
> returns bi-grams too. Heres is my schema defn
> 
>             <filter class="solr.ShingleFilterFactory" minShingleSize="10"
> maxShingleSize="25"
>                 outputUnigrams="false" outputUnigramsIfNoShingles="false"
> tokenSeparator=" "/>
> 
> 
> For the input String "Apple - iPad 3G Wi-Fi - 32GB", it breaks into
>     "Apple -"
>     "- iPad "
> 
> My understaing that it should be 10-gram token.
> 
> Is it bug or any configuration is to be added.
> 
> Thank you in advance.
> Pradeep

Reply via email to