WDF is not WTF(what I think when I see WDF), right ;-)

What is WDF?

Dennis Gearon

Signature Warning
----------------
EARTH has a Right To Life,
  otherwise we all die.

Read 'Hot, Flat, and Crowded'
Laugh at http://www.yert.com/film.php


--- On Thu, 9/23/10, Markus Jelsma <markus.jel...@buyways.nl> wrote:

> From: Markus Jelsma <markus.jel...@buyways.nl>
> Subject: RE: Search a URL
> To: solr-user@lucene.apache.org
> Date: Thursday, September 23, 2010, 2:11 PM
> Try setting generateWordParts=1 in
> your WDF. Also, having a WhitespaceTokenizer makes little
> sense for URL's, there should be no whitespace in a URL, the
> StandardTokenizer can tokenize a URL. Anyway, the problem is
> your WDF.
>  
> -----Original message-----
> From: Max Lynch <ihas...@gmail.com>
> Sent: Thu 23-09-2010 23:00
> To: solr-user@lucene.apache.org;
> 
> Subject: Search a URL
> 
> Is there a tokenizer that will allow me to search for parts
> of a URL?  For
> example, the search "google" would match on the data "
> http://mail.google.com/dlkjadf";
> 
> This tokenizer factory doesn't seem to be sufficient:
> 
>        <fieldType name="text_standard"
> class="solr.TextField"
> positionIncrementGap="100">
>            <analyzer type="index">
>                <tokenizer
> class="solr.WhitespaceTokenizerFactory"/>
>                <filter
> class="solr.WordDelimiterFilterFactory"
> generateWordParts="0" generateNumberParts="1"
> catenateWords="1"
> catenateNumbers="1" catenateAll="0"
> splitOnCaseChange="1"/>
>                <filter
> class="solr.LowerCaseFilterFactory"/>
>                <filter
> class="solr.SnowballPorterFilterFactory"
> language="English" protected="protwords.txt"/>
>            </analyzer>
>            <analyzer type="query">
>                 <tokenizer
> class="solr.WhitespaceTokenizerFactory"/>
> 
>                 <filter
> class="solr.WordDelimiterFilterFactory"
> generateWordParts="0" generateNumberParts="1"
> catenateWords="1"
> catenateNumbers="1" catenateAll="0"
> splitOnCaseChange="1"/>
>                 <filter
> class="solr.LowerCaseFilterFactory"/>
>                 <filter
> class="solr.SnowballPorterFilterFactory"
> language="English" protected="protwords.txt"/>
>             </analyzer>
>    </fieldType>
> 
> Thanks.
>

Reply via email to