Thanks for the quick response!

Are you saying that I should extend WhitespaceTokenizerFactory to create my
own? Or should I simply use it?

I ask because I suspect tokenizing on whitespace alone wouldn't be enough. I
would also need to tokenize on slashes in other positions, just not within
strings matching ([A-Z]+/[0-9]+/[0-9]+).
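
To make the requirement concrete, here is a rough sketch in plain
java.util.regex. It is not an actual Solr tokenizer, and the class and method
names are made up for illustration. The idea is to try the protected pattern
first and otherwise fall back to runs of characters that are neither
whitespace nor a slash:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Solr or Lucene.
class SlashAwareTokenizer {
    // First alternative: the protected form, kept as one token.
    // Second alternative: any run of characters that are neither
    // whitespace nor '/', so slashes elsewhere act as separators.
    private static final Pattern TOKEN =
            Pattern.compile("[A-Z]+/[0-9]+/[0-9]+|[^\\s/]+");

    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        // Each successive match is one token; unmatched characters
        // (whitespace and stray slashes) are simply skipped.
        while (m.find()) {
            tokens.add(m.group());
        }
        return tokens;
    }
}
```

For the input "see ABC/12/34 in path/to/file" this yields the tokens see,
ABC/12/34, in, path, to, file. Note that the sketch does not anchor the
protected pattern on word boundaries, so something like "abcABC/12/34" would
still be split at the slashes. If I read the docs correctly, the same
expression might work with PatternTokenizerFactory and group="0", but I have
not tried that.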

Marian


2011/11/30 Erick Erickson <erickerick...@gmail.com>

> There are about a zillion tokenizers; for what you're describing,
> WhitespaceTokenizerFactory is a good candidate.
>
> See: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
> for a partial list, and it has links to the authoritative docs.
>
> Best
> Erick
>
>
