Hi all,

I've got a pattern in a document (call it "xy") that I want to turn into two tokens - "xy" and "y".

One approach I could use is PatternTokenizer to extract "xy", and then a custom filter that returns "xy" and then "y" on the next call (caches the next result).

Or I could extend PatternTokenizer to return multiple tokens per match, though figuring out how to specify that in the schema seems harder.

Is there another approach that wouldn't require any custom code?

Thanks,

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"If you can't find it, you can't fix it"

Reply via email to