: Actually I want to use anything that is not alphabet or digit to be the : separator - anything between them will be a word (so that I can use the URL : fragment to see what is indexed about this site)...any suggestion?
In addition to Mike's suggestion of trying out the WordDelimiterFilter, take a look at the PatternTokenizerFactory. -Hoss