Hi, I want to use the twitter-text library's GitHub implementation to filter the tokens (hashtags) in my text. I know I could also use the pattern-matching tokenizer, but I would trust Twitter's library more than my own regex to do the job. I want to use it together with solr.WhitespaceTokenizerFactory to get the tokens.
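To make the goal concrete, here is a rough sketch in plain Java of the behaviour I am after. It is not Solr or twitter-text code: the class name, the method name, and the NAIVE_HASHTAG check are placeholders I made up for illustration, and the check is exactly the part I would like the twitter-text library to replace.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

class HashtagFilterSketch {

    // Placeholder check only (an assumption for this sketch). In the real
    // setup this decision would come from the twitter-text library rather
    // than from a hand-written rule.
    static final Predicate<String> NAIVE_HASHTAG =
            token -> token.length() > 1 && token.charAt(0) == '#';

    // Split on whitespace (roughly what a whitespace tokenizer does) and
    // keep only the tokens accepted by the supplied check.
    static List<String> filterHashtags(String text, Predicate<String> isHashtag) {
        List<String> kept = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (!token.isEmpty() && isHashtag.test(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // prints [#Solr, #lucene]
        System.out.println(filterHashtags("Loving #Solr and #lucene today", NAIVE_HASHTAG));
    }
}
```

In Solr terms, the whitespace split would be done by solr.WhitespaceTokenizerFactory and the keep/drop decision would sit in a token filter placed after it.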
I need help understanding how to do that. Do I have to refactor the Twitter Java library so that it "extends TokenFilterFactory", or can I use it the way it is?

Regards,
Sid.