: <tokenizer class="solr.PatternTokenizerFactory" pattern="; *" />
	...
: Whether to use this idea or Bastian's depends on how the original data source
: is organized.

it also depends on what you want to get *out* if this is a stored field ... using an analyzer like this will let you facet on the individual terms, but the stored value returned with each document will still be a single semicolon-separated string. modifying your indexing code (either DIH, or whatever) will actually result in an array of distinct values being returned for the documents.

-Hoss
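
For illustration only, here is a minimal sketch of the second option: splitting the value on the client side before the document is indexed. It assumes the indexing code is written in Python, and the names `split_values`, `id` and `category` are hypothetical. How the document is actually sent to Solr (DIH, SolrJ, plain HTTP) is left out, and the target field would need `multiValued="true"` in the schema.

```python
import re

# Same delimiter as the PatternTokenizerFactory pattern quoted above:
# a semicolon followed by any number of spaces.
DELIMITER = re.compile(r"; *")

def split_values(raw):
    """Split a semicolon-separated string into a list of separate values.

    Hypothetical helper: empty entries (e.g. from a trailing semicolon)
    are dropped and surrounding whitespace is trimmed.
    """
    parts = (part.strip() for part in DELIMITER.split(raw))
    return [part for part in parts if part]

# Hypothetical document; the field names are illustrative only.
doc = {"id": "1", "category": split_values("books; music;films")}
print(doc["category"])  # ['books', 'music', 'films']
```

With the values sent as a list, each one is stored separately, so the document comes back with an array rather than the original single string.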