: It seems that the payloads are applied only to the original word that I : index and the WordDelimiterFilter doesn't apply the payloads to the tokens : it generates.
I believe you are correct. I think the general rule for most TokenFilters that you will find in Lucene/Solr is that they don't typically "clone" attributes (like payloads) when generating new Tokens -- it may be what you want in your use case, but there's no hard & fast rule that it would always make sense to do so. If you'd like to opne a jira (or submit a patch) i suspect a new "clonePayload" attribute could be added to the WDF Factory to drive this kind of behavior so people with use cases where it made sense could enable this -- but i haven't looked at that code (or the current TokenStream API) enough to have any idea how hard it would be. -Hoss