iven that Lucene/Solr supports
> >>> > payloads on these field types, they just aren't exposed.
> >>> >
> >>> > As always I appreciate any ideas if I'm barking up the wrong tree here.
> >>> >
> >>> > On Tue, Au
2015 at 2:52 PM, Markus Jelsma <markus.jel...@openindex.io> wrote:
>>> >
>>> >> Well, if I remember correctly (I have no testing facility at hand)
>>> >> WordDelimiterFilter maintains payloads on emitted sub terms. So if you
>>> us
rectly (I have no testing facility at hand)
>> >> WordDelimiterFilter maintains payloads on emitted sub terms. So if you use
>> >> a KeywordTokenizer, input 'some text^PAYLOAD', and have a
>> >> DelimitedPayloadFilter, the entire string gets a payload. You c
buse
> >> WordDelimiterFilter for it because it has a types parameter that you can
> >> use to split it on whitespace if its input is not trimmed. Otherwise you
> >> can use any other character instead of a space as your input.
> >>
> >> This is a craz
s parameter that you can
>> use to split it on whitespace if its input is not trimmed. Otherwise you
>> can use any other character instead of a space as your input.
>>
>> This is a crazy idea, but it might work.
>>
>> -Original message-
>>
hitespace if its input is not trimmed. Otherwise you
> can use any other character instead of a space as your input.
>
> This is a crazy idea, but it might work.
>
> -Original message-
> > From:Jamie Johnson
> > Sent: Tuesday 25th August 2015 19:37
> > To: sol
t it might work.
-Original message-
> From:Jamie Johnson
> Sent: Tuesday 25th August 2015 19:37
> To: solr-user@lucene.apache.org
> Subject: Re: Tokenizers and DelimitedPayloadTokenFilterFactory
>
> To be clear, we are using payloads as a way to attach authorizations t
Oh My. What fun!
bq: I need a way to specify the payload on the other field types
Not to my knowledge. The payload mechanism is built on
the capability of having a filter in the analysis chain. And there's
no analysis chain with primitive types (string, numeric and the like).
Hmmm. Totally off t
To be clear, we are using payloads as a way to attach authorizations to
individual tokens within Solr. The payloads are normal Solr payloads,
though we are not using floats; we are using the identity payload encoder
(org.apache.lucene.analysis.payloads.IdentityEncoder) which allows for
storing a by
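
To make the float-versus-identity distinction concrete, here is a small plain-Python illustration. It is not Lucene code and calls no Solr API. It assumes that a float payload is stored as a 32-bit IEEE 754 value in four big-endian bytes and that the identity encoder keeps the characters after the delimiter as UTF-8 bytes; both are assumptions about the encoders, not something stated in this thread. The function names are hypothetical.

```python
import struct


def encode_float_payload(weight: float) -> bytes:
    """Pack a score weight into four bytes (assumed big-endian IEEE 754 layout)."""
    return struct.pack(">f", weight)


def encode_identity_payload(raw: str) -> bytes:
    """Keep the payload text as raw bytes, with no numeric interpretation."""
    return raw.encode("utf-8")


if __name__ == "__main__":
    # A float payload is always four bytes and only makes sense as a number.
    print(encode_float_payload(1.0))
    # An identity payload can carry an arbitrary label, such as an authorization.
    print(encode_identity_payload("PAYLOAD"))
```

The point of the sketch is only that an identity-style payload is opaque bytes, so it can carry something like an authorization label rather than a scoring weight.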
This really sounds like an XY problem. Or when you use
"payload" it's not the Solr payload.
So Solr Payloads are a float value that you can attach to
individual terms to influence the scoring. Attaching the
_same_ payload to all terms in a field is much the same
thing as boosting on any matches in
I would like to specify a particular payload for all tokens emitted from a
tokenizer, but don't see a clear way to do this. Ideally I could specify
that something like the DelimitedPayloadTokenFilter be run on the entire
field and then standard analysis be done on the rest of the field, so in
the
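
A minimal sketch of the behaviour being asked for, written in plain Python rather than against Lucene or Solr. It only simulates the idea: the payload is split off the whole field value once, the remaining text is tokenized, and the same payload bytes are attached to every token. The function name `tokens_with_field_payload` is hypothetical, the `^` delimiter is borrowed from the 'some text^PAYLOAD' example quoted elsewhere in this thread, and whitespace splitting merely stands in for standard analysis.

```python
from typing import List, Tuple


def tokens_with_field_payload(value: str, delimiter: str = "^") -> List[Tuple[str, bytes]]:
    """Simulate one payload for a whole field, copied onto each emitted token."""
    # Split on the LAST delimiter so the text itself may contain the character.
    text, sep, payload = value.rpartition(delimiter)
    if not sep:
        # No delimiter present: the whole value is text and the payload is empty.
        text, payload = value, ""
    # Identity-style encoding: keep the payload characters as UTF-8 bytes.
    payload_bytes = payload.encode("utf-8")
    # Whitespace tokenization is a stand-in for the real analysis chain.
    return [(token, payload_bytes) for token in text.split()]


if __name__ == "__main__":
    for token, payload in tokens_with_field_payload("some text^PAYLOAD"):
        print(token, payload)
```

Whether a real analysis chain preserves the payload on every sub-token depends on the filters involved, which this simulation does not model.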