[
https://issues.apache.org/jira/browse/LUCENE-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384501#comment-17384501
]
Gautam Worah edited comment on LUCENE-9450 at 7/20/21, 10:04 PM:
-----------------------------------------------------------------
I tried the approach of indexing the new BinaryDocValues field with a different
name. Currently, we index the StringField with the internal `$full_path$` name,
but indexing the BinaryDocValues field with a different name and then trying to
pull out ({color:#000000}leafReader{color}.getBinaryDocValues()) the BDV runs
successfully (passes the back compat test, and users won't need to reindex).
The side-effect of this is that users might have some categories in the
`$full_path$` Lucene field and the newer (v9.0) categories in the
`$full_path_binary$` Lucene field
was (Author: gworah):
I tried the approach of indexing the new BinaryDocValues field with a different
name. Currently, we index the StringField with the internal `$full_path$` name,
but indexing the BinaryDocValues field with a different name and then trying to
pull out ({color:#000000}leafReader{color}.getBinaryDocValues()) the BDV runs
successfully (and passes the back compat test).
The side-effect of this is that users might have some categories in the
`$full_path$` Lucene field and the newer (v9.0) categories in the
`$full_path_binary$` Lucene field
> Taxonomy index should use DocValues not StoredFields
> ----------------------------------------------------
>
> Key: LUCENE-9450
> URL: https://issues.apache.org/jira/browse/LUCENE-9450
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/facet
> Affects Versions: 8.5.2
> Reporter: Gautam Worah
> Priority: Minor
> Labels: performance
> Fix For: main (9.0)
>
> Attachments: LUCENE-9450-localrun.py-v1, wip_taxonomy_patch
>
> Time Spent: 3h 50m
> Remaining Estimate: 0h
>
> The taxonomy index that maps binning labels to ordinals was created before
> Lucene added BinaryDocValues.
> I've attached a WIP patch (does not pass tests currently)
> Issue suggested by [~mikemccand]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]