[ https://issues.apache.org/jira/browse/LUCENE-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17384501#comment-17384501 ]
Gautam Worah edited comment on LUCENE-9450 at 7/21/21, 1:01 AM: ---------------------------------------------------------------- [~mikemccand] I tried the approach of indexing the new BinaryDocValues field with a different name. Currently, we index the StringField with the internal `$full_path$` name, but indexing the BinaryDocValues field with a different name and then trying to pull out ({color:#000000}leafReader{color}.getBinaryDocValues()) the BDV runs successfully (passes the back compat test, and users won't need to reindex). The side-effect of this is that users might have some categories in the `$full_path$` Lucene field and the newer (v9.0) categories in the `$full_path_binary$` Lucene field. However, this field addition logic is entirely hidden from the users so we should be fine.. was (Author: gworah): I tried the approach of indexing the new BinaryDocValues field with a different name. Currently, we index the StringField with the internal `$full_path$` name, but indexing the BinaryDocValues field with a different name and then trying to pull out ({color:#000000}leafReader{color}.getBinaryDocValues()) the BDV runs successfully (passes the back compat test, and users won't need to reindex). The side-effect of this is that users might have some categories in the `$full_path$` Lucene field and the newer (v9.0) categories in the `$full_path_binary$` Lucene field > Taxonomy index should use DocValues not StoredFields > ---------------------------------------------------- > > Key: LUCENE-9450 > URL: https://issues.apache.org/jira/browse/LUCENE-9450 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/facet > Affects Versions: 8.5.2 > Reporter: Gautam Worah > Priority: Minor > Labels: performance > Fix For: main (9.0) > > Attachments: LUCENE-9450-localrun.py-v1, wip_taxonomy_patch > > Time Spent: 3h 50m > Remaining Estimate: 0h > > The taxonomy index that maps binning labels to ordinals was created before > Lucene added BinaryDocValues. > I've attached a WIP patch (does not pass tests currently) > Issue suggested by [~mikemccand] -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org