[ https://issues.apache.org/jira/browse/LUCENE-10250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17447714#comment-17447714 ]
Robert Muir commented on LUCENE-10250: -------------------------------------- And in case you are curious, that default implementation of {{lookupTerm}} is optimized for the default codec. so don't be afraid of the binary search :) Sorting and other stuff uses it, too. Default codec does binary search first on "blocks" and only decompresses only the one single block needed, then scans to the target: https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/codecs/lucene90/Lucene90DocValuesProducer.java#L1028 https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/codecs/lucene90/Lucene90DocValuesProducer.java#L1193 > Add hierarchical labels to SSDV facets > -------------------------------------- > > Key: LUCENE-10250 > URL: https://issues.apache.org/jira/browse/LUCENE-10250 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Marc D'Mello > Priority: Major > Labels: discussion > > Hi all, > I recently [added a new benchmarking > task|https://github.com/mikemccand/luceneutil/issues/141] to {{luceneutil}} > to count facets on a random word chosen from each document which would give > us a very high cardinality facet benchmarking compared to the faceting > benchmarks we already had. After being merged, [~mikemccand] pointed out some > [interesting > results|https://home.apache.org/~mikemccand/lucenebench/BrowseRandomLabelTaxoFacets.html] > in the nightly benchmarks where the {{BrowseRandomLabelSSDVFacets}} task was > much faster than the {{BrowseRandomLabelTaxoFacets}} task. > I was thinking that using SSDV facets instead of taxonomy facets for our use > case at Amazon Product Search could potentially lead to some increases in QPS > and decreases in index size, but the issue is we use hierarchical labels, and > as I understand it, SSDV faceting only supports a 2 level hierarchy as of > today. This leads to my question of why is there a limitation like this on > SSDV facets? Is hierarchical labels just a feature that hasn't been > implemented in SSDV facets yet, or is there some more complex reason that we > can't add hierarchical labels to SSDV facets? > Thanks! -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org