[ https://issues.apache.org/jira/browse/LUCENE-10274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17455990#comment-17455990 ]
Greg Miller edited comment on LUCENE-10274 at 12/8/21, 9:05 PM:
----------------------------------------------------------------

I was thinking that this would work over the same doc values indexed when creating "Point" fields (e.g., LongPoint), which is a binary field encoding all N dimensions into a single byte entry. So the faceting logic would inspect a single binary field encoding the N dimensions, testing whether it's contained in each hyperrectangle of interest.

UPDATE: Actually, I think I was confusing the current Point field impl with something else. I just glanced at the code and there isn't a current dv field of course (just the inverted points index). So yeah, will need some thought as to how to encode these as dvs.

> Implement "hyperrectangle" faceting
> -----------------------------------
>
>                 Key: LUCENE-10274
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10274
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/facet
>            Reporter: Greg Miller
>            Priority: Minor
>
> I'd be interested in expanding Lucene's faceting capabilities to aggregate a
> point field against a set of user-provided n-dimensional
> [hyperrectangles|https://en.wikipedia.org/wiki/Hyperrectangle]. This would be
> a generalization of {{LongRangeFacets}} / {{DoubleRangeFacets}} from a single
> dimension to n dimensions, and would complement {{PointRangeQuery}} well,
> providing the ability to facet ahead of "drilling down" on such a query.
> As a motivating use-case, imagine searching against movie documents that
> contain a 2-dimensional point storing "awards" the movie has received. One
> dimension encodes the year the award was won, while the other encodes the
> type of award as an ordinal. For example, the film "Nomadland" won the
> "Academy Awards Best Picture" award in 2021. Imagine providing a
> two-dimensional refinement to users allowing them to filter by the
> combination of award + year in a single action (e.g., using
> {{PointRangeQuery}}) and needing to get facet counts for these
> combinations ahead of time.
> Curious if the community thinks this functionality would be useful. Any
> thoughts?
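
To make the containment test from the comment concrete, here is a minimal Java sketch. It is not Lucene API: the class name, the method names, and the encoding (each of the N long dimensions written as 8 big-endian bytes into one binary value) are assumptions made up for illustration, and the comment's UPDATE notes that the real doc values encoding is still undecided. The award ordinals in the example are also invented.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

public class HyperRectangleFacetSketch {

    /** Hypothetical encoding: pack N long dimensions into one binary value, 8 bytes each. */
    static byte[] pack(long... dims) {
        ByteBuffer buf = ByteBuffer.allocate(dims.length * Long.BYTES);
        for (long d : dims) {
            buf.putLong(d);
        }
        return buf.array();
    }

    /** Inverse of pack: decode the binary value back into its N dimensions. */
    static long[] unpack(byte[] packed) {
        ByteBuffer buf = ByteBuffer.wrap(packed);
        long[] dims = new long[packed.length / Long.BYTES];
        for (int i = 0; i < dims.length; i++) {
            dims[i] = buf.getLong();
        }
        return dims;
    }

    /** True if the point lies within [min[i], max[i]] (inclusive) in every dimension. */
    static boolean contains(long[] min, long[] max, long[] point) {
        for (int i = 0; i < point.length; i++) {
            if (point[i] < min[i] || point[i] > max[i]) {
                return false;
            }
        }
        return true;
    }

    /** Counts, for each hyperrectangle r given by mins[r]/maxs[r], how many values fall inside it. */
    static int[] count(byte[][] values, long[][] mins, long[][] maxs) {
        int[] counts = new int[mins.length];
        for (byte[] value : values) {
            long[] point = unpack(value);
            for (int r = 0; r < mins.length; r++) {
                if (contains(mins[r], maxs[r], point)) {
                    counts[r]++;
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Each value is (year, award ordinal); the ordinals are made up for this example.
        byte[][] values = { pack(2021, 0), pack(2020, 0), pack(2021, 3) };
        // Rectangle 0: year 2021, ordinal 0. Rectangle 1: years 2000-2021, ordinals 0-1.
        long[][] mins = { {2021, 0}, {2000, 0} };
        long[][] maxs = { {2021, 0}, {2021, 1} };
        System.out.println(Arrays.toString(count(values, mins, maxs))); // prints [1, 2]
    }
}
```

This sketch decodes every value and checks every rectangle, so the cost is linear in values times rectangles. It only illustrates the per-value test, not how an implementation would be optimized.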