[GitHub] [lucene] jpountz commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751011311 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,42 @@ public void testFieldExistsButNoDocsH

[GitHub] [lucene] zacharymorn commented on a change in pull request #418: LUCENE-10061: Implements dynamic pruning support for CombinedFieldsQuery

2021-11-17 Thread GitBox
zacharymorn commented on a change in pull request #418: URL: https://github.com/apache/lucene/pull/418#discussion_r751018772 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/CombinedFieldQuery.java ## @@ -441,6 +491,273 @@ public boolean isCacheable(LeafR

[jira] [Commented] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo commented on LUCENE-10233: --- [~jpountz] Thanks for the guide! Actually, there

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 9:25 AM: ---

[GitHub] [lucene] sonatype-lift[bot] commented on a change in pull request #438: LUCENE-10233: Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread GitBox
sonatype-lift[bot] commented on a change in pull request #438: URL: https://github.com/apache/lucene/pull/438#discussion_r751055798 ## File path: lucene/core/src/java/org/apache/lucene/util/SparseFixedBitSet.java ## @@ -530,4 +530,33 @@ public long ramBytesUsed() { public St

[GitHub] [lucene] bruno-roustant merged pull request #430: LUCENE-10225: Improve IntroSelector.

2021-11-17 Thread GitBox
bruno-roustant merged pull request #430: URL: https://github.com/apache/lucene/pull/430 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-uns

[GitHub] [lucene] spyk commented on a change in pull request #380: LUCENE-10171 - Fix dictionary-based OpenNLPLemmatizerFilterFactory caching issue

2021-11-17 Thread GitBox
spyk commented on a change in pull request #380: URL: https://github.com/apache/lucene/pull/380#discussion_r751061365 ## File path: lucene/analysis/opennlp/src/java/org/apache/lucene/analysis/opennlp/tools/OpenNLPOpsFactory.java ## @@ -169,11 +169,14 @@ public static String ge

[jira] [Commented] (LUCENE-10225) Improve IntroSelector with 3-way partitioning

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445043#comment-17445043 ] ASF subversion and git services commented on LUCENE-10225: -- Co

[jira] [Commented] (LUCENE-10225) Improve IntroSelector with 3-way partitioning

2021-11-17 Thread Bruno Roustant (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445045#comment-17445045 ] Bruno Roustant commented on LUCENE-10225: - I'm a bit confused. I put this chang

[jira] [Commented] (LUCENE-10225) Improve IntroSelector with 3-way partitioning

2021-11-17 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445047#comment-17445047 ] Adrien Grand commented on LUCENE-10225: --- Targeting 9.1 with this change sounds go

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 10:13 AM: --

[jira] [Commented] (LUCENE-10238) Update icu4j to 70.1

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445059#comment-17445059 ] Dawid Weiss commented on LUCENE-10238: -- Ah, thanks! I've rebuilt on windows too (n

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751102338 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,42 @@ public void testFieldExistsButNoDocs

[jira] [Commented] (LUCENE-10225) Improve IntroSelector with 3-way partitioning

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445097#comment-17445097 ] ASF subversion and git services commented on LUCENE-10225: -- Co

[jira] [Resolved] (LUCENE-10225) Improve IntroSelector with 3-way partitioning

2021-11-17 Thread Bruno Roustant (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno Roustant resolved LUCENE-10225. - Fix Version/s: 9.1 Resolution: Fixed Thanks Dawid and Adrien! > Improve IntroSe

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 10:57 AM: --

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 11:00 AM: --

[jira] [Commented] (LUCENE-10238) Update icu4j to 70.1

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445121#comment-17445121 ] Dawid Weiss commented on LUCENE-10238: -- I've removed those specialized icu_xyz ver

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445032#comment-17445032 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 12:37 PM: --

[GitHub] [lucene] jpountz commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751199436 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,50 @@ public void testFieldExistsButNoDocsH

[jira] [Commented] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445143#comment-17445143 ] Feng Guo commented on LUCENE-10233: --- [~jpountz] I execute a low cardinality terms que

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751226805 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,50 @@ public void testFieldExistsButNoDocs

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751228998 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,52 @@ public void testFieldExistsButNoDocs

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751234461 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,50 @@ public void testFieldExistsButNoDocs

[GitHub] [lucene] rmuir commented on pull request #447: LUCENE-10238: Update icu4j to 70.1.

2021-11-17 Thread GitBox
rmuir commented on pull request #447: URL: https://github.com/apache/lucene/pull/447#issuecomment-971582845 FYI I opened https://issues.apache.org/jira/browse/LUCENE-10239 as a followup. I think with recent jflex we can actually remove our emoji regeneration task completely -- This is a

[jira] [Created] (LUCENE-10239) upgrade jflex (1.7.0 -> 1.8.2)

2021-11-17 Thread Robert Muir (Jira)
Robert Muir created LUCENE-10239: Summary: upgrade jflex (1.7.0 -> 1.8.2) Key: LUCENE-10239 URL: https://issues.apache.org/jira/browse/LUCENE-10239 Project: Lucene - Core Issue Type: Task

[GitHub] [lucene] jpountz commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751243685 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,52 @@ public void testFieldExistsButNoDocsH

[jira] [Commented] (LUCENE-10239) upgrade jflex (1.7.0 -> 1.8.2)

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445170#comment-17445170 ] Dawid Weiss commented on LUCENE-10239: -- +1. > upgrade jflex (1.7.0 -> 1.8.2) > --

[jira] [Created] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Robert Muir (Jira)
Robert Muir created LUCENE-10240: Summary: gradle regenerate fails on java 17 Key: LUCENE-10240 URL: https://issues.apache.org/jira/browse/LUCENE-10240 Project: Lucene - Core Issue Type: Task

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445189#comment-17445189 ] Robert Muir commented on LUCENE-10240: -- Of course the task that fails is one I am

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445143#comment-17445143 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 2:08 PM: ---

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445194#comment-17445194 ] Robert Muir commented on LUCENE-10240: -- This task passes with this patch: {noforma

[jira] [Commented] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445196#comment-17445196 ] Feng Guo commented on LUCENE-10233: --- In addition, here are the commit hashes of the c

[jira] [Comment Edited] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445143#comment-17445143 ] Feng Guo edited comment on LUCENE-10233 at 11/17/21, 2:28 PM: ---

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445253#comment-17445253 ] Robert Muir commented on LUCENE-10240: -- Bumping the groovy version fixes the issue

[jira] [Updated] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feng Guo updated LUCENE-10233: -- Attachment: image-2021-11-17-22-43-13-693.png > Store docIds as bitset when leafCardinality = 1 to sp

[jira] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233 ] Feng Guo deleted comment on LUCENE-10233: --- was (Author: gf2121): This is the flame graph of keeping running this script, it seems a lot of time took to new a SparseFixedBitSet, which allocate

[jira] [Commented] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445262#comment-17445262 ] Feng Guo commented on LUCENE-10233: --- This is the flame graph of keeping running this

[GitHub] [lucene] hendrikmuhs commented on pull request #433: LUCENE-10230 make demo builds easier to execute

2021-11-17 Thread GitBox
hendrikmuhs commented on pull request #433: URL: https://github.com/apache/lucene/pull/433#issuecomment-971656429 @dweiss Thanks for the feedback. I agree about avoiding gradle magic. The module suggestion is great and helped me finding a better solution for me (I want to run thing

[jira] [Assigned] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss reassigned LUCENE-10240: Assignee: Dawid Weiss > gradle regenerate fails on java 17 > ---

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445265#comment-17445265 ] Dawid Weiss commented on LUCENE-10240: -- Weird. I'll take a look. > gradle regener

[GitHub] [lucene] codaitya commented on pull request #446: LUCENE-10237 : Add MergeOnCommitTieredMergePolicy to sandbox

2021-11-17 Thread GitBox
codaitya commented on pull request #446: URL: https://github.com/apache/lucene/pull/446#issuecomment-971673541 > Let's rename to `MergeOnFlushTieredMergePolicy` since it technically merges on flushes, not commits? > > I haven't taken a deep look at the code, but is it specific to `T

[GitHub] [lucene] gsmiller commented on a change in pull request #264: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals (instead of custom binary format)

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #264: URL: https://github.com/apache/lucene/pull/264#discussion_r751338671 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/DocValuesOrdinalsReader.java ## @@ -41,12 +40,7 @@ public DocValuesOrdinalsReader(Stri

[GitHub] [lucene] gsmiller commented on a change in pull request #264: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals (instead of custom binary format)

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #264: URL: https://github.com/apache/lucene/pull/264#discussion_r751339846 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetUtils.java ## @@ -81,4 +84,19 @@ public long cost() { } }; } + + /** + *

[GitHub] [lucene] gsmiller commented on pull request #264: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals (instead of custom binary format)

2021-11-17 Thread GitBox
gsmiller commented on pull request #264: URL: https://github.com/apache/lucene/pull/264#issuecomment-971678277 I've removed all back-compat support now from this PR since we're trying to include this change in 9.0 (see #443). So this PR now reflects my proposed end-state in main. I still s

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445286#comment-17445286 ] Dawid Weiss commented on LUCENE-10240: -- That external groovy dependency can be upd

[GitHub] [lucene] mikemccand commented on a change in pull request #442: LUCENE-10122 Use NumericDocValue to store taxonomy parent array

2021-11-17 Thread GitBox
mikemccand commented on a change in pull request #442: URL: https://github.com/apache/lucene/pull/442#discussion_r751348311 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/TaxonomyIndexArrays.java ## @@ -130,15 +125,49 @@ private void initParents

[jira] [Created] (LUCENE-10241) Update OpenNLP to 1.9.4

2021-11-17 Thread Jeff Zemerick (Jira)
Jeff Zemerick created LUCENE-10241: -- Summary: Update OpenNLP to 1.9.4 Key: LUCENE-10241 URL: https://issues.apache.org/jira/browse/LUCENE-10241 Project: Lucene - Core Issue Type: Task

[GitHub] [lucene] mikemccand commented on a change in pull request #442: LUCENE-10122 Use NumericDocValue to store taxonomy parent array

2021-11-17 Thread GitBox
mikemccand commented on a change in pull request #442: URL: https://github.com/apache/lucene/pull/442#discussion_r751350351 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/TaxonomyIndexArrays.java ## @@ -130,40 +125,82 @@ private void initParents

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445292#comment-17445292 ] Dawid Weiss commented on LUCENE-10240: -- Seems like this is the problem: https://is

[GitHub] [lucene] dweiss commented on pull request #433: LUCENE-10230 make demo builds easier to execute

2021-11-17 Thread GitBox
dweiss commented on pull request #433: URL: https://github.com/apache/lucene/pull/433#issuecomment-971689826 I think some of these "demo" classes come from different modules - this is legacy and highly unstructured... But feel free to provide a patch, sure. I was thinking about it myself.

[GitHub] [lucene] dweiss commented on pull request #447: LUCENE-10238: Update icu4j to 70.1.

2021-11-17 Thread GitBox
dweiss commented on pull request #447: URL: https://github.com/apache/lucene/pull/447#issuecomment-971690639 Darn. Thanks for fixing changes.txt. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [lucene] jpountz commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751006192 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/BackCompatSortedNumericDocValues.java ## @@ -0,0 +1,155 @@ +/* + * Licensed to the Apac

[GitHub] [lucene] jpountz commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751361476 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetUtils.java ## @@ -81,4 +82,17 @@ public long cost() { } }; } + + /** + * D

[GitHub] [lucene] gsmiller commented on a change in pull request #442: LUCENE-10122 Use NumericDocValue to store taxonomy parent array

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #442: URL: https://github.com/apache/lucene/pull/442#discussion_r751355800 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/TaxonomyIndexArrays.java ## @@ -130,15 +125,49 @@ private void initParents(I

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751365695 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/DocValuesOrdinalsReader.java ## @@ -41,13 +48,21 @@ public DocValuesOrdinalsReader(Str

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445303#comment-17445303 ] Robert Muir commented on LUCENE-10240: -- Thanks for tracking down the groovy bug.

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751370043 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/BackCompatSortedNumericDocValues.java ## @@ -0,0 +1,155 @@ +/* + * Licensed to the Apa

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445309#comment-17445309 ] Dawid Weiss commented on LUCENE-10240: -- Absolutely. This seems unrelated. > gradl

[GitHub] [lucene] magibney commented on a change in pull request #380: LUCENE-10171 - Fix dictionary-based OpenNLPLemmatizerFilterFactory caching issue

2021-11-17 Thread GitBox
magibney commented on a change in pull request #380: URL: https://github.com/apache/lucene/pull/380#discussion_r751397674 ## File path: lucene/analysis/opennlp/src/java/org/apache/lucene/analysis/opennlp/tools/OpenNLPOpsFactory.java ## @@ -169,11 +169,14 @@ public static Strin

[jira] [Updated] (LUCENE-10233) Store docIds as bitset when leafCardinality = 1 to speed up addAll

2021-11-17 Thread Feng Guo (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feng Guo updated LUCENE-10233: -- Attachment: (was: image-2021-11-17-22-43-13-693.png) > Store docIds as bitset when leafCardinalit

[GitHub] [lucene] jzonthemtn opened a new pull request #448: LUCENE-10241: Updating OpenNLP to 1.9.4.

2021-11-17 Thread GitBox
jzonthemtn opened a new pull request #448: URL: https://github.com/apache/lucene/pull/448 # Description Updating OpenNLP dependency to 1.9.4. # Solution Updated OpenNLP dependency version. # Tests Tests passed. # Checklist Please r

[GitHub] [lucene] mikemccand commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
mikemccand commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751373424 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue()

[jira] [Commented] (LUCENE-10241) Update OpenNLP to 1.9.4

2021-11-17 Thread Jeff Zemerick (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445336#comment-17445336 ] Jeff Zemerick commented on LUCENE-10241: First time contributor to Lucene -- pl

[GitHub] [lucene] jpountz commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751413961 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetUtils.java ## @@ -81,4 +82,17 @@ public long cost() { } }; } + + /** + * D

[GitHub] [lucene] rmuir commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
rmuir commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751435736 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetsConfig.java ## @@ -409,9 +410,26 @@ private void processFacetFields( indexDrillDownT

[GitHub] [lucene] rmuir commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
rmuir commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751437818 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue() {

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751451039 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetsConfig.java ## @@ -409,9 +410,26 @@ private void processFacetFields( indexDrillDo

[GitHub] [lucene] dweiss opened a new pull request #449: LUCENE-10240: gradle regenerate fails on java 17

2021-11-17 Thread GitBox
dweiss opened a new pull request #449: URL: https://github.com/apache/lucene/pull/449 This should do the trick. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscr

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751453417 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetsConfig.java ## @@ -409,9 +410,26 @@ private void processFacetFields( indexDrillDo

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751453619 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/BackCompatSortedNumericDocValues.java ## @@ -0,0 +1,155 @@ +/* + * Licensed to the Apa

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751455429 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/FastTaxonomyFacetCounts.java ## @@ -69,31 +69,34 @@ public FastTaxonomyFacetCounts(

[GitHub] [lucene] dweiss merged pull request #447: LUCENE-10238: Update icu4j to 70.1.

2021-11-17 Thread GitBox
dweiss merged pull request #447: URL: https://github.com/apache/lucene/pull/447 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[jira] [Commented] (LUCENE-10238) Update icu4j to 70.1

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445378#comment-17445378 ] ASF subversion and git services commented on LUCENE-10238: -- Co

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751456634 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/FastTaxonomyFacetCounts.java ## @@ -69,31 +69,34 @@ public FastTaxonomyFacetCounts(

[jira] [Resolved] (LUCENE-10238) Update icu4j to 70.1

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss resolved LUCENE-10238. -- Fix Version/s: 9.1 Resolution: Fixed > Update icu4j to 70.1 > > >

[jira] [Commented] (LUCENE-10238) Update icu4j to 70.1

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445381#comment-17445381 ] ASF subversion and git services commented on LUCENE-10238: -- Co

[GitHub] [lucene] jtibshirani commented on a change in pull request #444: LUCENE-10236: Updated field-weight used in CombinedFieldQuery scoring calculation, and added a test

2021-11-17 Thread GitBox
jtibshirani commented on a change in pull request #444: URL: https://github.com/apache/lucene/pull/444#discussion_r751461104 ## File path: lucene/sandbox/src/test/org/apache/lucene/sandbox/search/TestCombinedFieldQuery.java ## @@ -165,6 +169,117 @@ public void testSameScore()

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751462508 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue() {

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751464048 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue() {

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751465599 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/TaxonomyFacetLabels.java ## @@ -62,7 +62,16 @@ public TaxonomyFacetLabels(TaxonomyRead

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751467194 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/TaxonomyFacetLabels.java ## @@ -168,24 +234,61 @@ public FacetLabel nextFacetLabel(int

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751467541 ## File path: lucene/facet/src/test/org/apache/lucene/facet/taxonomy/TestBackCompatSortedNumericDocValues.java ## @@ -0,0 +1,136 @@ +/* + * Licensed to the

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751468971 ## File path: lucene/facet/src/test/org/apache/lucene/facet/taxonomy/directory/TestBackwardsCompatibility.java ## @@ -138,4 +331,51 @@ private Path getInde

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751469402 ## File path: lucene/facet/src/test/org/apache/lucene/facet/taxonomy/directory/TestBackwardsCompatibility.java ## @@ -50,43 +74,192 @@ // Then move the

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751465599 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/TaxonomyFacetLabels.java ## @@ -62,7 +62,16 @@ public TaxonomyFacetLabels(TaxonomyRead

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751475270 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue() {

[GitHub] [lucene] dweiss commented on pull request #449: LUCENE-10240: gradle regenerate fails on java 17

2021-11-17 Thread GitBox
dweiss commented on pull request #449: URL: https://github.com/apache/lucene/pull/449#issuecomment-971804749 Yeah. I don't completely understand the cause either - I just thought not forcing groovy to create an anonymous implementation of the interface passed to andThen would dodge the pro

[GitHub] [lucene] dweiss merged pull request #449: LUCENE-10240: gradle regenerate fails on java 17

2021-11-17 Thread GitBox
dweiss merged pull request #449: URL: https://github.com/apache/lucene/pull/449 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445402#comment-17445402 ] ASF subversion and git services commented on LUCENE-10240: -- Co

[GitHub] [lucene] gsmiller commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
gsmiller commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751476550 ## File path: lucene/facet/src/java/org/apache/lucene/facet/FacetUtils.java ## @@ -81,4 +82,17 @@ public long cost() { } }; } + + /** + *

[jira] [Resolved] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss resolved LUCENE-10240. -- Fix Version/s: 9.1 Resolution: Fixed > gradle regenerate fails on java 17 > --

[jira] [Commented] (LUCENE-10240) gradle regenerate fails on java 17

2021-11-17 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445403#comment-17445403 ] ASF subversion and git services commented on LUCENE-10240: -- Co

[GitHub] [lucene] jpountz commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751482085 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/FastTaxonomyFacetCounts.java ## @@ -69,31 +69,34 @@ public FastTaxonomyFacetCounts(

[GitHub] [lucene] jpountz commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
jpountz commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751483461 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/FastTaxonomyFacetCounts.java ## @@ -69,31 +69,34 @@ public FastTaxonomyFacetCounts(

[GitHub] [lucene] rmuir commented on a change in pull request #443: LUCENE-10062: Switch to numeric doc values for encoding taxonomy ordinals

2021-11-17 Thread GitBox
rmuir commented on a change in pull request #443: URL: https://github.com/apache/lucene/pull/443#discussion_r751504639 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/OrdinalMappingLeafReader.java ## @@ -107,6 +113,64 @@ public BytesRef binaryValue() {

[GitHub] [lucene] zhaih commented on a change in pull request #442: LUCENE-10122 Use NumericDocValue to store taxonomy parent array

2021-11-17 Thread GitBox
zhaih commented on a change in pull request #442: URL: https://github.com/apache/lucene/pull/442#discussion_r751544194 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/DirectoryTaxonomyWriter.java ## @@ -92,15 +93,18 @@ private final Directory

[GitHub] [lucene] zhaih commented on a change in pull request #442: LUCENE-10122 Use NumericDocValue to store taxonomy parent array

2021-11-17 Thread GitBox
zhaih commented on a change in pull request #442: URL: https://github.com/apache/lucene/pull/442#discussion_r751545923 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/DirectoryTaxonomyWriter.java ## @@ -466,18 +476,22 @@ protected final void ensu

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751606167 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,52 @@ public void testFieldExistsButNoDocs

[GitHub] [lucene] pquentin commented on a change in pull request #445: LUCENE-10085: Implement Weight#count on DocValuesFieldExistsQuery

2021-11-17 Thread GitBox
pquentin commented on a change in pull request #445: URL: https://github.com/apache/lucene/pull/445#discussion_r751606710 ## File path: lucene/core/src/test/org/apache/lucene/search/TestDocValuesFieldExistsQuery.java ## @@ -206,6 +210,50 @@ public void testFieldExistsButNoDocs

  1   2   >