date:20200604

[jira] [Updated] (SOLR-12823) remove clusterstate.json in Lucene/Solr 9.0

2020-06-04 Thread Jira



 [ 
https://issues.apache.org/jira/browse/SOLR-12823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-12823:
---
Summary: remove clusterstate.json in Lucene/Solr 9.0  (was: remove 
clusterstate.json in Lucene/Solr 8.0)

> remove clusterstate.json in Lucene/Solr 9.0
> ---
>
> Key: SOLR-12823
> URL: https://issues.apache.org/jira/browse/SOLR-12823
> Project: Solr
>  Issue Type: Task
>Reporter: Varun Thacker
>Priority: Major
>  Time Spent: 3h 50m
>  Remaining Estimate: 0h
>
> clusterstate.json is an artifact of a pre 5.0 Solr release. We should remove 
> that in 8.0
> It stays empty unless you explicitly ask to create the collection with the 
> old "stateFormat" and there is no reason for one to create a collection with 
> the old stateFormat.
> We should also remove the "stateFormat" argument in create collection
> We should also remove MIGRATESTATEVERSION as well
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-12823) remove clusterstate.json in Lucene/Solr 9.0

2020-06-04 Thread Jira



 [ 
https://issues.apache.org/jira/browse/SOLR-12823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-12823:
---
Description: 
clusterstate.json is an artifact of a pre 5.0 Solr release. We should remove 
that in 9.0

It stays empty unless you explicitly ask to create the collection with the old 
"stateFormat" and there is no reason for one to create a collection with the 
old stateFormat.

We should also remove the "stateFormat" argument in create collection

We should also remove MIGRATESTATEVERSION as well

 

 

  was:
clusterstate.json is an artifact of a pre 5.0 Solr release. We should remove 
that in 8.0

It stays empty unless you explicitly ask to create the collection with the old 
"stateFormat" and there is no reason for one to create a collection with the 
old stateFormat.

We should also remove the "stateFormat" argument in create collection

We should also remove MIGRATESTATEVERSION as well

 

 


> remove clusterstate.json in Lucene/Solr 9.0
> ---
>
> Key: SOLR-12823
> URL: https://issues.apache.org/jira/browse/SOLR-12823
> Project: Solr
>  Issue Type: Task
>Reporter: Varun Thacker
>Priority: Major
>  Time Spent: 3h 50m
>  Remaining Estimate: 0h
>
> clusterstate.json is an artifact of a pre 5.0 Solr release. We should remove 
> that in 9.0
> It stays empty unless you explicitly ask to create the collection with the 
> old "stateFormat" and there is no reason for one to create a collection with 
> the old stateFormat.
> We should also remove the "stateFormat" argument in create collection
> We should also remove MIGRATESTATEVERSION as well
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14467) inconsistent server errors combining relatedness() with allBuckets:true

2020-06-04 Thread Lucene/Solr QA (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125612#comment-17125612
 ] 

Lucene/Solr QA commented on SOLR-14467:
---

| (/) *{color:green}+1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
|| || || || {color:brown} Prechecks {color} ||
| {color:green}+1{color} | {color:green} test4tests {color} | {color:green}  0m 
 0s{color} | {color:green} The patch appears to include 4 new or modified test 
files. {color} |
|| || || || {color:brown} master Compile Tests {color} ||
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m  
4s{color} | {color:green} master passed {color} |
|| || || || {color:brown} Patch Compile Tests {color} ||
| {color:green}+1{color} | {color:green} compile {color} | {color:green}  1m  
4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green}  1m  
4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} Release audit (RAT) {color} | 
{color:green}  1m  8s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} Check forbidden APIs {color} | 
{color:green}  1m  4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} Validate source patterns {color} | 
{color:green}  1m  4s{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} Validate ref guide {color} | 
{color:green}  1m  4s{color} | {color:green} the patch passed {color} |
|| || || || {color:brown} Other Tests {color} ||
| {color:green}+1{color} | {color:green} unit {color} | {color:green} 45m 
16s{color} | {color:green} core in the patch passed. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 49m 53s{color} | 
{color:black} {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | SOLR-14467 |
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/13004759/SOLR-14467.patch |
| Optional Tests |  compile  javac  unit  ratsources  checkforbiddenapis  
validatesourcepatterns  validaterefguide  |
| uname | Linux lucene1-us-west 4.15.0-54-generic #58-Ubuntu SMP Mon Jun 24 
10:55:24 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux |
| Build tool | ant |
| Personality | 
/home/jenkins/jenkins-slave/workspace/PreCommit-SOLR-Build/sourcedir/dev-tools/test-patch/lucene-solr-yetus-personality.sh
 |
| git revision | master / 08a13ce5894 |
| ant | version: Apache Ant(TM) version 1.10.5 compiled on March 28 2019 |
| Default Java | LTS |
|  Test Results | 
https://builds.apache.org/job/PreCommit-SOLR-Build/760/testReport/ |
| modules | C: solr/core solr/solr-ref-guide U: solr |
| Console output | 
https://builds.apache.org/job/PreCommit-SOLR-Build/760/console |
| Powered by | Apache Yetus 0.7.0   http://yetus.apache.org |


This message was automatically generated.



> inconsistent server errors combining relatedness() with allBuckets:true
> ---
>
> Key: SOLR-14467
> URL: https://issues.apache.org/jira/browse/SOLR-14467
> Project: Solr
>  Issue Type: Bug
>  Components: Facet Module
>Reporter: Chris M. Hostetter
>Priority: Major
> Attachments: SOLR-14467.patch, SOLR-14467.patch, SOLR-14467.patch, 
> SOLR-14467.patch, SOLR-14467_allBuckets_refine.patch, SOLR-14467_test.patch, 
> SOLR-14467_test.patch, beast.log.txt, beast2.log.txt
>
>
> While working on randomized testing for SOLR-13132 i discovered a variety of 
> different ways that JSON Faceting's "allBuckets" option can fail when 
> combined with the "relatedness()" function.
> I haven't found a trivial way to manual reproduce this, but i have been able 
> to trigger the failures with a trivial patch to {{TestCloudJSONFacetSKG}} 
> which i will attach.
> Based on the nature of the failures it looks like it may have something to do 
> with multiple segments of different sizes, and or resizing the SlotAccs ?
> The relatedness() function doesn't have much (any?) existing tests in place 
> that leverage "allBuckets" so this is probably a bug that has always existed 
> -- it's possible it may be excessively cumbersome to fix and we might 
> nee/wnat to just document that incompatibility and add some code to try and 
> detect if the user combines these options and if so fail with a 400 error?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14518) Add support for partitioned unique agg to JSON facets

2020-06-04 Thread Mikhail Khludnev (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125661#comment-17125661
 ] 

Mikhail Khludnev commented on SOLR-14518:
-

[~dan2097] It sounds like a way to go, but I've already made wrong suggestion 
in the comment above. So, beware.  

> Add support for partitioned unique agg to JSON facets
> -
>
> Key: SOLR-14518
> URL: https://issues.apache.org/jira/browse/SOLR-14518
> Project: Solr
>  Issue Type: New Feature
>  Components: Facet Module
>Reporter: Joel Bernstein
>Priority: Major
>
> There are scenarios where documents are partitioned across shards based on 
> the same field that the *unique* agg is applied to with JSON facets. In this 
> scenario exact unique counts can be calculated by simply sending the bucket 
> level unique counts to the aggregator where they can be summed. Suggested 
> syntax is to add a boolean flag to the unique aggregation function: 
> *unique*(partitioned_field, true).
> The *true* value turns on the "partitioned" unique logic. The default is 
> false.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14521) Establish the Solr project (resolution)

2020-06-04 Thread Jira



 [ 
https://issues.apache.org/jira/browse/SOLR-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-14521:
---
Attachment: Skjermbilde 2020-06-04 kl. 11.23.53.png

> Establish the Solr project (resolution)
> ---
>
> Key: SOLR-14521
> URL: https://issues.apache.org/jira/browse/SOLR-14521
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Jan Høydahl
>Priority: Major
> Attachments: Skjermbilde 2020-06-04 kl. 11.23.53.png
>
>
> By submitting a board resolution to the next board meeting. See 
> [https://cwiki.apache.org/confluence/display/LUCENE/Solr+TLP+needed+changes#SolrTLPneededchanges-resolutionDraftboardresolution]
>  for the draft resolution



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14521) Establish the Solr project (resolution)

2020-06-04 Thread Jira



 [ 
https://issues.apache.org/jira/browse/SOLR-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-14521:
---
Description: 
By submitting a board resolution to the next board meeting. See 
[https://cwiki.apache.org/confluence/display/LUCENE/Solr+TLP+needed+changes#SolrTLPneededchanges-resolutionDraftboardresolution]
 for the draft resolution.

Once the resolution draft is approved (by vote or lazy consensus), PMC chair 
can add it to Board meeting agenda using Whimsy 
[https://whimsy.apache.org/board/agenda/2020-06-17/] and clicking the "Add 
Item" button and then "Establish Project" button. It's just a form with 
checkboxes for PMC members. The first PMC member selected becomes the chair.

The next meeting is June 17th, when Lucene quarterly report is also due.

!Skjermbilde 2020-06-04 kl. 11.23.53.png!

  was:By submitting a board resolution to the next board meeting. See 
[https://cwiki.apache.org/confluence/display/LUCENE/Solr+TLP+needed+changes#SolrTLPneededchanges-resolutionDraftboardresolution]
 for the draft resolution


> Establish the Solr project (resolution)
> ---
>
> Key: SOLR-14521
> URL: https://issues.apache.org/jira/browse/SOLR-14521
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Jan Høydahl
>Priority: Major
> Attachments: Skjermbilde 2020-06-04 kl. 11.23.53.png
>
>
> By submitting a board resolution to the next board meeting. See 
> [https://cwiki.apache.org/confluence/display/LUCENE/Solr+TLP+needed+changes#SolrTLPneededchanges-resolutionDraftboardresolution]
>  for the draft resolution.
> Once the resolution draft is approved (by vote or lazy consensus), PMC chair 
> can add it to Board meeting agenda using Whimsy 
> [https://whimsy.apache.org/board/agenda/2020-06-17/] and clicking the "Add 
> Item" button and then "Establish Project" button. It's just a form with 
> checkboxes for PMC members. The first PMC member selected becomes the chair.
> The next meeting is June 17th, when Lucene quarterly report is also due.
> !Skjermbilde 2020-06-04 kl. 11.23.53.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9391) Upgrade to HPPC 0.8.2

2020-06-04 Thread Dawid Weiss (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125762#comment-17125762
 ] 

Dawid Weiss commented on LUCENE-9391:
-

I think this is applicable to Solr, not Lucene? Care to provide a patch?

> Upgrade to HPPC 0.8.2
> -
>
> Key: LUCENE-9391
> URL: https://issues.apache.org/jira/browse/LUCENE-9391
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Haoyu Zhai
>Priority: Minor
>
> HPPC 0.8.2 is out and exposes an Accountable-like interface using to estimate 
> the memory usage.
> [https://issues.carrot2.org/secure/ReleaseNote.jspa?projectId=10070&version=13522&styleName=Text]
> We should upgrade to that if any of components using hppc need to estimate 
> memory better.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] markharwood commented on pull request #1496: Bugfix for FuzzyQuery false negative when prefix length == search term length

2020-06-04 Thread GitBox



markharwood commented on pull request #1496:
URL: https://github.com/apache/lucene-solr/pull/1496#issuecomment-638749347


   Closing - addressed instead by 
https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=45611d0



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] markharwood closed pull request #1496: Bugfix for FuzzyQuery false negative when prefix length == search term length

2020-06-04 Thread GitBox



markharwood closed pull request #1496:
URL: https://github.com/apache/lucene-solr/pull/1496


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] ErickErickson closed pull request #1492: SOLR-11934: Visit Solr logging, it's too noisy.

2020-06-04 Thread GitBox



ErickErickson closed pull request #1492:
URL: https://github.com/apache/lucene-solr/pull/1492


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] ErickErickson commented on pull request #1492: SOLR-11934: Visit Solr logging, it's too noisy.

2020-06-04 Thread GitBox



ErickErickson commented on pull request #1492:
URL: https://github.com/apache/lucene-solr/pull/1492#issuecomment-638769279


   Forgot to close it, thanks for pointing it out.
   
   Erick
   
   > On Jun 3, 2020, at 4:41 PM, Mike Drob  wrote:
   > 
   > 
   > @ErickErickson There's no changes here. Stale PR?
   > 
   > —
   > You are receiving this because you were mentioned.
   > Reply to this email directly, view it on GitHub, or unsubscribe.
   > 
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] markharwood commented on pull request #1541: RegExp - add case insensitive matching option

2020-06-04 Thread GitBox



markharwood commented on pull request #1541:
URL: https://github.com/apache/lucene-solr/pull/1541#issuecomment-638776739


   I updated the constructor on this @jpountz - good to go?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14533) Fix or suppress warnings in solr/handler/admin

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125820#comment-17125820
 ] 

ASF subversion and git services commented on SOLR-14533:


Commit bab4fccba22707e254807c8152921da143fb8706 in lucene-solr's branch 
refs/heads/master from Erick Erickson
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=bab4fcc ]

SOLR-14533: Fix or suppress warnings in solr/handler/admin


> Fix or suppress warnings in solr/handler/admin
> --
>
> Key: SOLR-14533
> URL: https://issues.apache.org/jira/browse/SOLR-14533
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
> Attachments: SOLR-14533.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14533) Fix or suppress warnings in solr/handler/admin

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125821#comment-17125821
 ] 

ASF subversion and git services commented on SOLR-14533:


Commit 94916a6a8a1cf38cc03767f3f2f4e2d38a4ad271 in lucene-solr's branch 
refs/heads/branch_8x from Erick Erickson
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=94916a6 ]

SOLR-14533: Fix or suppress warnings in solr/handler/admin


> Fix or suppress warnings in solr/handler/admin
> --
>
> Key: SOLR-14533
> URL: https://issues.apache.org/jira/browse/SOLR-14533
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
> Attachments: SOLR-14533.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14533) Fix or suppress warnings in solr/handler/admin

2020-06-04 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-14533:
--
Attachment: SOLR-14533.patch
Status: Open  (was: Open)

Changed a couple of things:

1> it's rarely (never?) the right thing to do to throw an exception from a test 
method, JUnit will deal with that.

2> AutoCloseable.close(). We crossed wires a bit. And no big deal at all, just 
for future reference. I looked at the JavaDocs for AutoCloseable.close(), and 
it's _strongly_ advised to throw a concrete exception rather than Exception, 
even though that's the method signature for AutoCloseable.close(). I'd already 
changed the method signature for some classes that implement 
AutoCloseable.close (SolrMetricProducer is case in point) to throw an 
IOException rather than Exception. That rippled through a couple of 
try-with-resources blocks and needed adding a catch (IOException) block. This 
is just in case you run into this in future so you won't scratch your head.

3> <2> notwithstanding, I had to make a few changes for the 8x backport because 
the AutoCloseable interface hadn't been added to one of the metrics classes, 
but I'm not going to put up a separate patch for that...


Attached is a patch with the rest of the warnings removed.

Thanks again!

> Fix or suppress warnings in solr/handler/admin
> --
>
> Key: SOLR-14533
> URL: https://issues.apache.org/jira/browse/SOLR-14533
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
> Attachments: SOLR-14533.patch, SOLR-14533.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (SOLR-14533) Fix or suppress warnings in solr/handler/admin

2020-06-04 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-14533.
---
Fix Version/s: 8.6
   Resolution: Fixed

Thanks Andras!

> Fix or suppress warnings in solr/handler/admin
> --
>
> Key: SOLR-14533
> URL: https://issues.apache.org/jira/browse/SOLR-14533
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
> Fix For: 8.6
>
> Attachments: SOLR-14533.patch, SOLR-14533.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (SOLR-14426) forbidden api error during precommit DateMathFunction

2020-06-04 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-14426.
---
Resolution: Fixed

I believe all the package-private classes are now either static nested classes 
or in their own file and this issue won't happen again.

Since nobody objected, I'll close it again. We can reopen if necessary.

> forbidden api error during precommit DateMathFunction
> -
>
> Key: SOLR-14426
> URL: https://issues.apache.org/jira/browse/SOLR-14426
> Project: Solr
>  Issue Type: Bug
>  Components: Build
>Reporter: Mike Drob
>Assignee: Mike Drob
>Priority: Major
> Fix For: master (9.0)
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> When running `./gradlew precommit` I'll occasionally see
> {code}
> * What went wrong:
> Execution failed for task ':solr:contrib:analytics:forbiddenApisMain'.
> > de.thetaphi.forbiddenapis.ForbiddenApiException: Check for forbidden API 
> > calls failed while scanning class 
> > 'org.apache.solr.analytics.function.mapping.DateMathFunction' 
> > (DateMathFunction.java): java.lang.ClassNotFoundException: 
> > org.apache.solr.analytics.function.mapping.DateMathValueFunction (while 
> > looking up details about referenced class 
> > 'org.apache.solr.analytics.function.mapping.DateMathValueFunction')
> {code}
> `./gradlew clean` fixes this, but I don't understand what or why this 
> happens. Feels like a gradle issue?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14532) Add iml file to gitignore

2020-06-04 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125852#comment-17125852
 ] 

Erick Erickson commented on SOLR-14532:
---

[~asalamon74] I don't see this happening. If I do the following:
 - git clean -dxf
 - ./gradlew idea

I do see a file SolrEOEFork.iml, but it's not lagged by Git as being part of 
the repo. In fact if I do an "ant idea" I see a ton of new iml files scattered 
all over the source tree, not only in {{dev-tools}}, none of which are flagged 
by git.

How do reproduce this?

> Add iml file to gitignore
> -
>
> Key: SOLR-14532
> URL: https://issues.apache.org/jira/browse/SOLR-14532
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Andras Salamon
>Priority: Trivial
> Attachments: SOLR-14532.patch
>
>
> If I execute {{gradlew idea}} in my {{lucene-solr-upstream}} directory, it 
> will create three files in the root directory:
> {noformat}
> lucene-solr-upstream.iml
> lucene-solr-upstream.ipr
> lucene-solr-upstream.iws
> {noformat}
> Git will ignore the {{ipr}} and the {{iws}} file, but it lists the iml file 
> as a new file. We should also ignore that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14535) Fix or suppress warnings in apache/solr/handler/component

2020-06-04 Thread Erick Erickson (Jira)

Erick Erickson created SOLR-14535:
-

 Summary: Fix or suppress warnings in apache/solr/handler/component
 Key: SOLR-14535
 URL: https://issues.apache.org/jira/browse/SOLR-14535
 Project: Solr
  Issue Type: Sub-task
Reporter: Erick Erickson
Assignee: Erick Erickson






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (SOLR-14535) Fix or suppress warnings in apache/solr/handler/component, sql and loader

2020-06-04 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-14535:
--
Summary: Fix or suppress warnings in apache/solr/handler/component, sql and 
loader  (was: Fix or suppress warnings in apache/solr/handler/component)

> Fix or suppress warnings in apache/solr/handler/component, sql and loader
> -
>
> Key: SOLR-14535
> URL: https://issues.apache.org/jira/browse/SOLR-14535
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13458) Make Jetty timeouts configurable system wide

2020-06-04 Thread Gus Heck (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125908#comment-17125908
 ] 

Gus Heck commented on SOLR-13458:
-

Not at the time, that particular work got abandoned by the customer soon after 
I wrote this and I didn't dig further, but I'm actually once again bumping into 
timeouts (in a cluster with many billions of docs) so I may soon.

> Make Jetty timeouts configurable system wide
> 
>
> Key: SOLR-13458
> URL: https://issues.apache.org/jira/browse/SOLR-13458
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Affects Versions: master (9.0)
>Reporter: Gus Heck
>Priority: Major
>
> Our jetty container has several timeouts associated with it, and at least one 
> of these is regularly getting in my way (the idle timeout after 120 sec). I 
> tried setting a system property, with no effect and I've tried altering a 
> jetty.xml found at solr-install/solr/server/etc/jetty.xml on all (50) 
> machines and rebooting all servers only to have an exception with the old 120 
> sec timeout still show up. This ticket proposes that these values are by 
> nature "Global System Timeouts" and should be made configurable in solr.xml 
> (which may be difficult because they will be needed early in the boot 
> sequence). 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] msokolov commented on pull request #1543: LUCENE-9378: Disable compression on binary values whose length is less than 32.

2020-06-04 Thread GitBox



msokolov commented on pull request #1543:
URL: https://github.com/apache/lucene-solr/pull/1543#issuecomment-638852369


   Our internal benchmarks show slowdowns even as we increase the threshold 
here, although we can nearly recover (-9% QPS) the previous query-time 
performance disabling below 128 bytes. I'm beginning to wonder what the case is 
for compressing BDV at all? Do we see large BinaryDocValues fields that are 
not, or only rarely, decoded at query time? I think if such fields participate 
in the search in any significant way (ie for all hits), we are going to see 
these slowdowns. Maybe the bytes threshold is really just a proxy for binary 
doc values used as stored fields?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14536) Fix or suppress warnings in apache/solr/common

2020-06-04 Thread Erick Erickson (Jira)

Erick Erickson created SOLR-14536:
-

 Summary: Fix or suppress warnings in apache/solr/common
 Key: SOLR-14536
 URL: https://issues.apache.org/jira/browse/SOLR-14536
 Project: Solr
  Issue Type: Sub-task
Reporter: Erick Erickson
Assignee: Erick Erickson






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14532) Add iml file to gitignore

2020-06-04 Thread Dawid Weiss (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125973#comment-17125973
 ] 

Dawid Weiss commented on SOLR-14532:


If you're using gradle then I'd suggest importing the project into IntelliJ as 
a gradle project (not using the plugin to generate project files). This results 
in directory-based layout (.gradle) rather than file-based layout of IntelliJ 
files.

> Add iml file to gitignore
> -
>
> Key: SOLR-14532
> URL: https://issues.apache.org/jira/browse/SOLR-14532
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Andras Salamon
>Priority: Trivial
> Attachments: SOLR-14532.patch
>
>
> If I execute {{gradlew idea}} in my {{lucene-solr-upstream}} directory, it 
> will create three files in the root directory:
> {noformat}
> lucene-solr-upstream.iml
> lucene-solr-upstream.ipr
> lucene-solr-upstream.iws
> {noformat}
> Git will ignore the {{ipr}} and the {{iws}} file, but it lists the iml file 
> as a new file. We should also ignore that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14532) Add iml file to gitignore

2020-06-04 Thread Andras Salamon (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125980#comment-17125980
 ] 

Andras Salamon commented on SOLR-14532:
---

Here are my reproduction steps:
{noformat}
$ git clone g...@github.com:apache/lucene-solr.git
$ cd lucene-solr
$ git status
# it shows that we have no new files which is OK
$ ./gradlew idea
git status
On branch master
Your branch is up to date with 'origin/master'.

Untracked files:
  (use "git add ..." to include in what will be committed)

lucene-solr.iml

nothing added to commit but untracked files present (use "git add" to 
track){noformat}

So git shows this file as an untracked file. The other two files 
(lucene-solr.ipr, lucene-solr.iws) are ignored so they don't listed as 
untracked files.



> Add iml file to gitignore
> -
>
> Key: SOLR-14532
> URL: https://issues.apache.org/jira/browse/SOLR-14532
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Andras Salamon
>Priority: Trivial
> Attachments: SOLR-14532.patch
>
>
> If I execute {{gradlew idea}} in my {{lucene-solr-upstream}} directory, it 
> will create three files in the root directory:
> {noformat}
> lucene-solr-upstream.iml
> lucene-solr-upstream.ipr
> lucene-solr-upstream.iws
> {noformat}
> Git will ignore the {{ipr}} and the {{iws}} file, but it lists the iml file 
> as a new file. We should also ignore that one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14523) Enhance gradle logging calls validation: eliminate getMessage()

2020-06-04 Thread Andras Salamon (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125988#comment-17125988
 ] 

Andras Salamon commented on SOLR-14523:
---

[~erickerickson] I followed your description it was working correctly. Thx. 
I'll start on the fix.

> Enhance gradle logging calls validation: eliminate getMessage()
> ---
>
> Key: SOLR-14523
> URL: https://issues.apache.org/jira/browse/SOLR-14523
> Project: Solr
>  Issue Type: Improvement
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Andras Salamon
>Assignee: Erick Erickson
>Priority: Minor
> Attachments: validation.patch
>
>
> SOLR-14280 fixed a logging problem in SolrConfig by removing a few 
> getMessage() calls. We could enhance this solution by modifying gradle's 
> logging calls validation and forbid getMessage() calls during logging. We 
> should check the existing code and eliminate such calls.
> It is possible to suppress the warning using {{//logok}}.
> [~erickerickson] [~gerlowskija]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9389) Enhance gradle logging calls validation: eliminate getMessage()

2020-06-04 Thread Andras Salamon (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17125990#comment-17125990
 ] 

Andras Salamon commented on LUCENE-9389:


LUCENE-9382 turned off log validation for Lucene now this is only called for 
Solr. So even if I apply the patch provided in SOLR-14523 it will not list 
Lucene problems.

It is still useful to fix the Lucene log problems but without the checker they 
will come back eventually.

> Enhance gradle logging calls validation: eliminate getMessage()
> ---
>
> Key: LUCENE-9389
> URL: https://issues.apache.org/jira/browse/LUCENE-9389
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Andras Salamon
>Assignee: Erick Erickson
>Priority: Minor
>
> SOLR-14280 fixed a logging problem in SolrConfig by removing a few 
> getMessage() calls. We could enhance this solution by modifying gradle's 
> logging calls validation and forbid getMessage() calls during logging. We 
> should check the existing code and eliminate such calls.
> It is possible to suppress the warning using {{//logok}}.
> [~erickerickson] [~gerlowskija]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] jpountz commented on pull request #1543: LUCENE-9378: Disable compression on binary values whose length is less than 32.

2020-06-04 Thread GitBox



jpountz commented on pull request #1543:
URL: https://github.com/apache/lucene-solr/pull/1543#issuecomment-638908545


   Hi @msokolov I'm looking into improving the encoding of lengths, which is 
the next bottleneck for binary doc values. We are using binary doc values to 
run regex queries on a high cardinality field. A ngram index helps find good 
candidates and binary doc values are then used for verification. Field values 
are typically files, URLs, ... which can have significant redundancy. I'm ok 
with making compression less aggressive though I think that it would be a pity 
to disable it entirely and never take advantage of redundancy. You mentioned 
slowdowns, but this actually depends on the query, e.g. I'm seeing an almost 2x 
speedup when sorting a `MatchAllDocsQuery` on `wikimedium10m`. Let me give a 
try at making compression less aggressive/slow?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9391) Upgrade to HPPC 0.8.2

2020-06-04 Thread Michael McCandless (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126013#comment-17126013
 ] 

Michael McCandless commented on LUCENE-9391:


Let's also upgrade Lucene facet module?  Not sure if there are any other Lucene 
modules consuming HPPC?

> Upgrade to HPPC 0.8.2
> -
>
> Key: LUCENE-9391
> URL: https://issues.apache.org/jira/browse/LUCENE-9391
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Haoyu Zhai
>Priority: Minor
>
> HPPC 0.8.2 is out and exposes an Accountable-like interface using to estimate 
> the memory usage.
> [https://issues.carrot2.org/secure/ReleaseNote.jspa?projectId=10070&version=13522&styleName=Text]
> We should upgrade to that if any of components using hppc need to estimate 
> memory better.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14419) Query DLS {"param":"ref"}

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126045#comment-17126045
 ] 

ASF subversion and git services commented on SOLR-14419:


Commit 7c55ba954763d2e4d554992542a814fb438576b3 in lucene-solr's branch 
refs/heads/master from Mikhail Khludnev
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=7c55ba9 ]

SOLR-14419: Ref Guide update for {ref:param}


> Query DLS {"param":"ref"}
> -
>
> Key: SOLR-14419
> URL: https://issues.apache.org/jira/browse/SOLR-14419
> Project: Solr
>  Issue Type: Improvement
>  Components: JSON Request API
>Reporter: Mikhail Khludnev
>Assignee: Mikhail Khludnev
>Priority: Major
> Fix For: 8.6
>
> Attachments: SOLR-14419-refguide.patch, SOLR-14419.patch, 
> SOLR-14419.patch, SOLR-14419.patch
>
>
> What we can do with plain params: 
> {{q=\{!parent which=$prnts}...&prnts=type:parent}}
> obviously I want to have something like this in Query DSL:
> {code}
> { "query": { "parents":{ "which":{"param":"prnts"}, "query":"..."}}
>   "params": {
>   "prnts":"type:parent"
>}
> }
> {code} 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14419) Query DLS {"param":"ref"}

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126055#comment-17126055
 ] 

ASF subversion and git services commented on SOLR-14419:


Commit 3bbb8e3d4168c3d84658438e10fadf98da08d006 in lucene-solr's branch 
refs/heads/branch_8x from Mikhail Khludnev
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=3bbb8e3 ]

SOLR-14419: Ref Guide update for {ref:param}


> Query DLS {"param":"ref"}
> -
>
> Key: SOLR-14419
> URL: https://issues.apache.org/jira/browse/SOLR-14419
> Project: Solr
>  Issue Type: Improvement
>  Components: JSON Request API
>Reporter: Mikhail Khludnev
>Assignee: Mikhail Khludnev
>Priority: Major
> Fix For: 8.6
>
> Attachments: SOLR-14419-refguide.patch, SOLR-14419.patch, 
> SOLR-14419.patch, SOLR-14419.patch
>
>
> What we can do with plain params: 
> {{q=\{!parent which=$prnts}...&prnts=type:parent}}
> obviously I want to have something like this in Query DSL:
> {code}
> { "query": { "parents":{ "which":{"param":"prnts"}, "query":"..."}}
>   "params": {
>   "prnts":"type:parent"
>}
> }
> {code} 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] NazerkeBS commented on a change in pull request #1527: SOLR-14384 Stack SolrRequestInfo

2020-06-04 Thread GitBox



NazerkeBS commented on a change in pull request #1527:
URL: https://github.com/apache/lucene-solr/pull/1527#discussion_r435401394



##
File path: solr/core/src/java/org/apache/solr/request/SolrRequestInfo.java
##
@@ -52,35 +56,60 @@
   private static final Logger log = 
LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
 
   public static SolrRequestInfo getRequestInfo() {
-return threadLocal.get();
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) return null;
+return stack.peek();
   }
 
+  /** Adds the SolrRequestInfo onto the stack provided that the stack is not 
reached MAX_STACK_SIZE */
   public static void setRequestInfo(SolrRequestInfo info) {
-// TODO: temporary sanity check... this can be changed to just an assert 
in the future
-SolrRequestInfo prev = threadLocal.get();
-if (prev != null) {
-  log.error("Previous SolrRequestInfo was not closed!  req={}", 
prev.req.getOriginalParams());
-  log.error("prev == info : {}", prev.req == info.req, new 
RuntimeException());
+Deque stack = threadLocal.get();
+if (info == null) {
+  throw new IllegalArgumentException("SolrRequestInfo is null");
+} else {
+  if (stack.size() <= MAX_STACK_SIZE) {
+stack.push(info);
+  } else {
+assert true : "SolrRequestInfo Stack is full";
+log.error("SolrRequestInfo Stack is full");
+  }
 }
-assert prev == null;
-
-threadLocal.set(info);
   }
 
+  /** Removes the most recent SolrRequestInfo from the stack */
   public static void clearRequestInfo() {
-try {
-  SolrRequestInfo info = threadLocal.get();
-  if (info != null && info.closeHooks != null) {
-for (Closeable hook : info.closeHooks) {
-  try {
-hook.close();
-  } catch (Exception e) {
-SolrException.log(log, "Exception during close hook", e);
-  }
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) {
+  log.error("clearRequestInfo called too many times");
+} else {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+  }
+
+  /**
+   * This reset method is more of a protection mechanism as
+   * we expect it to be empty by now because all "set" calls need to be 
balanced with a "clear".
+   */
+  public static void reset() {
+Deque stack = threadLocal.get();
+boolean isEmpty = stack.isEmpty();
+while (!stack.isEmpty()) {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+assert isEmpty : "SolrRequestInfo Stack should have been cleared.";
+  }
+
+  private static void closeHooks(SolrRequestInfo info) {
+if (info != null && info.closeHooks != null) {

Review comment:
   it can be null; when getRequestInfo is called, in case of the 
stack.isEmpty(), it returns null;





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] NazerkeBS commented on a change in pull request #1527: SOLR-14384 Stack SolrRequestInfo

2020-06-04 Thread GitBox



NazerkeBS commented on a change in pull request #1527:
URL: https://github.com/apache/lucene-solr/pull/1527#discussion_r435401394



##
File path: solr/core/src/java/org/apache/solr/request/SolrRequestInfo.java
##
@@ -52,35 +56,60 @@
   private static final Logger log = 
LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
 
   public static SolrRequestInfo getRequestInfo() {
-return threadLocal.get();
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) return null;
+return stack.peek();
   }
 
+  /** Adds the SolrRequestInfo onto the stack provided that the stack is not 
reached MAX_STACK_SIZE */
   public static void setRequestInfo(SolrRequestInfo info) {
-// TODO: temporary sanity check... this can be changed to just an assert 
in the future
-SolrRequestInfo prev = threadLocal.get();
-if (prev != null) {
-  log.error("Previous SolrRequestInfo was not closed!  req={}", 
prev.req.getOriginalParams());
-  log.error("prev == info : {}", prev.req == info.req, new 
RuntimeException());
+Deque stack = threadLocal.get();
+if (info == null) {
+  throw new IllegalArgumentException("SolrRequestInfo is null");
+} else {
+  if (stack.size() <= MAX_STACK_SIZE) {
+stack.push(info);
+  } else {
+assert true : "SolrRequestInfo Stack is full";
+log.error("SolrRequestInfo Stack is full");
+  }
 }
-assert prev == null;
-
-threadLocal.set(info);
   }
 
+  /** Removes the most recent SolrRequestInfo from the stack */
   public static void clearRequestInfo() {
-try {
-  SolrRequestInfo info = threadLocal.get();
-  if (info != null && info.closeHooks != null) {
-for (Closeable hook : info.closeHooks) {
-  try {
-hook.close();
-  } catch (Exception e) {
-SolrException.log(log, "Exception during close hook", e);
-  }
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) {
+  log.error("clearRequestInfo called too many times");
+} else {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+  }
+
+  /**
+   * This reset method is more of a protection mechanism as
+   * we expect it to be empty by now because all "set" calls need to be 
balanced with a "clear".
+   */
+  public static void reset() {
+Deque stack = threadLocal.get();
+boolean isEmpty = stack.isEmpty();
+while (!stack.isEmpty()) {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+assert isEmpty : "SolrRequestInfo Stack should have been cleared.";
+  }
+
+  private static void closeHooks(SolrRequestInfo info) {
+if (info != null && info.closeHooks != null) {

Review comment:
   it can be null; when getRequestInfo is called, if the stack.isEmpty(), 
it returns null;





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] dsmiley commented on a change in pull request #1527: SOLR-14384 Stack SolrRequestInfo

2020-06-04 Thread GitBox



dsmiley commented on a change in pull request #1527:
URL: https://github.com/apache/lucene-solr/pull/1527#discussion_r435405488



##
File path: solr/core/src/java/org/apache/solr/request/SolrRequestInfo.java
##
@@ -52,35 +56,60 @@
   private static final Logger log = 
LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
 
   public static SolrRequestInfo getRequestInfo() {
-return threadLocal.get();
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) return null;
+return stack.peek();
   }
 
+  /** Adds the SolrRequestInfo onto the stack provided that the stack is not 
reached MAX_STACK_SIZE */
   public static void setRequestInfo(SolrRequestInfo info) {
-// TODO: temporary sanity check... this can be changed to just an assert 
in the future
-SolrRequestInfo prev = threadLocal.get();
-if (prev != null) {
-  log.error("Previous SolrRequestInfo was not closed!  req={}", 
prev.req.getOriginalParams());
-  log.error("prev == info : {}", prev.req == info.req, new 
RuntimeException());
+Deque stack = threadLocal.get();
+if (info == null) {
+  throw new IllegalArgumentException("SolrRequestInfo is null");
+} else {
+  if (stack.size() <= MAX_STACK_SIZE) {
+stack.push(info);
+  } else {
+assert true : "SolrRequestInfo Stack is full";
+log.error("SolrRequestInfo Stack is full");
+  }
 }
-assert prev == null;
-
-threadLocal.set(info);
   }
 
+  /** Removes the most recent SolrRequestInfo from the stack */
   public static void clearRequestInfo() {
-try {
-  SolrRequestInfo info = threadLocal.get();
-  if (info != null && info.closeHooks != null) {
-for (Closeable hook : info.closeHooks) {
-  try {
-hook.close();
-  } catch (Exception e) {
-SolrException.log(log, "Exception during close hook", e);
-  }
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) {
+  log.error("clearRequestInfo called too many times");
+} else {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+  }
+
+  /**
+   * This reset method is more of a protection mechanism as
+   * we expect it to be empty by now because all "set" calls need to be 
balanced with a "clear".
+   */
+  public static void reset() {
+Deque stack = threadLocal.get();
+boolean isEmpty = stack.isEmpty();
+while (!stack.isEmpty()) {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+assert isEmpty : "SolrRequestInfo Stack should have been cleared.";
+  }
+
+  private static void closeHooks(SolrRequestInfo info) {
+if (info != null && info.closeHooks != null) {

Review comment:
   The callers of this method will not pass null though; right?

##
File path: 
solr/core/src/java/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.java
##
@@ -271,7 +273,9 @@ public void writeResults(ResultContext ctx, JavaBinCodec 
codec) throws IOExcepti
   throw new SolrServerException(ex);
 } finally {
   if (req != null) req.close();
-  SolrRequestInfo.clearRequestInfo();
+  if (mustClearSolrRequestInfo) {

Review comment:
   Hmm; might it simply be enough to call clearRequestInfo if req != null, 
thus no new boolean?

##
File path: 
solr/core/src/java/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.java
##
@@ -71,6 +71,7 @@
   protected final String coreName;
   private final SolrRequestParsers _parser;
   private final RequestWriterSupplier supplier;
+  private boolean mustClearSolrRequestInfo = false;

Review comment:
   This change is highly suspicious.  Why is this boolean now a field of 
EmbeddedSolrServer?  It doesn't seem like *state* of this object and so I don't 
think it belongs here.  It seems like a transient status in the course of work 
within the method that sets/clears the requestInfo.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14537) Improve performance of ExportWriter

2020-06-04 Thread Andrzej Bialecki (Jira)

Andrzej Bialecki created SOLR-14537:
---

 Summary: Improve performance of ExportWriter
 Key: SOLR-14537
 URL: https://issues.apache.org/jira/browse/SOLR-14537
 Project: Solr
  Issue Type: Improvement
  Security Level: Public (Default Security Level. Issues are Public)
  Components: Export Writer
Reporter: Andrzej Bialecki


Retrieving, sorting and writing out documents in {{ExportWriter}} are three 
aspects of the /export handler that can be further optimized.

SOLR-14470 introduced some level of caching in {{StringValue}}. Further options 
for caching and speedups should be explored.

Currently the sort/retrieve and write operations are done sequentially, but 
they could be parallelized, considering that they block on different channels - 
the first is index reading & CPU bound, the other is bound by the receiving end 
because it uses blocking IO. The sorting and retrieving of values could be done 
in parallel with the operation of writing out the current batch of results.

One possible approach here would be to use "double buffering" where one 
buffered batch that is ready (already sorted and retrieved) is being written 
out, while the other batch is being prepared in a background thread, and when 
both are done the buffers are swapped. This wouldn't complicate the current 
code too much but it should instantly give up to 2x higher throughput.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14408) Refactor MoreLikeThisHandler Implementation

2020-06-04 Thread Nazerke Seidan (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126088#comment-17126088
 ] 

Nazerke Seidan commented on SOLR-14408:
---

[~alessandro.benedetti], wondering if you have taken a look at the PR. 

> Refactor MoreLikeThisHandler Implementation
> ---
>
> Key: SOLR-14408
> URL: https://issues.apache.org/jira/browse/SOLR-14408
> Project: Solr
>  Issue Type: Improvement
>  Components: MoreLikeThis
>Reporter: Nazerke Seidan
>Priority: Minor
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> The main goal of this refactoring is for readability and accessibility of 
> MoreLikeThisHandler class. Current MoreLikeThisHandler class consists of two 
> static subclasses and accessing them later in MoreLikeThisComponent.  I 
> propose to have them as separate public classes. 
> cc: [~abenedetti], as you have had the recent commit for MLT, what do you 
> think about this?  Anyway, the code is ready for review. 
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14467) inconsistent server errors combining relatedness() with allBuckets:true

2020-06-04 Thread Michael Gibney (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126126#comment-17126126
 ] 

Michael Gibney commented on SOLR-14467:
---

Yes, this looks good to me! Integrating the "implied" concept more generally 
and not as an explicit special case makes sense.

> inconsistent server errors combining relatedness() with allBuckets:true
> ---
>
> Key: SOLR-14467
> URL: https://issues.apache.org/jira/browse/SOLR-14467
> Project: Solr
>  Issue Type: Bug
>  Components: Facet Module
>Reporter: Chris M. Hostetter
>Priority: Major
> Attachments: SOLR-14467.patch, SOLR-14467.patch, SOLR-14467.patch, 
> SOLR-14467.patch, SOLR-14467_allBuckets_refine.patch, SOLR-14467_test.patch, 
> SOLR-14467_test.patch, beast.log.txt, beast2.log.txt
>
>
> While working on randomized testing for SOLR-13132 i discovered a variety of 
> different ways that JSON Faceting's "allBuckets" option can fail when 
> combined with the "relatedness()" function.
> I haven't found a trivial way to manual reproduce this, but i have been able 
> to trigger the failures with a trivial patch to {{TestCloudJSONFacetSKG}} 
> which i will attach.
> Based on the nature of the failures it looks like it may have something to do 
> with multiple segments of different sizes, and or resizing the SlotAccs ?
> The relatedness() function doesn't have much (any?) existing tests in place 
> that leverage "allBuckets" so this is probably a bug that has always existed 
> -- it's possible it may be excessively cumbersome to fix and we might 
> nee/wnat to just document that incompatibility and add some code to try and 
> detect if the user combines these options and if so fail with a 400 error?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9390) Kuromoji tokenizer discards tokens if they start with a punctuation character

2020-06-04 Thread Jim Ferenczi (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126141#comment-17126141
 ] 

Jim Ferenczi commented on LUCENE-9390:
--

> I usually set the "discardPunctuation" flag to False to avoid such subtle 
>situation.

 

I thought that discardPunctuation set to false was relevant only in the context 
of the JapaneseNumberFilter.

The `icu` tokenizer removes the punctuation for instance so I am not sure it 
should be the default. *_（株）_*is kind of special since the parenthesis are 
required so it shouldn't need a breaking change to preserve this term in the 
Japanese tokenizer ?

 

> Kuromoji tokenizer discards tokens if they start with a punctuation character
> -
>
> Key: LUCENE-9390
> URL: https://issues.apache.org/jira/browse/LUCENE-9390
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Jim Ferenczi
>Priority: Minor
>
> This issue was first raised in Elasticsearch 
> [here|https://github.com/elastic/elasticsearch/issues/57614]
> The unidic dictionary that is used by the Kuromoji tokenizer contains entries 
> that mix punctuations and other characters. For instance the following entry:
> _（株）,1285,1285,3690,名詞,一般,*,*,*,*,（株）,カブシキガイシャ,カブシキガイシャ_
> can be found in the Noun.csv file.
> Today, tokens that start with punctuations are automatically removed by 
> default (discardPunctuation  is true). I think the code was written this way 
> because we expect punctuations to be separated from normal tokens but there 
> are exceptions in the original dictionary. Maybe we should check the entire 
> token when discarding punctuations ?
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13458) Make Jetty timeouts configurable system wide

2020-06-04 Thread Alexander Zhideev (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126148#comment-17126148
 ] 

Alexander Zhideev commented on SOLR-13458:
--

Thanks for your response. Similar is happening here - collections grow by many 
millions per day and total volume is over 3 Billion docs. So far no luck on 
fixing the timeouts that come and go. One approach which seems to sort of work 
is by using commitWithinMs along with also allowing some time for cool down 
between queries by not sending requests to Solr and then retrying on Timeout 
exception. It literally throws 1 timeout after 2 mins then after a few minutes 
of cool down period same query runs for 4-5 mins and returns just fine with no 
issue 

> Make Jetty timeouts configurable system wide
> 
>
> Key: SOLR-13458
> URL: https://issues.apache.org/jira/browse/SOLR-13458
> Project: Solr
>  Issue Type: Sub-task
>  Components: SolrCloud
>Affects Versions: master (9.0)
>Reporter: Gus Heck
>Priority: Major
>
> Our jetty container has several timeouts associated with it, and at least one 
> of these is regularly getting in my way (the idle timeout after 120 sec). I 
> tried setting a system property, with no effect and I've tried altering a 
> jetty.xml found at solr-install/solr/server/etc/jetty.xml on all (50) 
> machines and rebooting all servers only to have an exception with the old 120 
> sec timeout still show up. This ticket proposes that these values are by 
> nature "Global System Timeouts" and should be made configurable in solr.xml 
> (which may be difficult because they will be needed early in the boot 
> sequence). 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9389) Enhance gradle logging calls validation: eliminate getMessage()

2020-06-04 Thread Erick Erickson (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126152#comment-17126152
 ] 

Erick Erickson commented on LUCENE-9389:


The only actual logging calls in Lucene are in the Luke module, so the Lucene 
folks decided against keeping the check so we just don't need to worry about 
it...

> Enhance gradle logging calls validation: eliminate getMessage()
> ---
>
> Key: LUCENE-9389
> URL: https://issues.apache.org/jira/browse/LUCENE-9389
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Andras Salamon
>Assignee: Erick Erickson
>Priority: Minor
>
> SOLR-14280 fixed a logging problem in SolrConfig by removing a few 
> getMessage() calls. We could enhance this solution by modifying gradle's 
> logging calls validation and forbid getMessage() calls during logging. We 
> should check the existing code and eliminate such calls.
> It is possible to suppress the warning using {{//logok}}.
> [~erickerickson] [~gerlowskija]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (LUCENE-9392) Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR to public

2020-06-04 Thread Ankur (Jira)

Ankur created LUCENE-9392:
-

 Summary: Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR 
to public 
 Key: LUCENE-9392
 URL: https://issues.apache.org/jira/browse/LUCENE-9392
 Project: Lucene - Core
  Issue Type: Improvement
Affects Versions: 8.5.2
Reporter: Ankur


FacetsConfig.DELIM_CHAR is marked as private.  An application that wants to use 
this delimiter (in a unit test for example) is forced to re-declare it in the 
application code. This can break the application if DELIM_CHAR is changed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Updated] (LUCENE-9392) Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR to public

2020-06-04 Thread Ankur (Jira)



 [ 
https://issues.apache.org/jira/browse/LUCENE-9392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ankur updated LUCENE-9392:
--
Description: FacetsConfig.DELIM_CHAR is marked as private.  An application 
that wants to use this delimiter (in a unit test for example) is forced to 
re-declare it in the application code. This can break the application if tetshe 
value of DELIM_CHAR is changed in FacetsConfig  (was: FacetsConfig.DELIM_CHAR 
is marked as private.  An application that wants to use this delimiter (in a 
unit test for example) is forced to re-declare it in the application code. This 
can break the application if DELIM_CHAR is changed.)

> Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR to public 
> ---
>
> Key: LUCENE-9392
> URL: https://issues.apache.org/jira/browse/LUCENE-9392
> Project: Lucene - Core
>  Issue Type: Improvement
>Affects Versions: 8.5.2
>Reporter: Ankur
>Priority: Minor
>
> FacetsConfig.DELIM_CHAR is marked as private.  An application that wants to 
> use this delimiter (in a unit test for example) is forced to re-declare it in 
> the application code. This can break the application if tetshe value of 
> DELIM_CHAR is changed in FacetsConfig



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] NazerkeBS commented on a change in pull request #1527: SOLR-14384 Stack SolrRequestInfo

2020-06-04 Thread GitBox



NazerkeBS commented on a change in pull request #1527:
URL: https://github.com/apache/lucene-solr/pull/1527#discussion_r435544642



##
File path: 
solr/core/src/java/org/apache/solr/client/solrj/embedded/EmbeddedSolrServer.java
##
@@ -271,7 +273,9 @@ public void writeResults(ResultContext ctx, JavaBinCodec 
codec) throws IOExcepti
   throw new SolrServerException(ex);
 } finally {
   if (req != null) req.close();
-  SolrRequestInfo.clearRequestInfo();
+  if (mustClearSolrRequestInfo) {

Review comment:
   actually that's enough





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] NazerkeBS commented on a change in pull request #1527: SOLR-14384 Stack SolrRequestInfo

2020-06-04 Thread GitBox



NazerkeBS commented on a change in pull request #1527:
URL: https://github.com/apache/lucene-solr/pull/1527#discussion_r435545076



##
File path: solr/core/src/java/org/apache/solr/request/SolrRequestInfo.java
##
@@ -52,35 +56,60 @@
   private static final Logger log = 
LoggerFactory.getLogger(MethodHandles.lookup().lookupClass());
 
   public static SolrRequestInfo getRequestInfo() {
-return threadLocal.get();
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) return null;
+return stack.peek();
   }
 
+  /** Adds the SolrRequestInfo onto the stack provided that the stack is not 
reached MAX_STACK_SIZE */
   public static void setRequestInfo(SolrRequestInfo info) {
-// TODO: temporary sanity check... this can be changed to just an assert 
in the future
-SolrRequestInfo prev = threadLocal.get();
-if (prev != null) {
-  log.error("Previous SolrRequestInfo was not closed!  req={}", 
prev.req.getOriginalParams());
-  log.error("prev == info : {}", prev.req == info.req, new 
RuntimeException());
+Deque stack = threadLocal.get();
+if (info == null) {
+  throw new IllegalArgumentException("SolrRequestInfo is null");
+} else {
+  if (stack.size() <= MAX_STACK_SIZE) {
+stack.push(info);
+  } else {
+assert true : "SolrRequestInfo Stack is full";
+log.error("SolrRequestInfo Stack is full");
+  }
 }
-assert prev == null;
-
-threadLocal.set(info);
   }
 
+  /** Removes the most recent SolrRequestInfo from the stack */
   public static void clearRequestInfo() {
-try {
-  SolrRequestInfo info = threadLocal.get();
-  if (info != null && info.closeHooks != null) {
-for (Closeable hook : info.closeHooks) {
-  try {
-hook.close();
-  } catch (Exception e) {
-SolrException.log(log, "Exception during close hook", e);
-  }
+Deque stack = threadLocal.get();
+if (stack.isEmpty()) {
+  log.error("clearRequestInfo called too many times");
+} else {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+  }
+
+  /**
+   * This reset method is more of a protection mechanism as
+   * we expect it to be empty by now because all "set" calls need to be 
balanced with a "clear".
+   */
+  public static void reset() {
+Deque stack = threadLocal.get();
+boolean isEmpty = stack.isEmpty();
+while (!stack.isEmpty()) {
+  SolrRequestInfo info = stack.pop();
+  closeHooks(info);
+}
+assert isEmpty : "SolrRequestInfo Stack should have been cleared.";
+  }
+
+  private static void closeHooks(SolrRequestInfo info) {
+if (info != null && info.closeHooks != null) {

Review comment:
   in fact, it's true; as it is a private method and called inside 
SolrRequestInfo





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] madrob merged pull request #1548: SOLR-14524: Harden MultiThreadedOCPTest testFillWorkQueue()

2020-06-04 Thread GitBox



madrob merged pull request #1548:
URL: https://github.com/apache/lucene-solr/pull/1548


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14524) Harden MultiThreadedOCPTest

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126202#comment-17126202
 ] 

ASF subversion and git services commented on SOLR-14524:


Commit dec692252874a5b60dfb71d8c38643a285975b22 in lucene-solr's branch 
refs/heads/master from murblanc
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=dec6922 ]

SOLR-14524: Harden MultiThreadedOCPTest testFillWorkQueue() (#1548)

Make MultiThreadedOCPTest.testFillWorkQueue() less vulnerable to timing issues

Co-authored-by: Ilan Ginzburg 

> Harden MultiThreadedOCPTest
> ---
>
> Key: SOLR-14524
> URL: https://issues.apache.org/jira/browse/SOLR-14524
> Project: Solr
>  Issue Type: Test
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrCloud
>Affects Versions: master (9.0)
>Reporter: Ilan Ginzburg
>Priority: Minor
>  Labels: test
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> {{MultiThreadedOCPTest.test()}} fails occasionally in Jenkins because of 
> timing of tasks enqueue to the Collection API queue.
> This test in {{testFillWorkQueue()}} enqueues a large number of tasks (115, 
> more than the 100 Collection API parallel executors) to the Collection API 
> queue for a collection COLL_A, then observes a short delay and enqueues a 
> task for another collection COLL_B.
>  It verifies that the COLL_B task (that does not require the same lock as the 
> COLL_A tasks) completes before the third COLL_A task.
> Test failures happen because when enqueues are slowed down enough, the first 
> 3 tasks on COLL_A complete even before the COLL_B task gets enqueued!
> In one sample failed Jenkins test execution, the COLL_B task enqueue happened 
> 1275ms after the enqueue of the first COLL_A, leaving plenty of time for a 
> few (and possibly all) COLL_A tasks to complete.
> Fix will be along the lines of:
>  * Make the “blocking” COLL_A task longer to execute (currently 1 second) to 
> compensate for slow enqueues.
>  * Verify the COLL_B task (a 1ms task) finishes before the long running 
> COLL_A task does. This would be a good indication that even though the 
> collection queue was filled with tasks waiting for a busy lock, a non 
> competing task was picked and executed right away.
>  * Delay the enqueue of the COLL_B task to the end of processing of the first 
> COLL_A task. This would guarantee that COLL_B is enqueued once at least some 
> COLL_A tasks started processing at the Overseer. Possibly also verify that 
> the long running task of COLL_A didn't finish execution yet when the COLL_B 
> task is enqueued...
>  * It might be possible to set a (very) long duration for the slow task of 
> COLL_A (to be less vulnerable to execution delays) without requiring the test 
> to wait for that task to complete, but only wait for the COLL_B task to 
> complete (so the test doesn't run for too long).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (SOLR-14524) Harden MultiThreadedOCPTest

2020-06-04 Thread Mike Drob (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Drob resolved SOLR-14524.
--
Fix Version/s: master (9.0)
 Assignee: Mike Drob
   Resolution: Fixed

Thanks for the fix, [~murblanc]!

> Harden MultiThreadedOCPTest
> ---
>
> Key: SOLR-14524
> URL: https://issues.apache.org/jira/browse/SOLR-14524
> Project: Solr
>  Issue Type: Test
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrCloud
>Affects Versions: master (9.0)
>Reporter: Ilan Ginzburg
>Assignee: Mike Drob
>Priority: Minor
>  Labels: test
> Fix For: master (9.0)
>
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> {{MultiThreadedOCPTest.test()}} fails occasionally in Jenkins because of 
> timing of tasks enqueue to the Collection API queue.
> This test in {{testFillWorkQueue()}} enqueues a large number of tasks (115, 
> more than the 100 Collection API parallel executors) to the Collection API 
> queue for a collection COLL_A, then observes a short delay and enqueues a 
> task for another collection COLL_B.
>  It verifies that the COLL_B task (that does not require the same lock as the 
> COLL_A tasks) completes before the third COLL_A task.
> Test failures happen because when enqueues are slowed down enough, the first 
> 3 tasks on COLL_A complete even before the COLL_B task gets enqueued!
> In one sample failed Jenkins test execution, the COLL_B task enqueue happened 
> 1275ms after the enqueue of the first COLL_A, leaving plenty of time for a 
> few (and possibly all) COLL_A tasks to complete.
> Fix will be along the lines of:
>  * Make the “blocking” COLL_A task longer to execute (currently 1 second) to 
> compensate for slow enqueues.
>  * Verify the COLL_B task (a 1ms task) finishes before the long running 
> COLL_A task does. This would be a good indication that even though the 
> collection queue was filled with tasks waiting for a busy lock, a non 
> competing task was picked and executed right away.
>  * Delay the enqueue of the COLL_B task to the end of processing of the first 
> COLL_A task. This would guarantee that COLL_B is enqueued once at least some 
> COLL_A tasks started processing at the Overseer. Possibly also verify that 
> the long running task of COLL_A didn't finish execution yet when the COLL_B 
> task is enqueued...
>  * It might be possible to set a (very) long duration for the slow task of 
> COLL_A (to be less vulnerable to execution delays) without requiring the test 
> to wait for that task to complete, but only wait for the COLL_B task to 
> complete (so the test doesn't run for too long).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14535) Fix or suppress warnings in apache/solr/handler/component, sql and loader

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126208#comment-17126208
 ] 

ASF subversion and git services commented on SOLR-14535:


Commit bf7db012ec31e6ef553b88abd2c2f75dc0298615 in lucene-solr's branch 
refs/heads/branch_8x from Erick Erickson
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=bf7db01 ]

SOLR-14535: Fix or suppress warnings in apache/solr/handler/component, sql and 
loader


> Fix or suppress warnings in apache/solr/handler/component, sql and loader
> -
>
> Key: SOLR-14535
> URL: https://issues.apache.org/jira/browse/SOLR-14535
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14535) Fix or suppress warnings in apache/solr/handler/component, sql and loader

2020-06-04 Thread ASF subversion and git services (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126210#comment-17126210
 ] 

ASF subversion and git services commented on SOLR-14535:


Commit 0c4d8fb116f7dec83a04d27df550640dcec4eb8d in lucene-solr's branch 
refs/heads/master from Erick Erickson
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=0c4d8fb ]

SOLR-14535: Fix or suppress warnings in apache/solr/handler/component, sql and 
loader


> Fix or suppress warnings in apache/solr/handler/component, sql and loader
> -
>
> Key: SOLR-14535
> URL: https://issues.apache.org/jira/browse/SOLR-14535
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Resolved] (SOLR-14535) Fix or suppress warnings in apache/solr/handler/component, sql and loader

2020-06-04 Thread Erick Erickson (Jira)



 [ 
https://issues.apache.org/jira/browse/SOLR-14535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-14535.
---
Fix Version/s: 8.6
   Resolution: Fixed

> Fix or suppress warnings in apache/solr/handler/component, sql and loader
> -
>
> Key: SOLR-14535
> URL: https://issues.apache.org/jira/browse/SOLR-14535
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Erick Erickson
>Assignee: Erick Erickson
>Priority: Major
> Fix For: 8.6
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-14524) Harden MultiThreadedOCPTest

2020-06-04 Thread Ilan Ginzburg (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-14524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126221#comment-17126221
 ] 

Ilan Ginzburg commented on SOLR-14524:
--

Thanks for the review and for merging it [~mdrob].

> Harden MultiThreadedOCPTest
> ---
>
> Key: SOLR-14524
> URL: https://issues.apache.org/jira/browse/SOLR-14524
> Project: Solr
>  Issue Type: Test
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: SolrCloud
>Affects Versions: master (9.0)
>Reporter: Ilan Ginzburg
>Assignee: Mike Drob
>Priority: Minor
>  Labels: test
> Fix For: master (9.0)
>
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> {{MultiThreadedOCPTest.test()}} fails occasionally in Jenkins because of 
> timing of tasks enqueue to the Collection API queue.
> This test in {{testFillWorkQueue()}} enqueues a large number of tasks (115, 
> more than the 100 Collection API parallel executors) to the Collection API 
> queue for a collection COLL_A, then observes a short delay and enqueues a 
> task for another collection COLL_B.
>  It verifies that the COLL_B task (that does not require the same lock as the 
> COLL_A tasks) completes before the third COLL_A task.
> Test failures happen because when enqueues are slowed down enough, the first 
> 3 tasks on COLL_A complete even before the COLL_B task gets enqueued!
> In one sample failed Jenkins test execution, the COLL_B task enqueue happened 
> 1275ms after the enqueue of the first COLL_A, leaving plenty of time for a 
> few (and possibly all) COLL_A tasks to complete.
> Fix will be along the lines of:
>  * Make the “blocking” COLL_A task longer to execute (currently 1 second) to 
> compensate for slow enqueues.
>  * Verify the COLL_B task (a 1ms task) finishes before the long running 
> COLL_A task does. This would be a good indication that even though the 
> collection queue was filled with tasks waiting for a busy lock, a non 
> competing task was picked and executed right away.
>  * Delay the enqueue of the COLL_B task to the end of processing of the first 
> COLL_A task. This would guarantee that COLL_B is enqueued once at least some 
> COLL_A tasks started processing at the Overseer. Possibly also verify that 
> the long running task of COLL_A didn't finish execution yet when the COLL_B 
> task is enqueued...
>  * It might be possible to set a (very) long duration for the slow task of 
> COLL_A (to be less vulnerable to execution delays) without requiring the test 
> to wait for that task to complete, but only wait for the COLL_B task to 
> complete (so the test doesn't run for too long).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (SOLR-14538) Fix or suppress remaining warnings in apache/solr/handler

2020-06-04 Thread Erick Erickson (Jira)

Erick Erickson created SOLR-14538:
-

 Summary: Fix or suppress remaining warnings in apache/solr/handler
 Key: SOLR-14538
 URL: https://issues.apache.org/jira/browse/SOLR-14538
 Project: Solr
  Issue Type: Sub-task
Reporter: Erick Erickson
Assignee: Erick Erickson






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9392) Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR to public

2020-06-04 Thread Michael McCandless (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126235#comment-17126235
 ] 

Michael McCandless commented on LUCENE-9392:


+1

> Change the visibility of o.a.l.f.FacetsConfig.DELIM_CHAR to public 
> ---
>
> Key: LUCENE-9392
> URL: https://issues.apache.org/jira/browse/LUCENE-9392
> Project: Lucene - Core
>  Issue Type: Improvement
>Affects Versions: 8.5.2
>Reporter: Ankur
>Priority: Minor
>
> FacetsConfig.DELIM_CHAR is marked as private.  An application that wants to 
> use this delimiter (in a unit test for example) is forced to re-declare it in 
> the application code. This can break the application if tetshe value of 
> DELIM_CHAR is changed in FacetsConfig



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-9390) Kuromoji tokenizer discards tokens if they start with a punctuation character

2020-06-04 Thread Michael McCandless (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126237#comment-17126237
 ] 

Michael McCandless commented on LUCENE-9390:


Not sure if it is related, but LUCENE-9100 is another struggle with Kuromoji 
and punctuation.

> Kuromoji tokenizer discards tokens if they start with a punctuation character
> -
>
> Key: LUCENE-9390
> URL: https://issues.apache.org/jira/browse/LUCENE-9390
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Jim Ferenczi
>Priority: Minor
>
> This issue was first raised in Elasticsearch 
> [here|https://github.com/elastic/elasticsearch/issues/57614]
> The unidic dictionary that is used by the Kuromoji tokenizer contains entries 
> that mix punctuations and other characters. For instance the following entry:
> _（株）,1285,1285,3690,名詞,一般,*,*,*,*,（株）,カブシキガイシャ,カブシキガイシャ_
> can be found in the Noun.csv file.
> Today, tokens that start with punctuations are automatically removed by 
> default (discardPunctuation  is true). I think the code was written this way 
> because we expect punctuations to be separated from normal tokens but there 
> are exceptions in the original dictionary. Maybe we should check the entire 
> token when discarding punctuations ?
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Created] (LUCENE-9393) FunctionScoreQuery shouldn’t use TOP_DOCS for creating inner weight

2020-06-04 Thread Tomas Eduardo Fernandez Lobbe (Jira)

Tomas Eduardo Fernandez Lobbe created LUCENE-9393:
-

 Summary: FunctionScoreQuery shouldn’t use TOP_DOCS for creating 
inner weight
 Key: LUCENE-9393
 URL: https://issues.apache.org/jira/browse/LUCENE-9393
 Project: Lucene - Core
  Issue Type: Improvement
Reporter: Tomas Eduardo Fernandez Lobbe


{{FunctionScoreQuery.createWeight}} creates the weight of the inner query using 
the {{scoreMode}} from it’s input parameter, however, FunctionScoreQuery can’t 
really use WAND algorithm, and the Scorer used will ignore calls to set 
competitive scores.  FunctionScoreQuery should just turn {{TOP_DOCS}} to 
{{COMPLETE}} before creating the inner query's weight.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (LUCENE-8962) Can we merge small segments during refresh, for faster searching?

2020-06-04 Thread Nhat Nguyen (Jira)



[ 
https://issues.apache.org/jira/browse/LUCENE-8962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126253#comment-17126253
 ] 

Nhat Nguyen commented on LUCENE-8962:
-

>  For some reason it's not linked above, and I'm not sure how to remedy that

I guess the title of the PR needs to be in the format of "LUCENE-: 
description".

> Can we merge small segments during refresh, for faster searching?
> -
>
> Key: LUCENE-8962
> URL: https://issues.apache.org/jira/browse/LUCENE-8962
> Project: Lucene - Core
>  Issue Type: Improvement
>  Components: core/index
>Reporter: Michael McCandless
>Priority: Major
> Fix For: 8.6
>
> Attachments: LUCENE-8962_demo.png, failed-tests.patch
>
>  Time Spent: 10h
>  Remaining Estimate: 0h
>
> With near-real-time search we ask {{IndexWriter}} to write all in-memory 
> segments to disk and open an {{IndexReader}} to search them, and this is 
> typically a quick operation.
> However, when you use many threads for concurrent indexing, {{IndexWriter}} 
> will accumulate write many small segments during {{refresh}} and this then 
> adds search-time cost as searching must visit all of these tiny segments.
> The merge policy would normally quickly coalesce these small segments if 
> given a little time ... so, could we somehow improve {{IndexWriter'}}s 
> refresh to optionally kick off merge policy to merge segments below some 
> threshold before opening the near-real-time reader?  It'd be a bit tricky 
> because while we are waiting for merges, indexing may continue, and new 
> segments may be flushed, but those new segments shouldn't be included in the 
> point-in-time segments returned by refresh ...
> One could almost do this on top of Lucene today, with a custom merge policy, 
> and some hackity logic to have the merge policy target small segments just 
> written by refresh, but it's tricky to then open a near-real-time reader, 
> excluding newly flushed but including newly merged segments since the refresh 
> originally finished ...
> I'm not yet sure how best to solve this, so I wanted to open an issue for 
> discussion!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mikemccand commented on pull request #1552: LUCENE-8962

2020-06-04 Thread GitBox



mikemccand commented on pull request #1552:
URL: https://github.com/apache/lucene-solr/pull/1552#issuecomment-639161416


   I'll beast this PR on my 128-core AMD Ryzen box :)



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] mikemccand commented on a change in pull request #1552: LUCENE-8962

2020-06-04 Thread GitBox



mikemccand commented on a change in pull request #1552:
URL: https://github.com/apache/lucene-solr/pull/1552#discussion_r435592460



##
File path: lucene/CHANGES.txt
##
@@ -376,6 +376,8 @@ Improvements
 
 * LUCENE-9253: KoreanTokenizer now supports custom dictionaries(system, 
unknown). (Namgyu Kim)
 
+* LUCENE-8962: Add ability to selectively merge on commit (Michael Froh)

Review comment:
   Maybe `Add IndexWriter merge-on-commit feature to selectively merge 
small segments on commit, subject to a configurable timeout, to improve search 
performance by reducing the number of small segments for searching`?

##
File path: lucene/core/src/java/org/apache/lucene/index/IndexWriterConfig.java
##
@@ -459,6 +463,31 @@ public IndexWriterConfig setCommitOnClose(boolean 
commitOnClose) {
 return this;
   }
 
+  /**
+   * Expert: sets the amount of time to wait for merges returned by 
MergePolicy.findFullFlushMerges(...).
+   * If this time is reached, we proceed with the commit based on segments 
merged up to that point.
+   * The merges are not cancelled, and may still run to completion independent 
of the commit.

Review comment:
   Maybe for the last sentence: `The merges are not cancelled, and will 
still run to completion independent of the commit like normal segment merges`?
   
   Maybe also state that this setting has no effect unless the `MergePolicy` 
actually returns merges from `findFullFlushMerges`, which the default merge 
policy does not?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] tflobbe opened a new pull request #1553: LUCENE-9393: FunctionScoreQuery turns TOP_DOCS to COMPLETE in inner weights

2020-06-04 Thread GitBox



tflobbe opened a new pull request #1553:
URL: https://github.com/apache/lucene-solr/pull/1553


   FunctionScoreQuery can't really use WAND algorithm even if TOP_DOCS score 
mode is requested. This commit makes the inner weight created use COMPLETE in 
this case.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene-solr] tflobbe commented on a change in pull request #1553: LUCENE-9393: FunctionScoreQuery turns TOP_DOCS to COMPLETE in inner weights

2020-06-04 Thread GitBox



tflobbe commented on a change in pull request #1553:
URL: https://github.com/apache/lucene-solr/pull/1553#discussion_r435604487



##
File path: 
lucene/queries/src/test/org/apache/lucene/queries/function/TestFunctionScoreQuery.java
##
@@ -244,4 +247,33 @@ public void testAccessToValueSource() throws Exception {
 
   }
 
+  public void testScoreMode() throws Exception {
+// Value Source doesn't need scores
+assertInnerScoreMode(ScoreMode.COMPLETE_NO_SCORES, ScoreMode.COMPLETE, 
DoubleValuesSource.fromDoubleField("foo"));

Review comment:
   While this test makes the current code pass, I'm trying to understand 
why it is correct. Why do we pass a COMPLETE_NO_SCORES to the underlying weight 
when the function's ValueSource doesn't need scores?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[jira] [Commented] (SOLR-13132) Improve JSON "terms" facet performance when sorted by relatedness

2020-06-04 Thread Michael Gibney (Jira)



[ 
https://issues.apache.org/jira/browse/SOLR-13132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17126314#comment-17126314
 ] 

Michael Gibney commented on SOLR-13132:
---

I pushed a new commit (18c3d21f44b382b5e35ba7551ebf20abb31fdb76) to the branch 
for PR 751 that reverts the five preceding commits (which were attempting to 
pursue a different approach to addressing SOLR-14467).

> Improve JSON "terms" facet performance when sorted by relatedness 
> --
>
> Key: SOLR-13132
> URL: https://issues.apache.org/jira/browse/SOLR-13132
> Project: Solr
>  Issue Type: Improvement
>  Components: Facet Module
>Affects Versions: 7.4, master (9.0)
>Reporter: Michael Gibney
>Priority: Major
> Attachments: SOLR-13132-with-cache-01.patch, 
> SOLR-13132-with-cache.patch, SOLR-13132.patch, SOLR-13132_testSweep.patch
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> When sorting buckets by {{relatedness}}, JSON "terms" facet must calculate 
> {{relatedness}} for every term. 
> The current implementation uses a standard uninverted approach (either 
> {{docValues}} or {{UnInvertedField}}) to get facet counts over the domain 
> base docSet, and then uses that initial pass as a pre-filter for a 
> second-pass, inverted approach of fetching docSets for each relevant term 
> (i.e., {{count > minCount}}?) and calculating intersection size of those sets 
> with the domain base docSet.
> Over high-cardinality fields, the overhead of per-term docSet creation and 
> set intersection operations increases request latency to the point where 
> relatedness sort may not be usable in practice (for my use case, even after 
> applying the patch for SOLR-13108, for a field with ~220k unique terms per 
> core, QTime for high-cardinality domain docSets were, e.g.: cardinality 
> 1816684=9000ms, cardinality 5032902=18000ms).
> The attached patch brings the above example QTimes down to a manageable 
> ~300ms and ~250ms respectively. The approach calculates uninverted facet 
> counts over domain base, foreground, and background docSets in parallel in a 
> single pass. This allows us to take advantage of the efficiencies built into 
> the standard uninverted {{FacetFieldProcessorByArray[DV|UIF]}}), and avoids 
> the per-term docSet creation and set intersection overhead.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

61 matches

Mail list logo