[GitHub] [lucene] dweiss commented on issue #11771: KeywordRepeatFilter + OpenNLPLemmatizer Early Exit

2022-09-24 Thread GitBox


dweiss commented on issue #11771:
URL: https://github.com/apache/lucene/issues/11771#issuecomment-1256892005

   If this code went in the main branch then it's also a bug there. Comparing 
strings by reference is a no-no - I should have caught it earlier. I'll do the 
update on both branches later today.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] romseygeek commented on pull request #11807: No need to rewrite queries in unified highlighter

2022-09-24 Thread GitBox


romseygeek commented on PR #11807:
URL: https://github.com/apache/lucene/pull/11807#issuecomment-1256931459

   I thought the changes you made for unrecognized queries fixed the issues 
with the surround query parser? If not it would be good to implement query 
visitors for the queries that it produces, or at the very least have a test in 
the highlighting module for them.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] romseygeek merged pull request #11808: Don't try to highlight very long terms

2022-09-24 Thread GitBox


romseygeek merged PR #11808:
URL: https://github.com/apache/lucene/pull/11808


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss commented on pull request #11810: fix equality check bug in test

2022-09-24 Thread GitBox


dweiss commented on PR #11810:
URL: https://github.com/apache/lucene/pull/11810#issuecomment-1256938048

   This stream is not required at all: you just split on any consecutive 
sequence of white spaces, including newlines, and it'll be fine. I'll commit 
this separately and backport.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss closed pull request #11810: fix equality check bug in test

2022-09-24 Thread GitBox


dweiss closed pull request #11810: fix equality check bug in test
URL: https://github.com/apache/lucene/pull/11810


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss closed issue #11771: KeywordRepeatFilter + OpenNLPLemmatizer Early Exit

2022-09-24 Thread GitBox


dweiss closed issue #11771: KeywordRepeatFilter + OpenNLPLemmatizer Early Exit
URL: https://github.com/apache/lucene/issues/11771


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


dweiss commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256959688

   Not sure what's failing here, will take a look later. I suspect errorprone.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir opened a new pull request, #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir opened a new pull request, #11813:
URL: https://github.com/apache/lucene/pull/11813

   This method is recursive: to avoid eating too much stack we apply a small 
limit. This means it can't really be used on any largish automata without 
hitting exception.
   
   But the benefit of knowing finite vs infinite in AutomatonTermsEnum is 
minor: let's not auto-compute this. FuzzyQuery still gets the finite 
optimization because its finite by definition. PrefixQuery is always infinite. 
Wildcard/Regex just assume infinite which is safe to do.
   
   Remove the auto-computation and the "trillean" Boolean parameter. If you 
dont know that your automaton is finite, pass false to CompiledAutomaton, it is 
safe.
   
   Move this method to AutomatonTestUtil so we can still use it in test asserts.
   
   Closes #11809


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss merged pull request #11811: Upgrade google java format and apply tidy

2022-09-24 Thread GitBox


dweiss merged PR #11811:
URL: https://github.com/apache/lucene/pull/11811


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


rmuir commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256971988

   Looks like error-prone fails saying that `PublicConstructorForAbstractClass` 
is not a valid checker name. 
   We explicitly disable it currently:
   ```
   gradle/validation/error-prone.gradle:
'-Xep:PublicConstructorForAbstractClass:OFF'
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


dweiss commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256973107

   Yeah, thanks, Robert. I'll be looking into this - have an hour to spare at 
the airport anyway.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] stefanvodita opened a new issue, #11814: Support deletes in IndexRearranger

2022-09-24 Thread GitBox


stefanvodita opened a new issue, #11814:
URL: https://github.com/apache/lucene/issues/11814

   ### Description
   
   `IndexRearranger` applies deletes found in the original index, so the 
rearranged index will not contain any deleted docs. While deleted docs don't 
show up in search results, they still impact search performance. If we want the 
rearranged index to be as similar as possible to the original index, we should 
also include the docs marked for deletion.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


dweiss commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256975233

   Right - this check was removed here: 
https://github.com/google/error-prone/commit/b4c6a03168102026bb6540627b21b356804250e4


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] mikemccand commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


mikemccand commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979249444


##
lucene/core/src/java/org/apache/lucene/util/automaton/CompiledAutomaton.java:
##
@@ -139,21 +136,21 @@ private static int findSinkState(Automaton automaton) {
   }
 
   /**
-   * Create this. If finite is null, we use {@link Operations#isFinite} to 
determine whether it is
-   * finite. If simplify is true, we run possibly expensive operations to 
determine if the automaton
-   * is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}.
+   * Create this. If simplify is true, we run possibly expensive operations to 
determine if the
+   * automaton is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}. 
Set finite to true if
+   * the automaton is finite, otherwise set to false if infinite or you don't 
know.

Review Comment:
   If a user accidentally claims the automaton was finite but it is not, what 
happens?



##
lucene/core/src/java/org/apache/lucene/analysis/AutomatonToTokenStream.java:
##
@@ -46,10 +45,6 @@ private AutomatonToTokenStream() {}
* @return TokenStream representation of automaton.
*/
   public static TokenStream toTokenStream(Automaton automaton) {
-if (Operations.isFinite(automaton) == false) {

Review Comment:
   Maybe we should add a warning that this may run forever on an infinite 
automaton?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] dweiss commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


dweiss commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256976327

   Another check was removed here: 
https://github.com/google/error-prone/pull/2809


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] stefanvodita opened a new pull request, #11815: Support deletions in rearrange (#11814)

2022-09-24 Thread GitBox


stefanvodita opened a new pull request, #11815:
URL: https://github.com/apache/lucene/pull/11815

   ### Description
   
   Changes made to support deletes in `IndexRearranger`:
   1. Add `isDeleted(LeafReader reader, int idx)` in document selectors, which 
returns true if the document found at index `idx` in the leaf reader was marked 
for deletion in the original index.
   2. Record all docs in the `DocumentSelector` (not just live ones).
   3. When rearranging, in `addOneSegment`, add all docs that belong in the 
segment and then delete the ones that were originally marked for deletion.
   
   ### Testing
   The new test class is copied from `TestIndexRearranger` with modifications 
for deleting documents in the original index and testing that they are still 
present and deleted in the rearranged index. In a subsequent revision, this 
file would be merged into `TestIndexRearranger`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979251049


##
lucene/core/src/java/org/apache/lucene/analysis/AutomatonToTokenStream.java:
##
@@ -46,10 +45,6 @@ private AutomatonToTokenStream() {}
* @return TokenStream representation of automaton.
*/
   public static TokenStream toTokenStream(Automaton automaton) {
-if (Operations.isFinite(automaton) == false) {

Review Comment:
   ok, ill fix this.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979251445


##
lucene/core/src/java/org/apache/lucene/util/automaton/CompiledAutomaton.java:
##
@@ -139,21 +136,21 @@ private static int findSinkState(Automaton automaton) {
   }
 
   /**
-   * Create this. If finite is null, we use {@link Operations#isFinite} to 
determine whether it is
-   * finite. If simplify is true, we run possibly expensive operations to 
determine if the automaton
-   * is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}.
+   * Create this. If simplify is true, we run possibly expensive operations to 
determine if the
+   * automaton is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}. 
Set finite to true if
+   * the automaton is finite, otherwise set to false if infinite or you don't 
know.

Review Comment:
   I assume wrong answers or assertions :) Only FuzzyTermsEnum uses this 
optimization to avoid a little CPU/upkeeping for the fuzzy case, its really an 
opto for that. BlockTree intersection doesn't even look at it, i think.
   
   We could probably tone down some of these CompiledAutomaton ctors to expose 
it less in the future. We just need a single expert ctor for Fuzzy? I was 
trying to minimize the scope of API changes here but can do more.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] mikemccand commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


mikemccand commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979251604


##
lucene/core/src/java/org/apache/lucene/util/automaton/CompiledAutomaton.java:
##
@@ -139,21 +136,21 @@ private static int findSinkState(Automaton automaton) {
   }
 
   /**
-   * Create this. If finite is null, we use {@link Operations#isFinite} to 
determine whether it is
-   * finite. If simplify is true, we run possibly expensive operations to 
determine if the automaton
-   * is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}.
+   * Create this. If simplify is true, we run possibly expensive operations to 
determine if the
+   * automaton is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}. 
Set finite to true if
+   * the automaton is finite, otherwise set to false if infinite or you don't 
know.

Review Comment:
   OK no worries -- no need to do more here!  This change is already 
self-contained and a great progress!



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979252083


##
lucene/core/src/java/org/apache/lucene/util/automaton/CompiledAutomaton.java:
##
@@ -139,21 +136,21 @@ private static int findSinkState(Automaton automaton) {
   }
 
   /**
-   * Create this. If finite is null, we use {@link Operations#isFinite} to 
determine whether it is
-   * finite. If simplify is true, we run possibly expensive operations to 
determine if the automaton
-   * is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}.
+   * Create this. If simplify is true, we run possibly expensive operations to 
determine if the
+   * automaton is one the cases in {@link CompiledAutomaton.AUTOMATON_TYPE}. 
Set finite to true if
+   * the automaton is finite, otherwise set to false if infinite or you don't 
know.

Review Comment:
   yeah, this recursive method is "in the query path", the only remaining 
recursive method is sortTopoStates, which is less exposed (i think suggesters 
only).
   
   ultimately it would be great to remove more of these "automatic" (sometimes 
costly) optimizations, maybe even remove CompiledAutomaton.  we've been making 
progress.
   
   but this Operations.isFinite is definitely the biggest problem and easiest 
win right now.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979254120


##
lucene/core/src/java/org/apache/lucene/analysis/AutomatonToTokenStream.java:
##
@@ -46,10 +45,6 @@ private AutomatonToTokenStream() {}
* @return TokenStream representation of automaton.
*/
   public static TokenStream toTokenStream(Automaton automaton) {
-if (Operations.isFinite(automaton) == false) {

Review Comment:
   I looked at the javadocs (scroll up just a bit more), it looks good?
   
   ```java
 /**
  * converts an automaton into a TokenStream. This is done by first Topo 
sorting the nodes in the
  * Automaton. Nodes that have the same distance from the start are grouped 
together to form the
  * position nodes for the TokenStream. The resulting TokenStream releases 
edges from the automaton
  * as tokens in order from the position nodes. This requires the automaton 
be a finite DAG.
  *
  * @param automaton automaton to convert. Must be a finite DAG.
  * @return TokenStream representation of automaton.
  */
 public static TokenStream toTokenStream(Automaton automaton) {
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] mikemccand commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


mikemccand commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979254281


##
lucene/core/src/java/org/apache/lucene/analysis/AutomatonToTokenStream.java:
##
@@ -46,10 +45,6 @@ private AutomatonToTokenStream() {}
* @return TokenStream representation of automaton.
*/
   public static TokenStream toTokenStream(Automaton automaton) {
-if (Operations.isFinite(automaton) == false) {

Review Comment:
   Ahh super yes I agree!



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on a diff in pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on code in PR #11813:
URL: https://github.com/apache/lucene/pull/11813#discussion_r979255164


##
lucene/core/src/java/org/apache/lucene/analysis/AutomatonToTokenStream.java:
##
@@ -46,10 +45,6 @@ private AutomatonToTokenStream() {}
* @return TokenStream representation of automaton.
*/
   public static TokenStream toTokenStream(Automaton automaton) {
-if (Operations.isFinite(automaton) == false) {

Review Comment:
   I boosted it a little bit with an explanation mark and the words "infinite 
loop"



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir commented on PR #11813:
URL: https://github.com/apache/lucene/pull/11813#issuecomment-1256986744

   I'm planning on doing this 10.x-only, not out of laziness, but because there 
are already several related 10.x changes around this stuff: removal of det in 
#11049, removal of minimize in #11332, etc.
   
   Not opposed to backporting this stuff to 9.x but we'd need to be careful.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir merged pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


rmuir merged PR #11813:
URL: https://github.com/apache/lucene/pull/11813


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir closed issue #11809: input automaton is too large for lengthy wildcard query

2022-09-24 Thread GitBox


rmuir closed issue #11809: input automaton is too large  for lengthy wildcard 
query
URL: https://github.com/apache/lucene/issues/11809


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on pull request #11812: Upgrade several build dependencies.

2022-09-24 Thread GitBox


rmuir commented on PR #11812:
URL: https://github.com/apache/lucene/pull/11812#issuecomment-1256988534

   oh, one question @dweiss, since `javacc` is upgraded, should we force a 
`gradlew regenerate`?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] rmuir commented on issue #11809: input automaton is too large for lengthy wildcard query

2022-09-24 Thread GitBox


rmuir commented on issue #11809:
URL: https://github.com/apache/lucene/issues/11809#issuecomment-125699

   Thanks for reporting this with easy-to-reproduce testcase @taroplus 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] magibney commented on pull request #11813: Remove Operations.isFinite

2022-09-24 Thread GitBox


magibney commented on PR #11813:
URL: https://github.com/apache/lucene/pull/11813#issuecomment-1257009186

   fwiw I have a local branch (very recent) that changed the implementation of 
`Operations.isFinite()` (and `Operations.topoSortStates()`) to be 
non-recursive, afaict without sacrificing performance on common use cases. 
Hoping to push a PR soon.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] shaie commented on a diff in pull request #11768: Fix tie-break bug in various Facets implementations

2022-09-24 Thread GitBox


shaie commented on code in PR #11768:
URL: https://github.com/apache/lucene/pull/11768#discussion_r979357774


##
lucene/facet/src/java/org/apache/lucene/facet/taxonomy/FloatTaxonomyFacets.java:
##
@@ -189,10 +190,11 @@ private TopChildrenForPath 
getTopChildrenForPath(DimConfig dimConfig, int pathOr
 
 TopOrdAndFloatQueue.OrdAndValue reuse = null;
 while (ord != TaxonomyReader.INVALID_ORDINAL) {
-  if (values[ord] > 0) {
+  float value = values[ord];
+  if (value > 0) {
 aggregatedValue = aggregationFunction.aggregate(aggregatedValue, 
values[ord]);

Review Comment:
   nit: might as well use `value` here too (and check if we you can replace 
`values[ord]` with `value` elsewhere



##
lucene/facet/src/test/org/apache/lucene/facet/TestDrillSideways.java:
##
@@ -626,7 +626,7 @@ public void testBasicWithCollectorManager() throws 
Exception {
 List topNDimsResult = r.facets.getTopDims(1, 2);
 assertEquals(1, topNDimsResult.size());
 assertEquals(
-"dim=Author path=[] value=5 childCount=4\n  Lisa (2)\n  Susan (1)\n",

Review Comment:
   It's disturbing that these tests were "wrong" and we just let them be like 
that. I'm glad that you fixed them, but makes me wonder if it was possible to 
catch this bug earlier by scrutinizing these tests better.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org