rmuir commented on PR #14192:
URL: https://github.com/apache/lucene/pull/14192#issuecomment-2667264982
```java
var re = new RegExp("παραστάσεις", RegExp.NONE, RegExp.CASE_INSENSITIVE);
System.out.println(re.toAutomaton().toDot());
```
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1949836434
##
lucene/core/src/java/org/apache/lucene/util/automaton/CaseFolding.java:
##
@@ -0,0 +1,82 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1949691308
##
lucene/core/src/java/org/apache/lucene/util/automaton/CaseFoldingUtil.java:
##
@@ -0,0 +1,338 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1946735208
##
lucene/core/src/java/org/apache/lucene/util/automaton/CaseFoldingUtil.java:
##
@@ -0,0 +1,338 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1946628495
##
lucene/core/src/java/org/apache/lucene/util/automaton/CaseFoldingUtil.java:
##
@@ -0,0 +1,338 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1945855034
##
lucene/core/src/java/org/apache/lucene/util/automaton/CaseFoldingUtil.java:
##
@@ -0,0 +1,338 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or
john-wagster commented on PR #14192:
URL: https://github.com/apache/lucene/pull/14192#issuecomment-2641159156
Iterated here a bit after the changes in
https://github.com/apache/lucene/pull/14193 went in and also pivoted to using
https://www.unicode.org/Public/16.0.0/ucd/CaseFolding.txt. I
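
For readers unfamiliar with the file mentioned above: each data line in CaseFolding.txt has the form `<code>; <status>; <mapping>; # <name>`, where status `C` (common) and `S` (simple) give one-to-one foldings and `F` (full) may expand to several code points. The following is only a minimal sketch of reading that format with the plain JDK; the class and method names (`CaseFoldingSketch`, `parseSimpleFoldings`, `fold`) are hypothetical, it is not the code in this PR, and choosing to keep only `C` and `S` entries is an assumption made for illustration.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; not the CaseFolding class from the PR. */
public class CaseFoldingSketch {

  /** Parses CaseFolding.txt-style lines, keeping only one-to-one (C and S) foldings. */
  public static Map<Integer, Integer> parseSimpleFoldings(List<String> lines) {
    Map<Integer, Integer> folds = new HashMap<>();
    for (String line : lines) {
      // Everything after '#' is a comment (the character name).
      int hash = line.indexOf('#');
      String data = (hash >= 0 ? line.substring(0, hash) : line).trim();
      if (data.isEmpty()) {
        continue;
      }
      String[] fields = data.split(";");
      if (fields.length < 3) {
        continue;
      }
      String status = fields[1].trim();
      // Assumption: skip F (full, multi-code-point) and T (Turkic) entries.
      if (status.equals("C") || status.equals("S")) {
        int from = Integer.parseInt(fields[0].trim(), 16);
        int to = Integer.parseInt(fields[2].trim(), 16);
        folds.put(from, to);
      }
    }
    return folds;
  }

  /** Returns the folded code point, or the input itself when no folding is listed. */
  public static int fold(Map<Integer, Integer> folds, int codePoint) {
    return folds.getOrDefault(codePoint, codePoint);
  }

  public static void main(String[] args) {
    // A few real entries from CaseFolding.txt, embedded so the sketch runs offline.
    List<String> sample =
        List.of(
            "# sample excerpt",
            "0041; C; 0061; # LATIN CAPITAL LETTER A",
            "00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S",
            "03A3; C; 03C3; # GREEK CAPITAL LETTER SIGMA",
            "03C2; C; 03C3; # GREEK SMALL LETTER FINAL SIGMA",
            "1E9E; F; 0073 0073; # LATIN CAPITAL LETTER SHARP S",
            "1E9E; S; 00DF; # LATIN CAPITAL LETTER SHARP S");
    Map<Integer, Integer> folds = parseSimpleFoldings(sample);
    for (int cp : new int[] {0x0041, 0x03A3, 0x03C2, 0x1E9E, 0x00DF}) {
      System.out.printf("U+%04X -> U+%04X%n", cp, fold(folds, cp));
    }
  }
}
```

In this sketch both capital sigma and final sigma fold to U+03C3, which is the kind of equivalence a case-insensitive match on a word such as "παραστάσεις" depends on.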
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1942013537
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -696,17 +896,52 @@ private Automaton toAutomaton(
return a;
}
- private Automaton
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1942009773
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DEPRECATE
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1941963110
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -696,17 +896,52 @@ private Automaton toAutomaton(
return a;
}
- private Aut
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1941960613
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DE
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1940403543
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DEPRECATE
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1940371117
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -696,17 +896,52 @@ private Automaton toAutomaton(
return a;
}
- private Automaton
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1940264096
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DEPRECATE
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1940026321
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DE
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1940023844
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -424,6 +426,46 @@ public enum Kind {
/** Allows case insensitive matching of ASCII
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1939974637
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DEPRECATE
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1939944485
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -424,6 +426,46 @@ public enum Kind {
/** Allows case insensitive matching of ASCII charact
rmuir commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1939946041
##
lucene/core/src/java/org/apache/lucene/util/automaton/RegExp.java:
##
@@ -436,6 +478,160 @@ public enum Kind {
*/
@Deprecated public static final int DEPRECATE
john-wagster commented on code in PR #14192:
URL: https://github.com/apache/lucene/pull/14192#discussion_r1939889352
##
lucene/core/src/test/org/apache/lucene/util/automaton/TestRegExp.java:
##
@@ -35,6 +43,320 @@ public void testSmoke() {
assertFalse(run.run("ad"));
}
john-wagster opened a new pull request, #14192:
URL: https://github.com/apache/lucene/pull/14192
About four years ago ASCII-only case insensitive matching
(https://github.com/apache/lucene-solr/pull/1541) was added to Lucene. In the
past couple of years a couple of requests have been mad
john-wagster commented on PR #14192:
URL: https://github.com/apache/lucene/pull/14192#issuecomment-2631833927
@jpountz, @jimczi, @mayya-sharipova y'all may be interested in this PR, so
just tagging you here.