[GitHub] [lucene] mkhludnev commented on issue #12259: Case insensitive search

2023-05-04 Thread via GitHub
mkhludnev commented on issue #12259: URL: https://github.com/apache/lucene/issues/12259#issuecomment-1534327232 [SrikanthMedisetti] please use TextField instead. Not an issue. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub an

[GitHub] [lucene] mkhludnev closed issue #12259: Case insensitive search

2023-05-04 Thread via GitHub
mkhludnev closed issue #12259: Case insensitive search URL: https://github.com/apache/lucene/issues/12259 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail

[GitHub] [lucene] mkhludnev opened a new issue, #12264: Shouldn't StandardTokenizer keep aplanum dot joined?

2023-05-04 Thread via GitHub
mkhludnev opened a new issue, #12264: URL: https://github.com/apache/lucene/issues/12264 ### Description ### AS-IS `a9nine.com` -> `a9nine.com` `3.14` -> `3.14` ### Problem `a9.com` -> `a9` `com` Should it keep them joined? -- This is an automated message from the

[GitHub] [lucene] romseygeek commented on issue #12264: Shouldn't StandardTokenizer keep aplanum dot joined?

2023-05-04 Thread via GitHub
romseygeek commented on issue #12264: URL: https://github.com/apache/lucene/issues/12264#issuecomment-1534586713 The tokenizer is based on http://unicode.org/reports/tr29/, which has rules for handling dots that appear in numbers or in URLs, but it does seem that URLs that have a number bef

[GitHub] [lucene] mkhludnev commented on issue #12264: Shouldn't StandardTokenizer keep aplanum dot joined?

2023-05-04 Thread via GitHub
mkhludnev commented on issue #12264: URL: https://github.com/apache/lucene/issues/12264#issuecomment-1534733406 Thanks @romseygeek. Right. It's a question. Maybe it's worth to discuss. For the reference https://lists.apache.org/thread/gpxz58jdb9n1sh2oxx161g4kkd7x94wn -- This is a

[GitHub] [lucene] mkhludnev commented on issue #12264: Shouldn't StandardTokenizer keep aplanum dot joined?

2023-05-04 Thread via GitHub
mkhludnev commented on issue #12264: URL: https://github.com/apache/lucene/issues/12264#issuecomment-1534792087 The proposal around http://unicode.org/reports/tr29/#WB7 is to introduce (implement) two new don't break rules: *WB6a* `AHLetter Numeric | × | (MidLetter | MidNumLetQ) AHLet

[GitHub] [lucene] msokolov commented on pull request #12254: add ConcurrentOnHeapHnswGraph and Builder

2023-05-04 Thread via GitHub
msokolov commented on PR #12254: URL: https://github.com/apache/lucene/pull/12254#issuecomment-1534829661 Yep, I plan to review, definitely interested to see this get committed, but it's a bit complex and I need to find some quiet time, which is rare. You know we're all volunteers here! -

[GitHub] [lucene] mkhludnev commented on a diff in pull request #12245: `ToParentBlockJoinQuery` Explain Support Score Mode

2023-05-04 Thread via GitHub
mkhludnev commented on code in PR #12245: URL: https://github.com/apache/lucene/pull/12245#discussion_r1185121793 ## lucene/join/src/java/org/apache/lucene/search/join/ToParentBlockJoinQuery.java: ## @@ -391,35 +391,75 @@ private void setScoreAndFreq() throws IOException {