Re: [I] jenkins dump file traversal exceptions ("no matches found within 10000") [lucene]

2023-12-29 Thread via GitHub
dweiss commented on issue #12907: URL: https://github.com/apache/lucene/issues/12907#issuecomment-1871829048 Apache Infra has bumped the default for us to 60k. Let's see if these stack traces still show up. https://issues.apache.org/jira/browse/INFRA-25269 -- This is an automated mess

Re: [I] jenkins dump file traversal exceptions ("no matches found within 10000") [lucene]

2023-12-29 Thread via GitHub
dweiss commented on issue #12907: URL: https://github.com/apache/lucene/issues/12907#issuecomment-1871830389 Answering myself - yes, still a thing, https://ci-builds.apache.org/job/Lucene/job/Lucene-Check-main/10935/ ``` Caused: java.lang.InterruptedException: hudson.FilePath$FileMask

Re: [I] jenkins dump file traversal exceptions ("no matches found within 10000") [lucene]

2023-12-29 Thread via GitHub
uschindler commented on issue #12907: URL: https://github.com/apache/lucene/issues/12907#issuecomment-1871830725 I don't think the setting is live yet. You need to restart everything. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to G

Re: [I] jenkins dump file traversal exceptions ("no matches found within 10000") [lucene]

2023-12-29 Thread via GitHub
dweiss commented on issue #12907: URL: https://github.com/apache/lucene/issues/12907#issuecomment-1871831161 The exception message would indicate it's live though, right? They bumped it to 60k. https://github.com/apache/infrastructure-p6/pull/1747/files -- This is an automated message

Re: [I] jenkins dump file traversal exceptions ("no matches found within 10000") [lucene]

2023-12-29 Thread via GitHub
uschindler commented on issue #12907: URL: https://github.com/apache/lucene/issues/12907#issuecomment-1871832304 Yes. The file mask cannot be improved. It counts all files it sees. Look at the other code. Unless the path starts with a fixed path it needs to go through all files. It's like w

[I] Introduce Bloom Filter as non-experimental/core postings format [lucene]

2023-12-29 Thread via GitHub
mgodwan opened a new issue, #12986: URL: https://github.com/apache/lucene/issues/12986 ### Description Today, [BloomFilteringPostingsFormat](https://github.com/apache/lucene/blob/main/lucene/codecs/src/java/org/apache/lucene/codecs/bloom/BloomFilteringPostingsFormat.java) in Lucene i

Re: [PR] Move group-varint encoding/decoding logic to DataOutput/DataInput [lucene]

2023-12-29 Thread via GitHub
uschindler commented on PR #12841: URL: https://github.com/apache/lucene/pull/12841#issuecomment-1872008672 Hi @easyice, I backported the PR. There was only a change in the test because in Java 11 does not have random() with two parameters. We have TestUtil for that. Uwe -- This is

Re: [PR] Move group-varint encoding/decoding logic to DataOutput/DataInput [lucene]

2023-12-29 Thread via GitHub
easyice commented on PR #12841: URL: https://github.com/apache/lucene/pull/12841#issuecomment-1872023239 Thank you for the backport and all great suggestions! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

Re: [I] Introduce Bloom Filter as non-experimental/core postings format [lucene]

2023-12-29 Thread via GitHub
rmuir commented on issue #12986: URL: https://github.com/apache/lucene/issues/12986#issuecomment-1872038887 supporting back compat is a one-way door and a big deal. Back compat has a heavy price and is responsible for lots of bugs (e.g. Lucene 9.9.1 release). It can't be done on a whim and

[PR] Output binary doc values as hex array in SimpleTextCodec [lucene]

2023-12-29 Thread via GitHub
msfroh opened a new pull request, #12987: URL: https://github.com/apache/lucene/pull/12987 ### Description Binary doc values were being written directly in SimpleTextCodec, though they may not be valid UTF-8 (i.e. they may not be "text"). This change encodes them as a string represen

Re: [PR] Output well-formed UTF-8 bytes in SimpleTextCodec's segmentinfos [lucene]

2023-12-29 Thread via GitHub
msfroh commented on PR #12897: URL: https://github.com/apache/lucene/pull/12897#issuecomment-1872283044 I implemented a similar change for binary doc values at https://github.com/apache/lucene/pull/12987 -- This is an automated message from the Apache Git Service. To respond to the messag

Re: [I] Make ByteBufferIndexInput public [LUCENE-8406] [lucene]

2023-12-29 Thread via GitHub
uschindler closed issue #9453: Make ByteBufferIndexInput public [LUCENE-8406] URL: https://github.com/apache/lucene/issues/9453 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] Replace ByteBuffersIndexInput with ByteBufferIndexInput (replace and rename) [LUCENE-8661] [lucene]

2023-12-29 Thread via GitHub
uschindler commented on issue #9707: URL: https://github.com/apache/lucene/issues/9707#issuecomment-1872409165 We should do this cleanup at some point. We still have 2 implementations... -- This is an automated message from the Apache Git Service. To respond to the message, please log on t