Re: Unfinished Business: Fast Global IDF

2024-08-27 Thread David Smiley
Thanks for sharing, Walter! I hope someone enterprising tackles it. It'd be nice to have global IDF by default without having to enable something that adds a performance risk. I'm sure you have many career stories to tell. If you find yourself at Acadia National Park hiking & backpacking, as y

Re: [Draft] Release notes for Apache Solr 9.7.0

2024-08-27 Thread Anshum Gupta
Thanks, David! The current list is far from complete, but this release has noticeably more in it than most other minor releases. We should be able to remove a few of those, but a lot of the currently listed changes would fall into the highlight category. Anshum Gupta

Re: [Draft] Release notes for Apache Solr 9.7.0

2024-08-27 Thread David Smiley
I restructured the announcement so that the CHANGES.txt reference goes after the highlights, since people will want to only consider clicking that *after* they've read the highlights IMO. I also added a link to our upgrade notes, which I think is critical info to link to. I noticed the current st

Re: [Draft] Release notes for Apache Solr 9.7.0

2024-08-27 Thread David Smiley
The following script can generate the contributors: https://github.com/apache/solr/pull/2424
Need to get that PR merged and integrated into the release process/wizard.

On Tue, Aug 27, 2024 at 3:14 PM Anshum Gupta wrote:
> Hi everyone,
>
> Here are the draft release notes for Apache Solr 9.7.0:

Re: Overlapping arguments (SOLR-17383)

2024-08-27 Thread Christos Malliaridis
Hello everyone, In order to start resolving the CLI argument conflicts from https://issues.apache.org/jira/browse/SOLR-17383, we started to deprecate (in 9.X) and remove or change (in 10.0/main) the overlapping arguments. I would like to use this thread for tracking each conflict resolution. A co

[Draft] Release notes for Apache Solr 9.7.0

2024-08-27 Thread Anshum Gupta
Hi everyone,

Here are the draft release notes for Apache Solr 9.7.0:
https://cwiki.apache.org/confluence/display/SOLR/ReleaseNote9_7_0

Please review and update.

-Anshum

Re: Unfinished Business: Fast Global IDF

2024-08-27 Thread Walter Underwood
This is the patent. Last assignee was Google, expired in 2017. https://patents.google.com/patent/US5659732A/en
—wunder

> On Aug 27, 2024, at 12:01 PM, Walter Underwood wrote:
>
> When I’ve enabled global exact IDF in Solr, the speed penalty was about 10X.
> Back in 1995, Infoseek figured out

[VOTE] Release Solr 9.7.0 RC1

2024-08-27 Thread Anshum Gupta
Please vote for Release Candidate 1 for Apache Solr 9.7.0.

The artifacts can be downloaded from:
https://dist.apache.org/repos/dist/dev/solr/solr-9.7.0-RC1-rev-dd176f1217f0573ea9b9b72c75a3e52e7a49e139

You can run the smoke tester directly with this command:
python3 -u dev-tools/scripts/smokeTestR

Unfinished Business: Fast Global IDF

2024-08-27 Thread Walter Underwood
When I’ve enabled global exact IDF in Solr, the speed penalty was about 10X. Back in 1995, Infoseek figured out how to do that with no speed penalty. They patented it, but that patent expired several years ago. I’ll try and hunt it down. Short version, from each shard return the number of docs
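
For readers unfamiliar with the term, "global IDF" in a sharded index can be illustrated roughly as follows. This is only a sketch under stated assumptions: the `ShardStats` type, the function names, and the BM25-style smoothing formula are illustrative choices, not the Infoseek method from the patent and not Solr's implementation. It shows only that summing per-shard document counts and per-shard document frequencies before computing IDF gives every shard the same term weight.

```python
import math
from dataclasses import dataclass


@dataclass
class ShardStats:
    """Hypothetical per-shard statistics for a single term."""
    num_docs: int  # documents indexed on this shard
    doc_freq: int  # documents on this shard that contain the term


def idf(num_docs: int, doc_freq: int) -> float:
    # BM25-style smoothed IDF; the choice of formula is an assumption.
    return math.log(1 + (num_docs - doc_freq + 0.5) / (doc_freq + 0.5))


def local_idf(shard: ShardStats) -> float:
    # IDF computed from one shard's own statistics only.
    return idf(shard.num_docs, shard.doc_freq)


def global_idf(shards: list[ShardStats]) -> float:
    # Sum the counts across shards first, then compute a single IDF.
    total_docs = sum(s.num_docs for s in shards)
    total_df = sum(s.doc_freq for s in shards)
    return idf(total_docs, total_df)


# Two equally sized shards where the term is rare on one and common on the other.
shards = [ShardStats(num_docs=1000, doc_freq=10),
          ShardStats(num_docs=1000, doc_freq=500)]
per_shard = [local_idf(s) for s in shards]
combined = global_idf(shards)
```

With skewed shards like these, the per-shard values differ widely, so scores from different shards are not directly comparable; the combined value sits between them and is the same no matter which shard scores the document.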

Re: Unfinished Business: Fuzzy in edismax

2024-08-27 Thread Walter Underwood
Oops. https://issues.apache.org/jira/browse/SOLR-629
—wunder

> On Aug 27, 2024, at 11:40 AM, Walter Underwood wrote:
>
> I’m retired and not working on Solr all the time, but there are two things I
> didn’t finish that should be picked up. I’m not going to do these, I’ve got
> plenty of retir

Unfinished Business: Fuzzy in edismax

2024-08-27 Thread Walter Underwood
I’m retired and not working on Solr all the time, but there are two things I didn’t finish that should be picked up. I’m not going to do these, I’ve got plenty of retirement stuff to do. The first is SOLR-629, probably the oldest open feature request and a good first project for someone. This a

Generic Test Injection Points for Cloud

2024-08-27 Thread Gus Heck
In thinking about the testing for QueryLimits, I'm somewhat unsatisfied because it seems like it would be ideal to be validating limit expirations at various points in the request process but the current tests are all thumbs, doing things like wasting CPU when query limits are checked which can rel