I forgot to mention: I admittedly haven't been very involved in Solr in
the past. So I'm probably not aware of many of the problems Solr might
have had with staying in sync with Lucene. If everyone here agrees with
Yonik/Mike's proposal I will not try to block it with a -1 veto. I'm
just trying to express here the concerns that come to my mind. To do
what's best for the future of the Lucene TLP as a whole is of course my
main interest too.
And I really really want to still be able to use Lucene separately as a
library, and I think we all agree here!
Michael
On 2/28/10 9:05 PM, Michael Busch wrote:
On 2/28/10 4:30 PM, Grant Ingersoll wrote:
Not sure why more tests would be a negative. The Solr tests exercise
quite a bit of Lucene functionality as well.
-Grant
Sorry, I should have made myself clearer here. It'd obviously be silly
to argue against more test coverage. In general I think it's a great
idea to run the Solr tests also when testing a Lucene patch.
I'm just not happy about making this a formal requirement (that Solr
tests have to pass in order to commit a Lucene patch). All
backwards-incompatible patches, which we had quite a few of in 2.9 and
3.0, would then become even more difficult to commit, because you have
to make all changes then in Solr too as part of the Lucene patch.
Think about changes like per-segment search or the new TokenStream API
and how difficult and time consuming they were for core and contrib
already. For backwards-compatible changes, by all means, let's run as
many tests as we can.
We have all been saying we want to have more frequent releases. Right
now Lucene has no external dependencies that could slow down a release
and still we don't release as frequently as we'd like to. If we add
dependencies like release alignment with subprojects I'm afraid this
will become worse.
I was really happy about the original idea of having a separate
analyzer module (or subproject, library, whatever name it'd have),
because analysis seems quite separate from indexing/search. Separating
the two seems logical. And why not release such an analyzer package
more frequently than Lucene. Different pieces of code don't all move
with the same pace. It'd be nice to have the freedom of releasing an
analyzer library after e.g. a new language was added, maybe even only
two weeks after the previous release. IMO more modular release cycles
is a better way to go than this new proposal.
I'd be happy if the Solr developers would be more involved in Lucene
(again) and if we would discuss new ideas with the question in mind,
where the new feature should live. And also the Lucene developers who
are not very involved in Solr should understand the impact that Lucene
changes have on Solr. So big +1 for better communication between Solr
and Lucene devs!
Michael