I'm having two problems with Solr 4.9.1.  I can't upgrade yet, because
we are using a third-party plugin component that is not yet explicitly
qualified for anything newer than 4.9.0.  The point release upgrade
seemed like a safe bet, because I know that we don't do API changes in
point releases.  These are transient problems, and do not seem to be
affecting the index at this time.

Some background info:

Ubuntu 14, Java 8u40 from the webupd8 PPA, Solr 4.9.1.  It is *NOT*
SolrCloud.

Full rebuilds on my index involve building a new index in cores that I
have designated "build" cores, then swapping those cores with "live"
cores.  This always worked flawlessly before I updated to Solr 4.9.1 and
migrated the config to use core discovery.

root@idxb4:~# cat /index/solr4/cores/sparkinc_0/core.properties
name=sparkinclive
dataDir=../../data/sparkinc_0

root@idxb4:~# cat /index/solr4/cores/sparkinc_1/core.properties
name=sparkincbuild
dataDir=../../data/sparkinc_1

The first problem:  Sometimes, in a completely unpredictable manner, the
new solr.xml format seems to behave like using the old format with
persistent=false.

When I restarted Solr yesterday, that action swapped the live cores with
the build cores and I lost half my index because it swapped back to the
previous build cores.  Just now when I tried a restart, everything
worked flawlessly and the cores did not swap.

The second problem:  Sometimes old index segments do not get deleted,
even though they are not part of the index.

Another part of the full rebuild process involves clearing the build
cores before beginning the full import.  The code does a deleteByQuery
with *:* and then optimizes the core.  Sometimes this action fails to
delete the old segment files, but when I checked the core Overview in
the admin UI, numDocs only reflected the newly indexed docs and
deletedDocs was 0.

It was actually while trying to fix/debug this second problem that I
discovered the first problem.  Once the rebuild finished, I wanted to
see what would happen if I restarted Solr while one of my cores had 32GB
of segment files that were not part of the index ... but that's when the
indexes swapped.  At that point, I deleted all the dataDirs on both
machines (it's a distributed index), restarted Solr again, and began a
full rebuild.  Everything seems to be fine now.

Are either of these problems anything that anyone has seen?  I don't
recall seeing anything come across the list before.  Are there existing
issues in Jira?  Is there any information that I can provide which would
help in narrowing down the problem?

Thanks,
Shawn

Reply via email to