Re: Solr Master Slave Architecture over NFS

Grant Ingersoll Mon, 30 Jun 2008 10:35:38 -0700

I think it comes w/ some caveats, but is now workable (although it may not give great performance), assuming you're using 2.3 (2.2????) or later. I would definitely do a search in the Lucene archives about NFS, especially paying attention to Mike McCandless' comments.


On Jun 30, 2008, at 1:08 PM, Bill Au wrote:

Isn't using Lucene over NFS *not* recommended?

Bill

On Mon, Jun 30, 2008 at 4:27 AM, Nico Heid <[EMAIL PROTECTED]> wrote:
Hey, I'm looking for some feedback on the following setup.
Due to the architect's decision I will be working with NFS, not Solr's own distribution scripts.
A few Solr indexing machines use Multicore to divide the 300,000 users into 1000 shards.
For several reasons we have to go with per-user sharding (as you can see, 300 per shard). Updates come in at about 166 updates per hour on each shard, so not a problem.
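
To make the figures above concrete, here is a minimal sketch of the arithmetic. The message does not say how users are assigned to shards, so the modulo mapping in `shard_for_user` is purely an assumption for illustration, as are all the names used.

```python
# Figures taken from the message above.
NUM_USERS = 300_000
NUM_SHARDS = 1000
UPDATES_PER_SHARD_PER_HOUR = 166


def shard_for_user(user_id: int, num_shards: int = NUM_SHARDS) -> int:
    """Map a numeric user id to a shard index.

    ASSUMPTION: plain modulo assignment; the real mapping is not described.
    """
    return user_id % num_shards


# Average number of users that end up in each shard.
users_per_shard = NUM_USERS // NUM_SHARDS

# Aggregate update load across all shards.
total_updates_per_hour = UPDATES_PER_SHARD_PER_HOUR * NUM_SHARDS

# Average gap between two updates hitting the same shard, in seconds.
seconds_between_updates = 3600 / UPDATES_PER_SHARD_PER_HOUR
```

With these numbers each shard sees an update roughly every 20 seconds on average, which supports the "not a problem" assessment per shard, while the aggregate load across all shards is three orders of magnitude higher.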
The question lies more in this concept: I set up a few query slaves, using NFS read-only mounts.
I do not use the index directory for the read-only slaves. I patched the slaves to use the most recent snapshot directory to avoid all the nasty NFS issues (only a quick and dirty hack for testing). At a not yet defined interval I do a snapshot on the masters and send an HTTP commit to the slave, so a new reader on the fresh snapshot is opened.
This seems to work without trouble so far, but I've not done extensive testing.
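
A rough sketch of the two moving parts described above: picking the most recent snapshot directory, and building the HTTP commit for a slave. It assumes snapshots are named `snapshot.<yyyymmddHHMMSS>` (the naming Solr's snapshooter script uses, as far as I know) and that unfinished ones carry a suffix such as `-wip`; the slave URL in the usage note is hypothetical. This is not the actual patch mentioned in the message.

```python
import os
import re
import urllib.request

# ASSUMPTION: finished snapshots are named snapshot.<14-digit timestamp>.
SNAPSHOT_RE = re.compile(r"^snapshot\.(\d{14})$")


def latest_snapshot(data_dir):
    """Return the path of the newest finished snapshot directory, or None."""
    candidates = []
    for name in os.listdir(data_dir):
        match = SNAPSHOT_RE.match(name)
        # Names with extra suffixes (e.g. "-wip") do not match and are skipped.
        if match and os.path.isdir(os.path.join(data_dir, name)):
            candidates.append((match.group(1), name))
    if not candidates:
        return None
    # Fixed-width timestamps sort correctly as strings.
    return os.path.join(data_dir, max(candidates)[1])


def build_commit_request(slave_base_url):
    """Build (but do not send) a POST of <commit/> to the slave's update handler."""
    return urllib.request.Request(
        url=slave_base_url.rstrip("/") + "/update",
        data=b"<commit/>",
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )
```

Sending the request would be `urllib.request.urlopen(build_commit_request("http://slave1:8983/solr"))`, where the host name is made up. Whether a stock slave reopens its reader on the newest snapshot after such a commit depends on the patch described above, which is not shown here.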
To take this a step further (only an idea yet): I let the slaves work on the real index, as long as I do not optimize. Because the directory structure is not changing as long as I do not optimize, I can send commits to the slaves.
Before I optimize I take a snapshot, send them a special "commit" to make them fall back to the most recent snapshot dir, optimize the index, and send them a real commit when done.
Even though it is a little trickier, I would be more up to date with the query slaves.
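
The ordering of the optimize cycle described above can be sketched as follows. Everything here is hypothetical: `SlaveView`, `fallback_commit`, and `optimize_cycle` are illustrative names, not Solr APIs, and the callables stand in for whatever actually takes the snapshot and runs the optimize.

```python
class SlaveView:
    """Tracks which directory a query slave reads from (illustration only)."""

    def __init__(self):
        self.source = "index"  # slaves normally read the live index

    def commit(self):
        """A real commit: reopen the reader on the live index."""
        self.source = "index"

    def fallback_commit(self, snapshot_name):
        """The special "commit": reopen the reader on the given snapshot."""
        self.source = snapshot_name


def optimize_cycle(slaves, take_snapshot, optimize):
    """Run one optimize while keeping slaves off the changing index.

    take_snapshot: callable returning the new snapshot's name (hypothetical).
    optimize: callable that optimizes the master index (hypothetical).
    """
    # 1. Freeze a consistent copy of the index.
    snapshot_name = take_snapshot()
    # 2. Point every slave at the snapshot before the index files change.
    for slave in slaves:
        slave.fallback_commit(snapshot_name)
    # 3. Optimize while no slave reads the live index.
    optimize()
    # 4. Bring the slaves back to the freshly optimized live index.
    for slave in slaves:
        slave.commit()
    return snapshot_name
```

The point of the sketch is only the ordering: no slave reads the live index between steps 2 and 4. It does not address failure handling, for example a slave that misses the fallback commit.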
So if you have any design comments or see major or minor flaws, feedback would be very welcome.
I do not use live data yet; this is the experimental stage. But I'll give feedback on how it performs and what issues I run into. There's also the faint chance of letting this setup (or a "fixed" one) run on the real user data, which would be roughly 20TB of usable data for indexing. This would be really interesting :-)

Have a nice week
Nico


--------------------------
Grant Ingersoll
http://www.lucidimagination.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
