eks, This is clear as day - you're using Winblows! Kidding.
I'd: * watch IO with something like vmstat 2 and see if the rate drops correlate to increased disk IO or IO wait time * monitor the DB from which you were pulling the data - maybe the DB or the server that runs it had issues * monitor the network over which you pull data from DB If none of the above reveals the problem I'd still: * grab all data you need to index and copy it locally * index everything locally Out of curiosity, how big is your ramBufferSizeMB and your -Xmx? And on that 8-core box you have ~8 indexing threads going? Otis ---- Sematext is Hiring -- http://sematext.com/about/jobs.html >________________________________ >From: eks dev <eks...@yahoo.co.uk> >To: solr-user <solr-user@lucene.apache.org> >Sent: Saturday, September 24, 2011 3:18 PM >Subject: Update ingest rate drops suddenly > >just looking for hints where to look for... > >We were testing single threaded ingest rate on solr, trunk version on >atypical collection (a lot of small documents), and we noticed >something we are not able to explain. > >Setup: >We use defaults for index settings, windows 64 bit, jdk 7 U2. on SSD, >machine with enough memory and 8 cores. Schema has 5 stored fields, >4 of them indexed no positions no norms. >Average net document size (optimized index size / number of documents) >is around 100 bytes. > >On a test with 40 Mio document: >- we had update ingest rate on first 4,4Mio documents @ incredible >34k records / second... >- then it dropped, suddenly to 20k records per second and this rate >remained stable (variance 1k) until... >- we hit 13Mio, where ingest rate dropped again really hard, from one >instant in time to another to 10k records per second. > >it stayed there until we reached the end @40Mio (slightly reducing, to >ca 9k, but this is not long enough to see trend). > >Nothing unusual happening with jvm memory ( tooth-saw 200- 450M fully >regular). CPU in turn was following the ingest rate trend, inicating >that we were waiting on something. No searches , no commits, nothing. > >autoCommit was turned off. Updates were streaming directly from the database. > >----- >I did not expect something like this, knowing lucene merges in >background. Also, having such sudden drops in ingest rate is >indicative that we are not leaking something. (drop would have been >much more gradual). It is some caches, but why two really significant >drops? 33k/sec to 20k and than to 10k... We would love to keep it @34 >k/second :) > >I am not really acquainted with the new MergePolicy and flushing >settings, but I suspect this is something there we could tweak. > >Could it be windows is somehow, hmm, quirky with solr default >directory on win64/jvm (I think it is MMAP by default)... We did not >saturate IO with such a small documents I guess, It is a just couple >of Gig over 1-2 hours. > >All in all, it works good, but is having such hard update ingest rate >drops normal? > >Thanks, >eks. > > >