Thanks. And on Windows?

2008/1/18, Otis Gospodnetic <[EMAIL PROTECTED]>:
>
> Paging - quite possible.  Run vmstat 2' on that server when its slow and
> see if there is any paging (look at the 'si' and 'so' columns).
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> ----- Original Message ----
> From: Geert-Jan Brits <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Thursday, January 17, 2008 6:10:56 AM
> Subject: Re: batch indexing takes more time than shown on SOLR output -->
> something to do with IO?
>
> No Solr-commit is sent until the end.
> Since client and server on this moment are on the same machine network
> IO
> should be small as well I think. Also as you mentioned the response is
> very
> small so it can't be that either.
>
> As to what IO-activity I was thinking about: I was merely guessing
> here, but
> I thought that maybe the creation of indices for indexed fields were
> not
> accounted for in the supplied number. Which is something I can't
> imagine,
> but still with these numbers my head makes all sorts of strange
> scenario's
> ;-)
>
> After checking some other machine stats while doing a big update I
> think I
> know what's going on (please correct me if it doesn't sound plausible):
> client and server (on the same machine with 2GB RAM) are causing
> excessive
> page swapping (on the same disk, yeah I know, I must get a different
> setup)
> which causes SOLR-server to have difficulties with creating its indices
> on
> disk. I think this is going on since all goes pretty good (no big
> discrepencies) until Ram gets's more or less exhausted.
>
> could this be it? I'm going to test with 2 machines I guess.
>
> Thanks,
> Geert-Jan
>
> 2008/1/17, Chris Hostetter <[EMAIL PROTECTED]>:
> >
> > : INFO: {add=[10485, 10488, 10489, 10490, 10491, 10495, 10497, 10498,
> > ...(42
> > : more)
> > : ]} 0 875
> > :
> > : However, when timing this instruction on the client-side (I use
> SOlrJ
> > -->
> > : req.process(server)) I get totally different numbers (in the
> beginning
> > the
> > : client-side measured time is about 2 seconds on average but after
> some
> > time
> > : this time goes up to about 30-40 seconds, altough the
> solr-outputted
> > time
> > : stays between 0.8-1.3 seconds?
> >
> > as Otis mentioned, that time is the raw processing of the request,
> not
> > counting any network IO between the client and the server, or any
> time
> > spent by the "ResponseWriter" formating the response.  you can get
> more
> > accurate numbers about exctly how long the server spent doing all of
> these
> > things from the access log of your servlet container (which should be
> > recording the time only after every last byte is written back to the
> > client.
> >
> > that said: there's really no reason for as big a descrepency as you
> are
> > describing particularly on updates where the ResposneWriter has
> almost
> > nothing to do (30-40 seconds per update?!?!?!)
> >
> > I'm not very familiar with SolrJ, but are you by any chance using it
> in a
> > way that sends a commit after every update command?  (commits can get
> > successifly longer as your index gets bigger.)
> >
> > : Does this have anything to do with costly IO-activity that is
> accounted
> > for
> > : in the SOLR output? If this is true, what tool do you recommend
> using to
> > : monitor IO-activity?
> >
> > Which IO-activity are you talking about?
> >
> >
> >
> >
> > -Hoss
> >
> >
>
>
>
>

Reply via email to