What we're looking for is a way to inject *without* using
curl, or wget, or any other http-based communication.  We'd
like for the HTTP daemon to only handle search requests, not
indexing requests on top of them.

Plus, I have to believe there's a faster way to get documents
into solr/lucene than using curl....

_________________________________________________________________
david whalen
senior applications developer
eNR Services, Inc.
[EMAIL PROTECTED]
203-849-7240
  

> -----Original Message-----
> From: Clay Webster [mailto:[EMAIL PROTECTED] 
> Sent: Thursday, August 09, 2007 11:43 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Any clever ideas to inject into solr? Without http?
> 
> Condensing the loader into a single executable sounds right 
> if you have performance problems. ;-)
> 
> You could also try adding multiple <doc>s in a single post if 
> you notice your problems are with tcp setup time, though if 
> you're doing localhost connections that should be minimal.
> 
> If you're already local to the solr server, you might check 
> out the CSV slurper. http://wiki.apache.org/solr/UpdateCSV  
> It's a little specialized.
> 
> And then there's of course the question of "are you doing 
> full re-indexing or incremental indexing of changes?"
> 
> --cw
> 
> 
> On 8/9/07, Kevin Holmes <[EMAIL PROTECTED]> wrote:
> >
> > I inherited an existing (working) solr indexing script that 
> runs like
> > this:
> >
> >
> >
> > Python script queries the mysql DB then calls bash script
> >
> > Bash script performs a curl POST submit to solr
> >
> >
> >
> > We're injecting about 1000 records / minute (constantly), 
> frequently 
> > pushing the edge of our CPU / RAM limitations.
> >
> >
> >
> > I'm in the process of building a Perl script to use DBI and 
> > lwp::simple::post that will perform this all from a single script 
> > (instead of 3).
> >
> >
> >
> > Two specific questions
> >
> > 1: Does anyone have a clever (or better) way to perform 
> this process 
> > efficiently?
> >
> >
> >
> > 2: Is there a way to inject into solr without using POST / 
> curl / http?
> >
> >
> >
> > Admittedly, I'm no solr expert - I'm starting from someone else's 
> > setup, trying to reverse-engineer my way out.  Any input would be 
> > greatly appreciated.
> >
> >
> 

Reply via email to