It *just happens* that I wrote a blog on this very topic, see:
http://www.lucidimagination.com/blog/2012/02/14/indexing-with-solrj/

That code contains two rather different methods, one that indexes
based on a SQL database and one based on indexing random files
with client-side Tika.

Best
Erick

On Wed, Feb 22, 2012 at 8:51 PM, Mike O'Leary <tmole...@uw.edu> wrote:
> Could you point me to the most non-intimidating introduction to SolrJ that 
> you know of? I have a passing familiarity with Javascript and, with few 
> exceptions, I haven't developing software that has a graphical user interface 
> of any kind in about 25 years. I like the idea of having finer control over 
> data imported from a database though.
> Thanks,
> Mike
>
> -----Original Message-----
> From: Erick Erickson [mailto:erickerick...@gmail.com]
> Sent: Monday, February 13, 2012 6:19 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Recovering from database connection resets in DataimportHandler
>
> I'd seriously consider using SolrJ and your favorite JDBC driver instead. 
> It's actually quite easy to create one, although as always it may be a bit 
> intimidating to get started. This allows you much finer control over error  
> conditions than DIH does, so may be more suited to your needs.
>
> Best
> Erick
>
> On Sat, Feb 11, 2012 at 2:40 AM, Mike O'Leary <tmole...@uw.edu> wrote:
>> I am trying to use Solr's DataImportHandler to index a large number of 
>> database records in a SQL Server database that is owned and managed by a 
>> group we are collaborating with. The indexing jobs I have run so far, except 
>> for the initial very small test runs, have failed due to database connection 
>> resets. I have gotten indexing jobs to go further by using 
>> CachedSqlEntityProcessor and specifying responseBuffering=adaptive in the 
>> connection url, but I think in order to index that data I'm going to have to 
>> work out how to catch database connection reset exceptions and resubmit the 
>> queries that failed. Can anyone can suggest a good way to approach this? Or 
>> have any of you encountered this problem and worked out a solution to it 
>> already?
>> Thanks,
>> Mike

Reply via email to