Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-24 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Fri, Jul 24, 2009 at 2:04 PM, Chantal Ackermann wrote:
> oh, I didn't know that. That did it! (Jippieh!)
> I've pasted the relevant code below in case you're interested.
>
> Thank you for your help!!
>
> Another little observation that I made using the debugger interface:
> it threw an exception

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Glen Newton
doop, HBase, UIMA, NLP, NER, IR > > > > - Original Message >> From: Glen Newton >> To: solr-user@lucene.apache.org >> Sent: Thursday, July 23, 2009 5:52:43 AM >> Subject: Re: DataImportHandler / Import from DB : one data set comes in >> multiple rows

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Otis Gospodnetic
text is hiring: http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message > From: Glen Newton > To: solr-user@lucene.apache.org > Sent: Thursday, July 23, 2009 5:52:43 AM > Subject: Re: DataImportHandler / Import

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Chantal Ackermann
Hi Paul, no, I didn't return the unique key, though there is one defined. I added that to the nextRow() implementation, and I am now returning it as part of the map. But it is still not creating any documents, and now that I can see the ID I have realized that it is always processing the sam

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Noble Paul നോബിള്‍ नोब्ळ्
Is there a <uniqueKey> in your schema? Are you returning a value corresponding to that key name? You could probably paste the whole data-config.xml.
On Thu, Jul 23, 2009 at 4:59 PM, Chantal Ackermann wrote:
> Hi Paul, hi Glen, hi all,
>
> thank you for your answers.
>
> I have followed Paul's solution (as I

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Chantal Ackermann
Hi Paul, hi Glen, hi all, thank you for your answers. I have followed Paul's solution (as I received it earlier). (I'll keep your suggestion in mind, though, Glen.) It looks good, except that it's not creating any documents... ;-) It is most probably some misunderstanding on my side, and mayb

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-23 Thread Glen Newton
Chantal, You might consider LuSql[1]. It has much better performance than Solr DIH. It runs 4-10 times faster on a multicore machine, and can run in 1/20th the heap size Solr needs. It produces a Lucene index. See slides 22-25 in this presentation comparing Solr DIH with LuSql: http://code4lib.o

Re: DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
Alternatively, you can write your own EntityProcessor and just override nextRow(). I guess you can still use the JdbcDataSource.
On Wed, Jul 22, 2009 at 10:05 PM, Chantal Ackermann wrote:
> Hi all,
>
> this is my first post, as I am new to SOLR (some Lucene exp).
>
> I am trying to load data fro
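[Editor's sketch, not part of the original thread] The suggestion above, overriding nextRow() so that several database rows become one document, might look roughly like the following. This is plain Java with no Solr dependency, so it only illustrates the merging logic that such a nextRow() could delegate to. The class name RowMerger and the column names passed to it are hypothetical, and the sketch assumes the rows arrive sorted by the ID column (for example via ORDER BY in the query) and that returning null signals the end of the data.

```java
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;

/**
 * Hypothetical helper: folds consecutive rows that share the same ID
 * into a single map, one map per logical data set.
 * Assumes the input rows are already sorted by the ID column.
 */
public class RowMerger {
    private final Iterator<Map<String, Object>> rows;
    private final String idColumn;
    private final String nameColumn;
    private final String valueColumn;
    // Lookahead: the first row of the next group, or null if none is buffered.
    private Map<String, Object> pending;

    public RowMerger(Iterator<Map<String, Object>> rows,
                     String idColumn, String nameColumn, String valueColumn) {
        this.rows = rows;
        this.idColumn = idColumn;
        this.nameColumn = nameColumn;
        this.valueColumn = valueColumn;
    }

    /** Returns the next merged document, or null when the input is exhausted. */
    public Map<String, Object> nextRow() {
        if (pending == null) {
            if (!rows.hasNext()) {
                return null; // no more data
            }
            pending = rows.next();
        }
        Object id = pending.get(idColumn);
        Map<String, Object> doc = new LinkedHashMap<>();
        doc.put(idColumn, id); // always return the unique key with the document
        // Consume every consecutive row that belongs to the same ID.
        while (pending != null && Objects.equals(id, pending.get(idColumn))) {
            doc.put(String.valueOf(pending.get(nameColumn)), pending.get(valueColumn));
            pending = rows.hasNext() ? rows.next() : null;
        }
        return doc;
    }
}
```

Note that the merged map always carries the ID column, which matches the later point in this thread about returning a value for the unique key. If the rows are not sorted by ID, the same ID would come back as several separate documents.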

DataImportHandler / Import from DB : one data set comes in multiple rows

2009-07-22 Thread Chantal Ackermann
Hi all, this is my first post, as I am new to SOLR (some Lucene exp). I am trying to load data from an existing datamart into SOLR using the DataImportHandler but in my opinion it is too slow due to the special structure of the datamart I have to use. Root Cause: This datamart uses a row bas