Re: Indexing Best Practice

2011-04-12 Thread Darx Oman
Hi Lance thanx for your reply, but I have a question is this patch committed to trunk?

Re: Indexing Best Practice

2011-04-11 Thread Lance Norskog
SOLR-1499 is a plug-in for the DIH that uses Solr as a DataSource. This means that you can read the database and PDFs separately. You could index all of the PDF content in one DIH script. Then, when there's a database update, you have a separate DIH scripts that reads the old row from Solr, and pul

Re: Indexing Best Practice

2011-04-11 Thread Shaun Campbell
If it's of any help I've split the processing of PDF files from the indexing. I put the PDF content into a text file (but I guess you could load it into a database) and use that as part of the indexing. My processing of the PDF files also compares timestamps on the document and the text file so th

Indexing Best Practice

2011-04-10 Thread Darx Oman
Hi guys I'm wondering how to best configure solr to fulfills my requirements. I'm indexing data from 2 data sources: 1- Database 2- PDF files (password encrypted) Every file has related information stored in the database. Both the file content and the related database fields must be indexed as

Re: looking for multilanguage indexing best practice/hint

2008-12-21 Thread Julian Davchev
>>>> Regards >>>> Sujatha >>>> >>>> >>>> >>>> >>>> On 12/18/08, Feak, Todd wrote: >>>> >>>> >>>>> Don't forget to consider scaling concerns (if there are any)

Re: looking for multilanguage indexing best practice/hint

2008-12-19 Thread Sujatha Arun
gt; > >> > >> > >> > >> On 12/18/08, Feak, Todd wrote: > >> > >>> Don't forget to consider scaling concerns (if there are any). There are > >>> strong differences in the number of searches we receive for each > >>> lang

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Julian Davchev
>>> if we needed to. We see 2 orders of magnitude difference between our >>> most popular language and our least popular. >>> >>> -Todd Feak >>> >>> -Original Message- >>> From: Julian Davchev [mailto:j...@drun.net] >>> Sen

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Chris Hostetter
: Subject: looking for multilanguage indexing best practice/hint : References: <49483388.8030...@drun.net> : <502b8706-828b-4eaa-886d-af0dccf37...@stylesight.com> : <8c0c601f0812170825j766cf005i9546b2604a19f...@mail.gmail.com

RE: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Daniel Alheiros
you can pre-define some base query parts and also do score boosting behind the scenes. I hope it helps. Regards, Daniel -Original Message- From: Sujatha Arun [mailto:suja.a...@gmail.com] Sent: 18 December 2008 04:15 To: solr-user@lucene.apache.org Subject: Re: looking for multilanguage

Re: looking for multilanguage indexing best practice/hint

2008-12-18 Thread Erick Erickson
rs of magnitude difference between our > > most popular language and our least popular. > > > > -Todd Feak > > > > -Original Message- > > From: Julian Davchev [mailto:j...@drun.net] > > Sent: Wednesday, December 17, 2008 11:31 AM > > To: solr-u

Re: looking for multilanguage indexing best practice/hint

2008-12-17 Thread Sujatha Arun
of magnitude difference between our > most popular language and our least popular. > > -Todd Feak > > -Original Message- > From: Julian Davchev [mailto:j...@drun.net] > Sent: Wednesday, December 17, 2008 11:31 AM > To: solr-user@lucene.apache.org > Subject: loo

RE: looking for multilanguage indexing best practice/hint

2008-12-17 Thread Feak, Todd
guage indexing best practice/hint Hi, >From my study on solr and lucene so far it seems that I will use single scheme.at least don't see scenario where I'd need more than that. So question is how do I approach multilanguage indexing and multilang searching. Will it really make sense for j

Re: looking for multilanguage indexing best practice/hint

2008-12-17 Thread Alexander Ramos Jardim
I think this is up to your needs. If you will make one search in many languages, and your doc's won't get too big, you can put all the data in one schema.xml and configure your field types by a language basis. 2008/12/17 Julian Davchev > Hi, > From my study on solr and lucene so far it seems t

looking for multilanguage indexing best practice/hint

2008-12-17 Thread Julian Davchev
Hi, >From my study on solr and lucene so far it seems that I will use single scheme.at least don't see scenario where I'd need more than that. So question is how do I approach multilanguage indexing and multilang searching. Will it really make sense for just searching word..or rather I should s