Hi Lance
thanx for your reply, but I have a question
is this patch committed to trunk?
SOLR-1499 is a plug-in for the DIH that uses Solr as a DataSource.
This means that you can read the database and PDFs separately. You
could index all of the PDF content in one DIH script. Then, when
there's a database update, you have a separate DIH scripts that reads
the old row from Solr, and pul
If it's of any help I've split the processing of PDF files from the
indexing. I put the PDF content into a text file (but I guess you could load
it into a database) and use that as part of the indexing. My processing of
the PDF files also compares timestamps on the document and the text file so
th
Hi guys
I'm wondering how to best configure solr to fulfills my requirements.
I'm indexing data from 2 data sources:
1- Database
2- PDF files (password encrypted)
Every file has related information stored in the database. Both the file
content and the related database fields must be indexed as
>>>> Regards
>>>> Sujatha
>>>>
>>>>
>>>>
>>>>
>>>> On 12/18/08, Feak, Todd wrote:
>>>>
>>>>
>>>>> Don't forget to consider scaling concerns (if there are any)
gt;
> >>
> >>
> >>
> >> On 12/18/08, Feak, Todd wrote:
> >>
> >>> Don't forget to consider scaling concerns (if there are any). There are
> >>> strong differences in the number of searches we receive for each
> >>> lang
>>> if we needed to. We see 2 orders of magnitude difference between our
>>> most popular language and our least popular.
>>>
>>> -Todd Feak
>>>
>>> -Original Message-
>>> From: Julian Davchev [mailto:j...@drun.net]
>>> Sen
: Subject: looking for multilanguage indexing best practice/hint
: References: <49483388.8030...@drun.net>
: <502b8706-828b-4eaa-886d-af0dccf37...@stylesight.com>
: <8c0c601f0812170825j766cf005i9546b2604a19f...@mail.gmail.com
you can pre-define some base query parts and also do score boosting
behind the scenes.
I hope it helps.
Regards,
Daniel
-Original Message-
From: Sujatha Arun [mailto:suja.a...@gmail.com]
Sent: 18 December 2008 04:15
To: solr-user@lucene.apache.org
Subject: Re: looking for multilanguage
rs of magnitude difference between our
> > most popular language and our least popular.
> >
> > -Todd Feak
> >
> > -Original Message-
> > From: Julian Davchev [mailto:j...@drun.net]
> > Sent: Wednesday, December 17, 2008 11:31 AM
> > To: solr-u
of magnitude difference between our
> most popular language and our least popular.
>
> -Todd Feak
>
> -Original Message-
> From: Julian Davchev [mailto:j...@drun.net]
> Sent: Wednesday, December 17, 2008 11:31 AM
> To: solr-user@lucene.apache.org
> Subject: loo
guage indexing best practice/hint
Hi,
>From my study on solr and lucene so far it seems that I will use single
scheme.at least don't see scenario where I'd need more than that.
So question is how do I approach multilanguage indexing and multilang
searching. Will it really make sense for j
I think this is up to your needs.
If you will make one search in many languages, and your doc's won't get too
big, you can put all the data in one schema.xml and configure your field
types by a language basis.
2008/12/17 Julian Davchev
> Hi,
> From my study on solr and lucene so far it seems t
Hi,
>From my study on solr and lucene so far it seems that I will use single
scheme.at least don't see scenario where I'd need more than that.
So question is how do I approach multilanguage indexing and multilang
searching. Will it really make sense for just searching word..or rather
I should s
14 matches
Mail list logo