Re: Simple Faceted Searching out of the box

Tim Archambault Fri, 22 Sep 2006 13:16:08 -0700

Okay. We are all on the same page. I just don't express myself as well in
"programming speak" yet.


I'm going to read up on Otis' "Lucene in Action" tonight. I'd swear he had
an example of how to inject records into a lucene index using java and sql.
Maybe I'm wrong though.



On 9/22/06, Walter Underwood <[EMAIL PROTECTED]> wrote:

Sorry, I was not being exact with "store". Lucene has separate
control over whether the value of a field is stored and whether
it is indexed. The term "nurse" might be searchable, but the
only value that is stored in the index for retrieval is the
database key for each matching job.

It seems like text search should be easy to add to a transactional
database, but lots of smart people have tried to make that work
and failed. Maybe it is possible, but neither Oracle nor Microsoft
nor the open source community have been able to make it happen.
The text search in RDBMSs seems to always be slow and lame.

There is one product that does transactional query and text
search: MarkLogic. It does a good job of both, but it is very
XML-centric. It might be a good match, if you are into commercial
software. It is a rather different style of programming than
SQL or Lucene. You write XQuery to define the result XML with
the contents fetched from the database.

wunder (not affiliated with MarkLogic)

On 9/22/06 12:42 PM, "Tim Archambault" <[EMAIL PROTECTED]>
wrote:

> I'm really confused. I don't mean "store" the data figuratively as in a
> lucene/solr command. Storing an ID number in a solr index isn't going to
> help a user find "nurse". I think part of this is that some people feel
that
> databases like MSSQL, MYSQL should be able to provide quality search
> experience, but they just flat out don't. It's a separate utility.
>
> Thanks Walter.
>
> On 9/22/06, Walter Underwood <[EMAIL PROTECTED]> wrote:
>>
>> On 9/22/06 12:25 PM, "Tim Archambault" <
[EMAIL PROTECTED]>
>> wrote:
>>
>>> A recruitment (jobs) customer goes onto our website and posts an
online
>> job
>>> posting to our newspaper website. Upon insert into the database, I
need
>> to
>>> generate an xml file to be sent to SOLR to ADD as  a record to the
>> search
>>> engine. Same  goes for an edit, my database updates the record and
then
>> I
>>> have to send an ADD statement to Solr again to commit my change. 2x
the
>>> work.
>>>
>>> I've been talking with other papers about Solr and I think what
bothers
>> many
>>> is that there a is a deposit of information in a structured database
>> here
>>> [named A], then we have another set of basically the same data over
here
>>> [named B] and they don't understand why they have to manage to
different
>>> sets of data [A & B] that are virtually the same thing.
>>
>> The work isn't duplicated. Two servers are building two kinds of index,
>> a transactional record index and a text index. That is two kinds of
>> work, not a duplication.
>>
>> Storing the data is the small part of a database or a search engine.
>> The indexes are the real benefit.
>>
>> In fact, the data does not have to be stored in Solr. You can return a
>> database key as the only field, then get the details from the database.
>> That is  how our current search works -- the search result is a list
>> of keys in relevance order. Period.
>>
>> wunder
>> --
>> Walter Underwood
>> Search Guru, Netflix
>>
>>

Re: Simple Faceted Searching out of the box

Reply via email to