Re: Question About Boosting.

2007-03-11 Thread Walter Underwood
Back up another step. What are the documents and what do you
want to show to the users? Have you tried the default configuration
with real user queries?

After you've tested it with user queries, then look at the
results where the ranking isn't performing well.

Lucene and Solr already automatically boost rare terms over
common terms, using tf.idf weighting.

I posted more detail on this in my blog last summer:

http://wunderwood.org/most_casual_observer/2006/06/good_to_great_search.html

wunder

On 3/10/07 8:04 PM, "shai deljo" <[EMAIL PROTECTED]> wrote:

> I have elements within a field that have different importance.
> I thought boosting would be an elegant way to take this into account.
> Please advise,
> 
> 
> On 3/10/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
>> What are you trying to achieve? Let's start with the problem
>> instead of picking one solution which Solr doesn't support. --wunder
>> 
>> On 3/10/07 5:08 PM, "shai deljo" <[EMAIL PROTECTED]> wrote:
>> 
>>> How can i boost some tokens over others in the same field (at Index
>>> time) ? If this is not supported directly, what's the best way around
>>> this problem (what's the hack to solve this :) ).
>>> Thanks,
>>> Shai
>> 
>> 



Highlighting

2007-03-11 Thread Fuad Efendi
I am wondering why highlighting does not work for me...
Is it possible that id MUST be type="string"? It is
"slong" in my environment, and I don't have any other clue...
Thanks,
Fuad



RE: Highlighting

2007-03-11 Thread Fuad Efendi
Suddenly it works now...

I forgot "query a field", instead of q=AMD I need to use q=item_name:AMD
...

-Original Message-
From: Fuad Efendi 
Sent: Sunday, March 11, 2007 3:18 PM
To: solr-user@lucene.apache.org
Subject: Highlighting


I am wondering why highlighting does not work for me...
Is it possible that id MUST be type="string"? It is
"slong" in my environment, and I don't have any other clue...
Thanks,
Fuad





Re: Question About Boosting.

2007-03-11 Thread shai deljo

Thanks,
The only way i found to do this
(http://www.mail-archive.com/solr-user@lucene.apache.org/msg02456.html)
is to hack and repeat the word several times in the field, but
doesn't this screw up the norms?
Also, how do i boost words in a query? e.g. q=key1 key2 and i know
key2 is twice as important than key1 ? (searching 1 field).
Thanks,
S.

On 3/11/07, Walter Underwood <[EMAIL PROTECTED]> wrote:

Back up another step. What are the documents and what do you
want to show to the users? Have you tried the default configuration
with real user queries?

After you've tested it with user queries, then look at the
results where the ranking isn't performing well.

Lucene and Solr already automatically boost rare terms over
common terms, using tf.idf weighting.

I posted more detail on this in my blog last summer:

http://wunderwood.org/most_casual_observer/2006/06/good_to_great_search.html

wunder

On 3/10/07 8:04 PM, "shai deljo" <[EMAIL PROTECTED]> wrote:

> I have elements within a field that have different importance.
> I thought boosting would be an elegant way to take this into account.
> Please advise,
>
>
> On 3/10/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
>> What are you trying to achieve? Let's start with the problem
>> instead of picking one solution which Solr doesn't support. --wunder
>>
>> On 3/10/07 5:08 PM, "shai deljo" <[EMAIL PROTECTED]> wrote:
>>
>>> How can i boost some tokens over others in the same field (at Index
>>> time) ? If this is not supported directly, what's the best way around
>>> this problem (what's the hack to solve this :) ).
>>> Thanks,
>>> Shai
>>
>>




RE: Searching in multiple fields

2007-03-11 Thread netaji . k
Yes i tried it and it worked. thank you




> Well,
>
> AND will find documents that have php in all fields but OR will find
> documents that have php in any field so you can try the same query with
> all ORs.
>
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> Sent: 09 March 2007 10:42
> To: solr-user@lucene.apache.org
> Subject: RE: Searching in multiple fields
>
>
> Description:php AND title:php AND Subject:php AND notes:php
>
> this one did not work
>
> i will try the other way too and see
>
> thanks
> aditya
>
>
>
>> Well,
>>
>> One way you can do this is:
>>
>> "Description:php AND title:php AND Subject:php AND notes:php"
>>
>> The other option is to create an extra field in your index called All
>> (or whatever) that is a concatenation of these fields with a big slop
>> between the four parts of this field and then you can search that
>> field using a number smaller than the slop.
>>
>> Sachin
>>
>> -Original Message-
>> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
>> Sent: 09 March 2007 10:29
>> To: solr-user@lucene.apache.org
>> Subject: Searching in multiple fields
>>
>> Hello,
>>
>> We have 4 fields in our XML file --- Description, title,
> Subject,notes.
>> I need to search in all these fields, how do I do that?.
>>
>>
>> For Example
>> When I search Description: php, we are able to get records that are
>> matching only with description field. But I want those records in
>> which title and subject and notes also has php.
>>
>> please advise on this
>>
>>
>> regards,
>> aditya
>>
>>
>>
>> This message has been scanned for viruses by MailControl - (see
>> http://bluepages.wsatkins.co.uk/?6875772)
>>
>>
>> This email and any attached files are confidential and copyright
>> protected. If you are not the addressee, any dissemination of this
>> communication is strictly prohibited. Unless otherwise expressly
>> agreed in writing, nothing stated in this communication shall be
> legally binding.
>>
>> The ultimate parent company of the Atkins Group is WS Atkins plc.
>> Registered in England No. 1885586.  Registered Office Woodcote Grove,
>> Ashley Road, Epsom, Surrey KT18 5BW.
>>
>> Consider the environment. Please don't print this e-mail unless you
>> really need to.
>>
>
>
>




Re: Question About Boosting.

2007-03-11 Thread Mike Klaas

On 3/11/07, shai deljo <[EMAIL PROTECTED]> wrote:

Thanks,
The only way i found to do this
(http://www.mail-archive.com/solr-user@lucene.apache.org/msg02456.html)
 is to hack and repeat the word several times in the field, but
doesn't this screw up the norms?


Yes, it can influence the norms.


Also, how do i boost words in a query? e.g. q=key1 key2 and i know
key2 is twice as important than key1 ? (searching 1 field).


q=key1 key2^2

If the keywords that have more importance are the same for every
document, query-time boosting is by far the more preferable route.
You have much more flexibility and it isn't  less performant.

There are some things which are elegantly solved using index-time
boosting, and so it is likely that lucene will support it one day.

-Mike