Re: Keyword Density

Walter Underwood Mon, 01 Jun 2009 12:27:39 -0700

That is the normal relevance scoring formula in Solr and Lucene.
It is a bit fancier than that, but you don't have to do anything
special to get that behavior.


Solr also uses the inverse document frequency (rarity) of each
word for weighting.

Look up "tf.idf" for more info.
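
For illustration, here is a minimal textbook-style sketch of the two
ideas in this thread: the keyword density ratio Alex describes, and a
tf.idf weight that multiplies that ratio by a smoothed inverse document
frequency. This is not Solr's or Lucene's actual scoring code. The
function names, the sample documents, and the 0.4 threshold are all
made up for the example, and the real formula has more factors.

```python
import math

def keyword_density(term, tokens):
    """Occurrences of term divided by the total number of tokens."""
    if not tokens:
        return 0.0
    return tokens.count(term) / len(tokens)

def idf(term, docs):
    """Smoothed inverse document frequency: log(N / (1 + df)) + 1.

    This is an assumed textbook variant; rarer terms get larger values.
    """
    df = sum(1 for tokens in docs if term in tokens)
    return math.log(len(docs) / (1 + df)) + 1.0

def tf_idf(term, tokens, docs):
    """Density-based term frequency weighted by rarity across docs."""
    return keyword_density(term, tokens) * idf(term, docs)

# Hypothetical tokenized documents, for illustration only.
docs = [
    "foo bar foo baz".split(),
    "bar baz qux quux".split(),
    "foo qux".split(),
]

# The filter asked about in the thread: keep documents whose density
# for "foo" exceeds an arbitrary threshold.
dense = [i for i, d in enumerate(docs) if keyword_density("foo", d) > 0.4]
```

A hard density cutoff like the last line is a filter, while tf.idf is a
ranking weight, so the two are related but not interchangeable.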

wunder

On 6/1/09 11:46 AM, "Alex Shevchenko" <[email protected]> wrote:

> Something like that. Just not '> N times' but '<number of times foo
> appears>/<total number of words> > <some value>'
> 
> On Mon, Jun 1, 2009 at 21:00, Otis Gospodnetic
> <[email protected]> wrote:
> 
>> 
>> Hi Alex,
>> 
>> Could you please provide an example of this?  Are you looking to do
>> something like "find all docs that match name:foo and where foo appears > N
>> times (in the name field) in the matching document"?
>> 
>>  Otis
>> --
>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>> 
>> 
>> 
>> ----- Original Message ----
>>> From: Alex Shevchenko <[email protected]>
>>> To: [email protected]
>>> Sent: Monday, June 1, 2009 1:32:49 PM
>>> Subject: Re: Keyword Density
>>> 
>>> Hi All,
>>> 
>>> Is there a way to perform filtering based on keyword density?
>>> 
>>> Thanks
>>> 
>>> --
>>> Alex Shevchenko
