I use the dismax handler for my phrase matching, and I have 'mm' set
this way:
Up to 3 words, match all
up to 4, match 3
up to 4, match 3 & so on
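
For reference, a rough Python sketch of how an 'mm' spec turns a clause count into a required number of matches. The spec string `3<-1` is only an assumed example that would produce the behaviour listed above (it is not taken from the poster's config), the helper names are made up, and the parsing is a simplified reading of the conditional syntax described on the Solr wiki (no spaces around `<`).

```python
def _simple(total, expr):
    """Apply one unconditional mm expression such as '3', '-1', '75%' or '-25%'."""
    if expr.endswith("%"):
        calc = total * int(expr[:-1]) / 100.0
        # Negative percentages mean "all but this share"; fractions are truncated.
        result = total + int(calc) if calc < 0 else int(calc)
    else:
        calc = int(expr)
        result = total + calc if calc < 0 else calc
    return max(0, min(total, result))


def required_matches(total, spec):
    """Return how many of `total` optional clauses must match under `spec`."""
    spec = spec.strip()
    if "<" not in spec:
        return _simple(total, spec)
    result = total  # at or below the first bound, every clause is required
    for part in spec.split():
        bound, expr = part.split("<", 1)
        if total <= int(bound):
            return result
        result = _simple(total, expr)
    return result


for n in range(1, 7):
    print(n, "words ->", required_matches(n, "3<-1"), "must match")
```

With this assumed spec, queries of up to 3 words require every word, and longer queries may miss one.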
It's been working fine, but for certain phrases like 'san diego drunk driving
defense attorney', it brings up DUI attorneys for other cities first
ament from all
states are returned although the user specifically asked for a Florida will. I
don't want to alter 'mm' either, because it's working fine for other
searches. Just for search terms with a 'location', I want to be able to
match all words. Any easy way to do t
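
One possible approach, sketched in Python under some loud assumptions: keep the configured 'mm' as the default and send `mm=100%` as a request parameter only when the query contains a known location. The `KNOWN_LOCATIONS` set and the `build_params` helper are hypothetical, and this relies on 'mm' not being fixed as an invariant in the handler config.

```python
from urllib.parse import urlencode

# Hypothetical list; in practice this would come from the site's own location data.
KNOWN_LOCATIONS = {"san diego", "florida", "california"}


def build_params(query):
    """Build request parameters, forcing 'match all' only for location queries."""
    params = {"q": query}
    lowered = query.lower()
    if any(location in lowered for location in KNOWN_LOCATIONS):
        params["mm"] = "100%"  # override the default mm for this request only
    return params


print(urlencode(build_params("san diego drunk driving defense attorney")))
print(urlencode(build_params("child custody")))
```

Queries without a location keep whatever 'mm' the handler already defines, so the other searches are unaffected.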
From the Solr wiki, it sounded like if qs is set to 5, for example, and the
search term is 'child custody', only docs with 'child' & 'custody' within 5
words of one another would be returned in results. Is this correct? If so,
it doesn't seem to be working for me. I see docs with 'child' & 'custo
Somebody please help clear up this doubt. What more could I do with the dismax
handler so that results in which 'word1', 'word2', 'word3', etc. of a search
phrase are not within 5 words of one another do not come up in the
results?
anuvenk wrote:
>
occurrence of
exchange (not related to 1031 exchange though). They were not within 5 words
of one another. My guys have been hammering me regarding this relevancy issue.
Someone please help.
anuvenk wrote:
>
> From the solr wiki, it sounded like if qs is set to 5 for example, & if
> the s
north & dakota
somewhere. I thought if there is a way to just not return any results if
they are not within close proximity, it would be great.
Yonik Seeley wrote:
>
> On Sun, Nov 23, 2008 at 11:51 PM, anuvenk <[EMAIL PROTECTED]>
> wrote:
>> Please help someone...i
Hello,
I'm using the dismax handler for the phrase matching. I have a few legal
resources in my index in the following format for example
title      state
dui faq1   california
dui faq2   florida
dui faq3   federal
. Any other thoughts?
Shalin Shekhar Mangar wrote:
>
> On Tue, Jun 2, 2009 at 12:53 AM, anuvenk wrote:
>
>>
>> title state
>>
>> dui faq1 california
>> dui faq2 florida
>> dui faq3
In my index I have legal FAQs, forms, legal videos, etc., with a state field for
each resource.
Now if I search for 'real estate san diego', I want to be able to return other
'california' results, i.e., results from San Francisco.
I have the following fields in the index
title
data over and over again.
anuvenk wrote:
>
> In my index i have legal faqs, forms, legal videos etc with a state field
> for each resource.
> Now if i search for real estate san diego, I want to be able to return
> other 'california' results i.e results from san francisco.
>
uses synonyms, a big
> list that sometimes could bring out some wrong match as the one
> noticed by Anuvenk
> for instance
>
>> dui => drunk driving defense
>> or
>> dui,drunk driving defense,drunk driving law
>> query for "dui" matches "dui =>
false
1
Thanks.
Otis Gospodnetic wrote:
>
>
> Hello,
>
> 300K is a pretty small index. I wouldn't worry about the number of
> synonyms unless you are turning a single term into dozens of ORed terms.
>
> Otis
> --
>
A small addition to my earlier post. I wonder if it's because of the 'mm'
param, which requires that for search phrases of up to 3 words, all the words
must be matched. If I alter this now, I'd get irrelevant results for a
lot of popular 1, 2, 3 word search terms. How to solve for
I've posted a few questions on synonyms before and finally understood how they
work and settled on index-time synonyms. They seem to work much better than
query-time synonyms. But now at my work, they have a special request. They
want certain synonyms to be applied only to certain sections of the ind
That's right. Simple. I can very well do that. Why didn't I think of it?
Thanks.
rswart wrote:
>
> What is stopping you from defining different field types for faqs and
> attorneys? One with index time synonyms and one without.
>
>
>
> anuvenk wrote:
>&g
Could you elaborate on what the tie param does? I did read the definition in
the Solr wiki, but it is still not crystal clear.
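
As a small illustration of the tie param: a dismax clause scores a document as the best per-field score plus `tie` times the sum of the other field scores, which is what the "max plus 0.01 times others" line in debug output refers to. The function name and the sample numbers below are made up for the sketch.

```python
def dismax_score(field_scores, tie=0.0):
    """Combine per-field scores the way a disjunction-max clause does."""
    if not field_scores:
        return 0.0
    best = max(field_scores)
    others = sum(field_scores) - best
    return best + tie * others


scores = [6.0, 1.0]  # hypothetical scores of one term in two fields
print(dismax_score(scores, tie=0.0))   # pure max
print(dismax_score(scores, tie=0.01))  # max plus a small share of the rest
print(dismax_score(scores, tie=1.0))   # plain sum of all field scores
```

So `tie=0.0` behaves as a pure max and `tie=1.0` as a plain sum, with values in between blending the two.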
Mike Klaas wrote:
>
>
> On 4-Jan-08, at 1:12 PM, s d wrote:
>
>> but i want to sum the scores and not use max, can i still do it
>> with the
>> DisMax? am i missing anythin
Is parsedquery_ToString the query that is passed to Solr after all the
tokenizing and analyzing of the query?
For the search term 'chapter 7' I have this parsedquery_ToString
+(text:"(bankruptci chap 7) (7 chapter chap) 7 bankruptci"^0.8 |
((name:bankruptci name:chap)^2.0))~0.01 (text:"(bankruptci ch
I seem to have problems with the results I get for this search term. Not sure
if it's because of the synonym mappings I have for this search term.
Search term: chapter 7
The first result doesn't even have any occurrence of 'chapter' or 'bankruptcy',
just a few occurrences of '7'. But I have the 'mm'
Is it possible to implement something like this with the spellcheckhandler,
the way Google does it?
Say I search for 'chater 13 bakrupcy';
it should be able to display this:
did you search for 'chapter 13 bankruptcy'
Has someone been able to do this?
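
To make the idea concrete, here is a minimal client-side sketch in Python that corrects a multi-word query one word at a time. It uses the standard library's `difflib` rather than Solr's spellcheck handler, and the vocabulary list and the `did_you_mean` helper are assumptions for illustration only.

```python
import difflib


def did_you_mean(query, vocabulary, cutoff=0.6):
    """Return a corrected query, or None when no word needed correcting."""
    corrected = []
    for word in query.split():
        if word in vocabulary:
            corrected.append(word)
            continue
        matches = difflib.get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
        # Keep the original word when nothing in the vocabulary is close enough.
        corrected.append(matches[0] if matches else word)
    suggestion = " ".join(corrected)
    return suggestion if suggestion != query else None


vocabulary = ["chapter", "13", "bankruptcy", "attorney"]  # hypothetical index terms
print(did_you_mean("chater 13 bakrupcy", vocabulary))
```

Splitting the query and checking each word separately is also one way around a spell checker that only handles single-word input.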
How does the morelikethis handler work? The Solr wiki doesn't seem to have a
detailed explanation. In which cases would it be better to use this
instead of dismax?
--
View this message in context:
http://www.nabble.com/morelikethishandler-tp14628416p14628416.html
Sent from the Solr - User ma
I've been using the Solr admin form with debug=true to do some in-depth
analysis on some results. Could someone explain how to make sense of
this? This is the debugging info for the first result I got.
10.201284 = (MATCH) sum of:
6.2467875 = (MATCH) max plus 0.01 times others of:
6.236769
I have the word delimiter filter factory in the text field definition both at
index and query time.
But it does have some negative effects on some search terms like 'h1-b visa'.
It splits this into three tokens: h, 1, b. Now if I understand right, does
solr look for matches for 'h' separately, '1' sep
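
A very rough Python approximation of the splitting described here, assuming only the two rules that matter for 'h1-b': break on non-alphanumeric characters and on letter/digit transitions. The real WordDelimiterFilterFactory has more options (catenation, case-change splitting) that this sketch ignores, and the function name is made up.

```python
import re


def rough_word_delimiter(token):
    """Split a token into runs of letters and runs of digits."""
    return re.findall(r"[A-Za-z]+|[0-9]+", token)


print(rough_word_delimiter("h1-b"))  # ['h', '1', 'b']
print(rough_word_delimiter("visa"))  # ['visa']
```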
How does adding a phrase slop in the handler help?
I tried ps=25 along with some pf values. I assumed it means the following:
for a search term such as 'child custody battle', documents which have the
words 'child', 'custody', 'battle' within 25 words of one another will rank
high. Is that correct?
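
A simplified way to picture this in Python: measure the smallest span of token positions that contains all the query words and compare it with the slop. This is only an approximation (Lucene's slop counts position moves rather than a plain window), and as far as the dismax docs describe it, pf/ps only boost documents that pass the test; they do not remove the ones that fail. The helper name and sample text are made up.

```python
from itertools import product


def min_span(tokens, terms):
    """Smallest position distance covering one occurrence of every term, or None."""
    positions = [[i for i, tok in enumerate(tokens) if tok == term] for term in terms]
    if any(not found for found in positions):
        return None  # at least one term is missing from the document
    return min(max(combo) - min(combo) for combo in product(*positions))


doc = "the child was in a long custody battle".split()
span = min_span(doc, ["child", "custody", "battle"])
print(span, "positions apart; boosted with ps=25:", span is not None and span <= 25)
```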
ply want to drop solr.WordDelimiterFilterFactory
> from your analyzer definition, no?
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> - Original Message
> From: anuvenk <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
I noticed that the top 10 results for a particular search term had the same
score. In such cases how does solr determine which should get the first
place, second and so on?
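
As far as I know, Lucene breaks score ties by internal document id, so among equal scores the document that was indexed earlier comes first. A tiny Python sketch of that ordering, with made-up (doc id, score) pairs:

```python
# Hypothetical hits as (internal_doc_id, score) pairs.
hits = [(12, 1.5), (3, 1.5), (7, 2.0)]

# Sort by score descending, then by doc id ascending to break ties.
ranked = sorted(hits, key=lambda hit: (-hit[1], hit[0]))
print(ranked)  # [(7, 2.0), (3, 1.5), (12, 1.5)]
```

Adding an explicit secondary sort field to the request is the usual way to make the order of tied documents predictable.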
--
View this message in context:
http://www.nabble.com/How-does-solr-rank-multiple-docs-with-same-score-tp14638959p14638959
I understand tf means term frequency. For example, if the search term is 'chapter
7', does tf mean how frequently 'chapter 7' occurs in the docs? Does it take
into account the total number of words in a doc to determine frequency?
Also, what are idf, fieldNorm and queryNorm? Trying to understand how sol
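
For orientation, these are the building blocks of Lucene's classic default similarity, written out as a Python sketch. The function names are mine; real fieldNorm values also fold in index-time boosts and are stored with reduced precision, so numbers in debug output will not match these exactly.

```python
import math


def tf(freq):
    """Term frequency factor: grows with the number of occurrences in the field."""
    return math.sqrt(freq)


def idf(doc_freq, num_docs):
    """Inverse document frequency: rarer terms across the index weigh more."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def length_norm(num_terms):
    """Main ingredient of fieldNorm: shorter fields score higher."""
    return 1.0 / math.sqrt(num_terms)


def query_norm(sum_of_squared_weights):
    """Query-wide constant that makes scores comparable; it does not change ranking."""
    return 1.0 / math.sqrt(sum_of_squared_weights)


print(tf(4), idf(9, 10), length_norm(16), query_norm(4.0))
```

Note that tf itself does not look at document length; that adjustment comes from fieldNorm.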
of one
another because k1 is a single token? I seem to get confused with how solr
matches documents in cases like this.
Yonik Seeley wrote:
>
> On Jan 5, 2008 2:28 PM, anuvenk <[EMAIL PROTECTED]> wrote:
>> Thats what i'm thinking too. If i remove solr.worddelimiter
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> - Original Message
> From: anuvenk <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Saturday, January 5, 2008 4:05:00 AM
> Subject: phrase slop param in dismax handler
>
I log the search phrases the users on my site are using to search, and review
them regularly. Based on that, I add synonyms for certain phrases to help
increase the relevant results. The reason I don't have the synonym filter at
index time is because I can't re-index the whole or a portion of the data every time
I have a ps of 10 set in the config, so for search terms like 'work injury',
results that have 'work' and 'injury' within 10 words of one another are coming
up in the first few results, which is good. But it also returns results in which
'work' and 'injury' are more than 10 words apart although they are
I'm also looking into displaying 'related search terms' for every search
term.
How can this be done with Solr? The thread talks about a FuzzyLikeThisQuery.
Can this be used with Solr?
Has anyone done this before? Could you please share your ideas?
ajaxtrend wrote:
>
> Take a look at this thread
>
How to accomplish this with solr?
Say for a search term like 'alternative dispute resolution', I want to be
able to show
'see related searches' and display related search terms like 'mediation',
'arbitration', etc.
Has anyone done something like this before? If so, please share your ideas.
Is it possible to add a spell check component so I don't have to issue a
separate request to Solr to do the spell checking? Sorry if this question is
naive; I am just learning to use Solr.
and add it to the search handler like this
spellcheck
what would the name of the spell check compo
l' field at index time
But as you mentioned, the tutorial says we have to use it on a field
that's not tokenized. So how do I use my tokenized fields 'body' and 'name' to
build my spell index?
How do I use it effectively for spell checking on multi-word queries?
an
When will this be released? Where can I find the list of
improvements/enhancements in 1.3, if it's been documented already?
--
View this message in context:
http://www.nabble.com/solr-1.3-tp14989395p14989395.html
Sent from the Solr - User mailing list archive at Nabble.com.
atest code from the trunk and I can tell you
> it's quite stable. The only problem is the documentation sometimes doesn't
> cover latest changes in the code. You'll probably have to dig into the
> code
> itself or post a question here and many people will be happy to he
Could you please let me know the location from where I can get it?
climbingrose wrote:
>
> I'm using code pulled directly from Subversion.
>
> On Jan 21, 2008 12:34 PM, anuvenk <[EMAIL PROTECTED]> wrote:
>
>>
>> Thanks. Would this be the latest code from
What are term vectors? How do they help with MLT (MoreLikeThis)?
--
View this message in context:
http://www.nabble.com/Term-vector-tp14990408p14990408.html
Sent from the Solr - User mailing list archive at Nabble.com.
I followed the steps outlined in
http://wiki.apache.org/solr/SpellCheckerRequestHandler
with regard to setting up the schema with a new field 'spell' and
copying other fields to this 'spell' field at index time.
It works fine with single-word queries but doesn't return anything for
multi-word
http://wiki.apache.org/solr/UpdateXmlMessages
Is this what you are looking for? Index the document again and it should
overwrite the older one with the same id.
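
As a small illustration, this Python sketch builds the kind of `<add>` message described on that wiki page. The field names and values are made up, `id` is assumed to be the schema's unique key, and posting the message to the update handler (followed by a commit) is left out.

```python
import xml.etree.ElementTree as ET


def build_add_message(fields):
    """Build an <add><doc>...</doc></add> update message from a dict of fields."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Re-sending a document with an existing id replaces the older copy.
print(build_add_message({"id": "faq-1", "title": "dui faq1 (revised)"}))
```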
Gavin-39 wrote:
>
> Hi,
> Can some one point me to a location where it describes how to update an
> already indexed document? I w
g.
Not sure if i'm missing something. Please help!!
Otis Gospodnetic wrote:
>
> You don't need to wait for 1.3 to be released - you can simply use a
> recent nightly build.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
> - Original Message
> From: anuvenk <[EMAIL PROTECTED]>
> To: solr-user@lucene.apache.org
> Sent: Monday, January 21, 2008 12:35:52 AM
> Subject: Re: spellcheckhandler
>
>
I need to understand this synonym behaviour.
I have this synonym:
divorce mediation,alternative dispute resolution
So when I run the query with debug on, this is the parsedquery_toString I see:
(((text:divorc^0.8 | name:divorc^2.0)~0.01 (text:mediat^0.8 |
name:mediat^2.0)~0.01)~2) (text:"(divorc altern) (disput med
I have a hard time understanding the synonyms behaviour, especially because I
don't have the synonym filter at index time.
If I have this synonym at index time
Alternative Sentence,Probation before Judgement,Pretrial Diversion
do all occurrences of 'alternative sentence' also get indexed as 'proba
Thanks. But i'm looking at this
http://.../spellchecker?indent=on&onlyMorePopular=true&accuracy=.6&suggestionCount=20&q=facial+salophosphoprotein
on
http://lucene.apache.org/solr/api/org/apache/solr/handler/SpellCheckerRequestHandler.html
It seems to return results (well in the example)
with and
I followed your instructions exactly, but I still have trouble with multi-word
queries.
For example, q=grapics returns 'graphics'
but q=grapics card returns nothing.
I even tried the latest nightly build, but that didn't solve the problem. Is any
solution available?
scott.tabar wrote:
>
> Matthew,
>
> Thank
Here is what it means by injecting at query time:
This is the text field definition i have in my schema
and a ca
them. I
appreciate your help !!
scott.tabar wrote:
>
> Anuvenk,
>
> Sorry for this "Third" email, but I was reading your question below and I
> think it warrants yet another reply.
>
> Just some background from my focus and involvement, and hence the
> gener
Question about sorting with Solr. I want to group results in a certain sort
order so I can split them & display them in tabs easily.
I want to be able to have a custom sort order instead of sort=cat asc score
desc
In the above-mentioned way, categories are grouped in ascending order. But I
want certain c
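
One way to picture a custom category order is a client-side sketch in Python: map each category to a rank and sort by that rank, then by score descending. The category names, their order, and the result dicts are all hypothetical; the same rank could presumably be indexed as a numeric field and used in the sort parameter instead.

```python
# Hypothetical display order for the tabs.
CATEGORY_RANK = {"attorneys": 0, "faqs": 1, "forms": 2}


def custom_sort(results):
    """Order results by custom category rank, then by score (highest first)."""
    unknown = len(CATEGORY_RANK)  # unlisted categories go last
    return sorted(
        results,
        key=lambda r: (CATEGORY_RANK.get(r["cat"], unknown), -r["score"]),
    )


results = [
    {"cat": "forms", "score": 3.2},
    {"cat": "attorneys", "score": 1.1},
    {"cat": "faqs", "score": 2.5},
    {"cat": "attorneys", "score": 4.0},
]
for row in custom_sort(results):
    print(row["cat"], row["score"])
```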