Question about dismax 'mm' - give boost to searches by location

2008-11-20 Thread anuvenk
I use the 'dismax handler' for my phrase matching. And i have the 'mm' set this way: Up to 3 words, match all up to 4, match 3 up to 4, match 3 & so on Its been working fine, but for certain phrases like 'san diego drunk driving defense attorney', its brings up dui attorneys for other cities first

Re: Question about dismax 'mm' - give boost to searches by location

2008-11-21 Thread anuvenk
ament from all states are returned although user specifically asked for florida will. I don't want to alter the 'mm' either because its working fine for other searches. Just for the search terms with a 'location' , i want to be able to match all words. Any easy way to do t

Question about Query Phrase Slop (qs) in dismax

2008-11-22 Thread anuvenk
>From the solr wiki, it sounded like if qs is set to 5 for example, & if the search term is 'child custody', only docs with 'child' & 'custody' within 5 words of one another would be returned in results. Is this correct? If so, it doesn't seem to be working for me. I see docs with 'child' & 'custo

Re: Question about Query Phrase Slop (qs) in dismax

2008-11-23 Thread anuvenk
Somebody please help clear this doubt. What more could i do with the dismax handler to remove results that don't have 'word1'', 'word2', 'word3' etc in a search phrase not within 5 words of one another, to not come up in the results? anuvenk wrote: >

Re: Please Help !! Question about Query Phrase Slop (qs) in dismax

2008-11-23 Thread anuvenk
occurrence of exchange (not related to 1031 exchange though). They were not within 5 words of one another. My guys have been hammering me reg this relevancy issue. Please help someone. anuvenk wrote: > > From the solr wiki, it sounded like if qs is set to 5 for example, & if > the s

Re: Please Help !! Question about Query Phrase Slop (qs) in dismax

2008-11-23 Thread anuvenk
north & dakota somewhere. I thought if there is a way to just not return any results if they are not within close proximity, it would be great. Yonik Seeley wrote: > > On Sun, Nov 23, 2008 at 11:51 PM, anuvenk <[EMAIL PROTECTED]> > wrote: >> Please help someone...i&#x

Dismax handler phrase matching question

2009-06-01 Thread anuvenk
Hello, I'm using the dismax handler for the phrase matching. I have a few legal resources in my index in the following format for example title state dui faq1 california dui faq2 florida dui faq3 federal

Re: Dismax handler phrase matching question

2009-06-02 Thread anuvenk
. Any other thoughts? Shalin Shekhar Mangar wrote: > > On Tue, Jun 2, 2009 at 12:53 AM, anuvenk wrote: > >> >> title state >> >> dui faq1 california >> dui faq2 florida >> dui faq3

Is there Downside to a huge synonyms file?

2009-06-02 Thread anuvenk
In my index i have legal faqs, forms, legal videos etc with a state field for each resource. Now if i search for real estate san diego, I want to be able to return other 'california' results i.e results from san francisco. I have the following fields in the index title

Re: Is there Downside to a huge synonyms file?

2009-06-02 Thread anuvenk
data over and over again. anuvenk wrote: > > In my index i have legal faqs, forms, legal videos etc with a state field > for each resource. > Now if i search for real estate san diego, I want to be able to return > other 'california' results i.e results from san francisco. >

Re: synonyms

2009-06-03 Thread anuvenk
uses synonyms, a big > list that sometimes could bring out some wrong match as the one > noticed by Anuvenk > for instance > >> dui => drunk driving defense >> or >> dui,drunk driving defense,drunk driving law >> query for "dui" matches "dui =>

Re: Is there Downside to a huge synonyms file?

2009-06-03 Thread anuvenk
false 1 Thanks. Otis Gospodnetic wrote: > > > Hello, > > 300K is a pretty small index. I wouldn't worry about the number of > synonyms unless you are turning a single term into dozens of ORed terms. > > Otis > -- >

Re: Is there Downside to a huge synonyms file?

2009-06-03 Thread anuvenk
A small addition to my earlier post. I wonder if its because of the 'mm' param, which requires that until 3 words in search phrase, all the words should be matched. If i alter this now, i'd get ir-relevant results for a lot of popular 1, 2, 3 word search terms. How to solve for

Is it possible to apply index-time synonyms just for a section of the index

2009-06-25 Thread anuvenk
I've posted a few questions on synonyms before and finally understood how it worked and settled with index-time synonyms. Seems to work much better than query time synonyms. But now @ my work, they have a special request. They want certain synonyms to be applied only to certain sections of the ind

Re: Is it possible to apply index-time synonyms just for a section of the index

2009-06-25 Thread anuvenk
That's right. Simple. I can very well do that. Why didn't I think of it. Thanks. rswart wrote: > > What is stopping you from defining different field types for faqs and > attorneys? One with index time synomyms and one without. > > > > anuvenk wrote: >&g

Re: Query Syntax (Standard handler) Question

2008-01-04 Thread anuvenk
Could you elaborate on what the tie param does? I did read the definition in the solr wiki but still not crystal clear. Mike Klaas wrote: > > > On 4-Jan-08, at 1:12 PM, s d wrote: > >> but i want to sum the scores and not use max, can i still do it >> with the >> DisMax? am i missing anythin

parsedquery_ToString

2008-01-04 Thread anuvenk
Is the parsedquery_ToString, the one passed to solr after all the tokenizing and analyzing of the query? For the search term 'chapter 7' i have this parsedquery_ToString +(text:"(bankruptci chap 7) (7 chapter chap) 7 bankruptci"^0.8 | ((name:bankruptci name:chap)^2.0))~0.01 (text:"(bankruptci ch

Dealing with numbers in search terms

2008-01-04 Thread anuvenk
I seem to have problems with the results i get for this search term. Not sure if its because of the synonym mappings i have for this search term. Search term: chapter 7 The first result doesn't even have any occurrence of chapter, bankruptcy. But just a few occurrences of 7. But i have the 'mm'

spellcheckhandler

2008-01-04 Thread anuvenk
Is it possible to implement something like this with the spellcheckhandler Like how google does,.. say i search for 'chater 13 bakrupcy', should be able to display these.. did you search for 'chapter 13 bankruptcy' Has someone been able to do this? -- View this message in context: http://ww

morelikethishandler

2008-01-04 Thread anuvenk
How does the morelikethis handler work? Solr wiki doesn't seem to have an elaborate explaination. In which cases would it be better to use this instead of the dismax? -- View this message in context: http://www.nabble.com/morelikethishandler-tp14628416p14628416.html Sent from the Solr - User ma

solr results debugging

2008-01-04 Thread anuvenk
I've been using the solr admin form with debug=true to do some in-depth analysis on some results. Could someone explain how to make sense of this..This is the debugging info for the first result i got. 10.201284 = (MATCH) sum of: 6.2467875 = (MATCH) max plus 0.01 times others of: 6.236769

solr word delimiter

2008-01-04 Thread anuvenk
I have the word delimiter filter factory in the text field definition both at index and query time. But it does have some negative effects on some search terms like h1-b visa It splits this in to three tokens h,1,b. Now if i understand right, does solr look for matches for 'h' separately, '1' sep

phrase slop param in dismax handler

2008-01-05 Thread anuvenk
How does adding a phrase slop in the handler help? I tried ps=25 along with some pf values. I assumed that it means this..for eg: a search term, 'child custody battle' means documents which have the words 'child','custody','battle' within 25 words of one another will rank high. Is that correct? --

Re: solr word delimiter

2008-01-05 Thread anuvenk
ply want to drop solr.WordDelimiterFilterFactory > from your analyzer definition, no? > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: anuvenk <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org

How does solr rank multiple docs with same score

2008-01-05 Thread anuvenk
I noticed that the top 10 results for a particular search term had the same score. In such cases how does solr determine which should get the first place, second and so on? -- View this message in context: http://www.nabble.com/How-does-solr-rank-multiple-docs-with-same-score-tp14638959p14638959

what are tf,idf,fieldNorm,queryNorm.?

2008-01-05 Thread anuvenk
I understand tf means term frequency. For eg: if the search term is 'chapter 7', does tf mean how frequently 'chapter 7' occurs in the docs? Does it take in to account the total number of words in a doc to determine frequency. Also what is idf, fieldNorm and queryNorm. Trying to understand how sol

Re: solr word delimiter

2008-01-05 Thread anuvenk
of one another because k1 is a single token? I seem to get confused with how solr matches documents in cases like this. Yonik Seeley wrote: > > On Jan 5, 2008 2:28 PM, anuvenk <[EMAIL PROTECTED]> wrote: >> Thats what i'm thinking too. If i remove solr.worddelimiter

Re: phrase slop param in dismax handler

2008-01-05 Thread anuvenk
; Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: anuvenk <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Saturday, January 5, 2008 4:05:00 AM > Subject: phrase slop param in dismax handler &g

Re: parsedquery_ToString

2008-01-06 Thread anuvenk
I log the search phrases the users on my site are using to search, and review them regularly. Based on that i add synonyms for certain phrases to help increase the relevant results. The reason i don't have the synonym filter at index time is because i can't re-index whole/portion of data everytime

phrase slop

2008-01-09 Thread anuvenk
I have a ps of 10 set in the config. so for search terms like 'work injury' , results that have work and injury within 10 words of one another are coming up in the first few results which is good. But it also returns results that have work and injury are more than 10 words apart although they are

Re: Related Search

2008-01-17 Thread anuvenk
I'm also looking in to displaying 'related search terms' for every search term. How to do it with solr? The thread explains about some FuzzyLikeThisQuery. Can this be used with solr? Has anyone done this before? Could you please share your ideas. ajaxtrend wrote: > > Take a look at this thread >

Related searches

2008-01-17 Thread anuvenk
How to accomplish this with solr? Say for a search term like 'alternative dispute resolution', i want to be able to display 'see related searches' and display related search terms like 'mediation', 'arbitration' etc. Has anyone done something like this before? If so please share your ideas. -- V

spell check component

2008-01-19 Thread anuvenk
Is it possible to add a spell check component so i don't have to issue a separate request to solr to do the spell checking? Sorry if this question is naive..am just learning to use solr. and add it to the search handler like this spellcheck what would the name of the spell check compo

Re: spellcheckhandler

2008-01-19 Thread anuvenk
l' field at index time But like you had mentioned, the tutorial says we have to use it on a field thats not tokenized. Now how to use my tokenized fields 'body' and 'name' to build my spell index? How to use it effectively for spell checking on multi-word queries? an

solr 1.3

2008-01-20 Thread anuvenk
when will this be released? where can i find the list of improvements/enhancements in 1.3 if its been documented already? -- View this message in context: http://www.nabble.com/solr-1.3-tp14989395p14989395.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr 1.3

2008-01-20 Thread anuvenk
atest code from the trunk and I can tell you > it's quite stable. The only problem is the documentation sometimes doesn't > cover lastest changes in the code. You'll probably have to dig into the > code > itself or post a question here and many people will be happy to he

Re: solr 1.3

2008-01-20 Thread anuvenk
Could you please let me know the location from where i can get it. climbingrose wrote: > > I'm using code pulled directly from Subversion. > > On Jan 21, 2008 12:34 PM, anuvenk <[EMAIL PROTECTED]> wrote: > >> >> Thanks. Would this be the latest code from

Term vector

2008-01-20 Thread anuvenk
what are term vectors? How do they help with mlt? -- View this message in context: http://www.nabble.com/Term-vector-tp14990408p14990408.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: spellcheckhandler

2008-01-20 Thread anuvenk
I followed the steps outlined in http://wiki.apache.org/solr/SpellCheckerRequestHandler with regards to setting up of the schema with a new field 'spell' and copying other fields to this 'spell' field at index time. It works fine with single word queries but doesn't return anything for multi-word

Re: Update the index

2008-01-20 Thread anuvenk
http://wiki.apache.org/solr/UpdateXmlMessages Is this what you are looking for. Index the document again and it should overwrite the older one with the same id. Gavin-39 wrote: > > Hi, > Can some one point me to a location where it describes how to update an > already indexed document? I w

Re: spellcheckhandler

2008-01-21 Thread anuvenk
g. Not sure if i'm missing something. Please help!! Otis Gospodnetic wrote: > > You don't need to wait for 1.3 to be released - you can simply use a > recent nightly build. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch >

Re: spellcheckhandler

2008-01-22 Thread anuvenk
ure if i'm missing something. Please help!! Otis Gospodnetic wrote: > > You don't need to wait for 1.3 to be released - you can simply use a > recent nightly build. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Me

Re: spellcheckhandler

2008-01-22 Thread anuvenk
> Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: anuvenk <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, January 21, 2008 12:35:52 AM > Subject: Re: spellcheckhandler > &g

solr synonyms behaviour

2008-01-23 Thread anuvenk
I need to understand this synonym behaviour I have this synonym divorce mediation,alternative dispute resolution so when i do a debug this is the parsedquery_tostring i see: (((text:divorc^0.8 | name:divorc^2.0)~0.01 (text:mediat^0.8 | name:mediat^2.0)~0.01)~2) (text:"(divorc altern) (disput med

Re: spellcheckhandler

2008-01-23 Thread anuvenk
> Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: anuvenk <[EMAIL PROTECTED]> > To: solr-user@lucene.apache.org > Sent: Monday, January 21, 2008 12:35:52 AM > Subject: Re: spellcheckhandler > &g

Index time synonyms

2008-01-24 Thread anuvenk
I have a hard time understanding the synonyms behaviour..especially because i don't have the syn filter at index time. If i have this synonym at index time Alternative Sentence,Probation before Judgement,Pretrial Diversion does all occurrence of 'alternative sentence' also get indexed as 'proba

Re: spellcheckhandler

2008-01-25 Thread anuvenk
Thanks. But i'm looking at this http://.../spellchecker?indent=on&onlyMorePopular=true&accuracy=.6&suggestionCount=20&q=facial+salophosphoprotein on http://lucene.apache.org/solr/api/org/apache/solr/handler/SpellCheckerRequestHandler.html It seems to return results (well in the example) with and

Re: Spell Check Handler

2008-01-25 Thread anuvenk
I followed your instructions exactly. But still have trouble with multiword queries for eg: q=grapics returns 'graphics' but q=grapics card returns nothing. I even tried with the latest nightly build but didn't solve the problem. Any solution available. scott.tabar wrote: > > Matthew, > > Thank

Re: Is it possible to add synonyms run time?

2008-01-25 Thread anuvenk
Here is what it means by injecting at query time: This is the text field definition i have in my schema and a ca

Re: spellcheckhandler

2008-01-26 Thread anuvenk
them. I appreciate your help !! scott.tabar wrote: > > Anuvenk, > > Sorry for this "Third" email, but I was reading your question below and I > think it warrants yet another reply. > > Just some background from my focus and involvement, and hence the > gener

solr sorting question

2008-05-27 Thread anuvenk
Question about sorting with solr. I want to group results in certain sort order so i can split them & display in tabs easily. I want to be able to have a custom sort order instead of sort=cat asc score desc In the above mentioned way, categories are grouped in ascending order. But i want certain c