Re: Solr faceting on score

2011-02-03 Thread Bagesh Sharma

Thanks for reply
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-faceting-on-score-tp2422076p2422147.html
Sent from the Solr - User mailing list archive at Nabble.com.


Problem in faceting

2011-02-03 Thread Bagesh Sharma

Dear sir, i have problem with faceting.
 I am searching a text  "water treatment plant" on solr using dismax request
handler.

The final query which goes to solr is here -




+((TDR_SUBIND_PROD_NAMES:water^2.5 | TDR_SUBIND_LOC_ZIP:water^2.5 |
TDR_SUBIND_COMP_NAME:water^1.5 | TDR_SUBIND_TENDER_NO:water |
TDR_SUBIND_SUBTDR_SHORT:water^3.0 | TDR_SUBIND_SUBTDR_DETAILS:water^2.0 |
TDR_SUBIND_LOC_CITY:water^3.0 | TDR_SUBIND_LOC_STATE:water^3.0 |
TDR_SUBIND_NAME:water^1.5)~0.2 (TDR_SUBIND_PROD_NAMES:treatment^2.5 |
TDR_SUBIND_LOC_ZIP:treatment^2.5 | TDR_SUBIND_COMP_NAME:treatment^1.5 |
TDR_SUBIND_TENDER_NO:treatment | TDR_SUBIND_SUBTDR_SHORT:treatment^3.0 |
TDR_SUBIND_SUBTDR_DETAILS:treatment^2.0 | TDR_SUBIND_LOC_CITY:treatment^3.0
| TDR_SUBIND_LOC_STATE:treatment^3.0 | TDR_SUBIND_NAME:treatment^1.5)~0.2
(TDR_SUBIND_PROD_NAMES:plant^2.5 | TDR_SUBIND_LOC_ZIP:plant^2.5 |
TDR_SUBIND_COMP_NAME:plant^1.5 | TDR_SUBIND_TENDER_NO:plant |
TDR_SUBIND_SUBTDR_SHORT:plant^3.0 | TDR_SUBIND_SUBTDR_DETAILS:plant^2.0 |
TDR_SUBIND_LOC_CITY:plant^3.0 | TDR_SUBIND_LOC_STATE:plant^3.0 |
TDR_SUBIND_NAME:plant^1.5)~0.2) (TDR_SUBIND_SUBTDR_DETAILS:"water treatment
plant"^10.0 | TDR_SUBIND_COMP_NAME:"water treatment plant"^20.0 |
TDR_SUBIND_SUBTDR_SHORT:"water treatment plant"^15.0)~0.2





Now i want to do faceting over those results which have complete text "water
treatment plant " in it. means the records which have "water treatment
plant" completely. i donot want to do faceting on the results which has 1 or
2 words matching like "water" or "treatment". But in case of above query i
am not able to achieve this thing.

The Main Problem :

There is a field FACET_CITY in my schema.xml and i want to  find out only
those cities for which the complete text "water treatment plant" should
match. I don't want those cities for which only "water" or "treatment" words
are matching.

I have two possibilities to achieve this functionality -
1. Either anyhow i can find out the cities list for which the complete text
is matching means faceting only on complete text matching documents
 OR
2. Faceting over first 100 documents only for cities list. It may be for
first 100 documents having more score.

Please Suggest me how can i achieve this.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-in-faceting-tp2422182p2422182.html
Sent from the Solr - User mailing list archive at Nabble.com.


Facet Query

2011-02-03 Thread Bagesh Sharma

Hi, Is facet query and fq parameters works only for range queries. can i make
a general query for it like searching a facet.query=city:mumbai and getting
results back. please suggest. 
When i made this query i am only getting count back for it . How can i get
documents for it.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Facet-Query-tp2422212p2422212.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Problem in faceting

2011-02-04 Thread Bagesh Sharma

But i want results as it is as the above query is returning. There is no
problem with the results with it is returning.

Problem detail

I have implemented search for my company in which in search box user can
search any query. Now when a user search "water treatment plant". Then the
results come back according to above given query in which the documents
containing words "water" or "treatment" or "plant" or "water treatment
plant" is matching. All these searched results are correct and fulfill my
requirements . Along with these results i am doing faceting over cities to
display. Currently all cities are displayed if they are of a record matching
with any word "water" or "treatment" or "plant" or "water treatment plant".
But now my requirement is to keep the records as it is but do faceting over
only those cities for which complete text "water treatment plant" is
matching.

Is it possible by a single query to solr please suggest. Thanks a lot for
your response. 
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-in-faceting-tp2422182p2422353.html
Sent from the Solr - User mailing list archive at Nabble.com.


Solr faceting on score

2011-02-04 Thread Bagesh Sharma

Hi friends, Is it possible to do faceting over score. I want to results from
facets which have more score. Please suggest.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-faceting-on-score-tp2422076p2422076.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Problem in faceting

2011-02-04 Thread Bagesh Sharma

Sending two separate queries is an approach but i think it may affect
performance of the solr because for every new search there will be two
queries to solr due to this reason i was thinking to do it by a single
query. I am going to implement it with two queries now but if any thing is
found useful in future then suggest me please. Thanks for the suggestion
-- 
Thanks and Regards
   Bagesh Sharma

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-in-faceting-tp2422182p2424104.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Facet Query

2011-02-04 Thread Bagesh Sharma

yes it works fine ... thanks
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Facet-Query-tp2422212p2424155.html
Sent from the Solr - User mailing list archive at Nabble.com.


How to use q.op

2011-02-05 Thread Bagesh Sharma

Hi friends , Please tell me how to use q.op for for dismax and standared
request handler. I found that q.op=AND was not working for dismax.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-use-q-op-tp2431273p2431273.html
Sent from the Solr - User mailing list archive at Nabble.com.


AND operator and dismax request handler

2011-02-05 Thread Bagesh Sharma

Hi friends, Please suggest me that how can i set query operator to AND for
dismax request handler case.

My problem is that i am searching a string "water treatment plant" using
dismax request handler . The query formed is of such type 

http://localhost:8884/solr/select/?q=water+treatment+plant&q.alt=*:*&start=0&rows=5&sort=score%20desc&qt=dismax&omitHeader=true

My handling for dismax request handler in solrConfig.xml is - 



true
explicit
0.2


TDR_SUBIND_SUBTDR_SHORT^3
TDR_SUBIND_SUBTDR_DETAILS^2
TDR_SUBIND_COMP_NAME^1.5
TDR_SUBIND_LOC_STATE^3
TDR_SUBIND_PROD_NAMES^2.5
TDR_SUBIND_LOC_CITY^3
TDR_SUBIND_LOC_ZIP^2.5
TDR_SUBIND_NAME^1.5
TDR_SUBIND_TENDER_NO^1



TDR_SUBIND_SUBTDR_SHORT^15
TDR_SUBIND_SUBTDR_DETAILS^10
TDR_SUBIND_COMP_NAME^20


1
0
20%




In the final parsed query it is like 

+((TDR_SUBIND_PROD_NAMES:water^2.5 | TDR_SUBIND_LOC_ZIP:water^2.5 |
TDR_SUBIND_COMP_NAME:water^1.5 | TDR_SUBIND_TENDER_NO:water |
TDR_SUBIND_SUBTDR_SHORT:water^3.0 | TDR_SUBIND_SUBTDR_DETAILS:water^2.0 |
TDR_SUBIND_LOC_CITY:water^3.0 | TDR_SUBIND_LOC_STATE:water^3.0 |
TDR_SUBIND_NAME:water^1.5)~0.2 (TDR_SUBIND_PROD_NAMES:treatment^2.5 |
TDR_SUBIND_LOC_ZIP:treatment^2.5 | TDR_SUBIND_COMP_NAME:treatment^1.5 |
TDR_SUBIND_TENDER_NO:treatment | TDR_SUBIND_SUBTDR_SHORT:treatment^3.0 |
TDR_SUBIND_SUBTDR_DETAILS:treatment^2.0 | TDR_SUBIND_LOC_CITY:treatment^3.0
| TDR_SUBIND_LOC_STATE:treatment^3.0 | TDR_SUBIND_NAME:treatment^1.5)~0.2
(TDR_SUBIND_PROD_NAMES:plant^2.5 | TDR_SUBIND_LOC_ZIP:plant^2.5 |
TDR_SUBIND_COMP_NAME:plant^1.5 | TDR_SUBIND_TENDER_NO:plant |
TDR_SUBIND_SUBTDR_SHORT:plant^3.0 | TDR_SUBIND_SUBTDR_DETAILS:plant^2.0 |
TDR_SUBIND_LOC_CITY:plant^3.0 | TDR_SUBIND_LOC_STATE:plant^3.0 |
TDR_SUBIND_NAME:plant^1.5)~0.2) (TDR_SUBIND_SUBTDR_DETAILS:"water treatment
plant"^10.0 | TDR_SUBIND_COMP_NAME:"water treatment plant"^20.0 |
TDR_SUBIND_SUBTDR_SHORT:"water treatment plant"^15.0)~0.2



Now it gives me results if any of the word is found from text "water
treatment plant". I think here OR operator is working which finally combines
the results.

Now i want only those results for which only complete text should be
matching "water treatment plant".

1. I do not want to make any change in solrConfig.xml dismax handler. If
possible then suggest any other handler to deal with it.

2. Does there is really or operator is working in query. basically when i
query like this 

q=%2Bwater%2Btreatment%2Bplant&q.alt=*:*&q.op=AND&start=0&rows=5&sort=score
desc,TDR_SUBIND_SUBTDR_OPEN_DATE
asc&omitHeader=true&debugQuery=true&qt=dismax

OR 

q=water+AND+treatment+AND+plant&q.alt=*:*&q.op=AND&start=0&rows=5&sort=score
desc,TDR_SUBIND_SUBTDR_OPEN_DATE
asc&omitHeader=true&debugQuery=true&qt=dismax


Then it is giving different results. Can you suggest what is the difference
between above two queries.

Please suggest me for full text search "water treatment plant".

Thanks for your response.

-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/AND-operator-and-dismax-request-handler-tp2431391p2431391.html
Sent from the Solr - User mailing list archive at Nabble.com.


Loading data to solr from mysql

2011-02-07 Thread Bagesh Sharma

Can anybody suggest me the way to load data from mysql to solr directly.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Loading-data-to-solr-from-mysql-tp2442184p2442184.html
Sent from the Solr - User mailing list archive at Nabble.com.


difference between filter_queries and parsed_filter_queries

2011-02-08 Thread Bagesh Sharma

Hi everybody, please suggest me what's the difference between these two
things. After what processing on filter_queries the parsed_filter_queries
are generated.

Basically ... when i am searching city as fq=city:'noida' 

then filter_queries and parsed_filter_queries both are same as 'noida'.  In
this case i do not get any result.

But when i do query like this  fq=city:"noida" then filter_queries is
"noida" but parsed_filter_queries is noida and it matches with the city and
i am getting correct results. 

what processing is going on from filter_queries to parsed_filter_queries. 

my schema for city is : -

 







please suggest me please.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/difference-between-filter-queries-and-parsed-filter-queries-tp2451708p2451708.html
Sent from the Solr - User mailing list archive at Nabble.com.


Problem in full query searching

2011-02-24 Thread Bagesh Sharma

Hi sir,  My problem is that when i am searching a string "software
engineering institute" in query then i am not getting those documents first
which have complete text matching in them. There are documents which have
complete text matching but they are not appearing above in the result. I
want the results like that first complete string matching after that 2 word
matching and at last any word matching. I am using dismax request handler. I
also studied about "Term Proximity" but its also not working for me.

I have sorted on score desc to result. After analyzing i observed that the
documents which don't have complete text in it but they have more occurrence
of 3 or 2 or 1 words in its body text due to this they are getting higher
score. Is there any way to get high score for those documents which have
complete text matching instead of more occurrences of any word.

Please suggest me.   
-- 
Thanks and Regards
   Bagesh Sharma
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-in-full-query-searching-tp2566054p2566054.html
Sent from the Solr - User mailing list archive at Nabble.com.


query slop issue

2011-02-24 Thread Bagesh Sharma

Hi all, i have a search string q=water+treatment+plant  and i am using dismax
request handler where i have qs = 1 . in which way processing will be done
means with in how many words water or treatment or plant should occur to
come in result set.

 
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/query-slop-issue-tp2567418p2567418.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: query slop issue

2011-02-24 Thread Bagesh Sharma

Thanks  very good explanation.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/query-slop-issue-tp2567418p2573185.html
Sent from the Solr - User mailing list archive at Nabble.com.


solr score issue

2011-02-25 Thread Bagesh Sharma

Hi sir , 

Can anyone explain me how this score is being calculated. i am searching
here "software engineer" using dismax handler. Total documents indexed are
477 and query results are 28.

Query is like that -
   q=software+engineer&fq=location%3Adelhi

dismax setting is - 

   
 alltext
 title^2
 functional_role^1



  body^100



Here alltext field is made by copying all fields.
body field contains detail of job.

I am unable to understand how these scores have been calculated. From where
to start score calculating and what are default score for any term matching.



0.5901718 = (MATCH) sum of:
  0.0032821721 = (MATCH) sum of:
0.0026574256 = (MATCH) max plus 0.1 times others of:
  0.0026574256 = (MATCH) weight(alltext:softwar in 339), product of:
0.0067262817 = queryWeight(alltext:softwar), product of:
  3.6121683 = idf(docFreq=34, maxDocs=477)
  0.0018621174 = queryNorm
0.39508092 = (MATCH) fieldWeight(alltext:softwar in 339), product
of:
  1.0 = tf(termFreq(alltext:softwar)=1)
  3.6121683 = idf(docFreq=34, maxDocs=477)
  0.109375 = fieldNorm(field=alltext, doc=339)
6.2474643E-4 = (MATCH) max plus 0.1 times others of:
  6.2474643E-4 = (MATCH) weight(alltext:engin in 339), product of:
0.0032613424 = queryWeight(alltext:engin), product of:
  1.7514161 = idf(docFreq=224, maxDocs=477)
  0.0018621174 = queryNorm
0.19156113 = (MATCH) fieldWeight(alltext:engin in 339), product of:
  1.0 = tf(termFreq(alltext:engin)=1)
  1.7514161 = idf(docFreq=224, maxDocs=477)
  0.109375 = fieldNorm(field=alltext, doc=339)
  0.5868896 = weight(body:"softwar engin"^100.0 in 339), product of:
0.9995919 = queryWeight(body:"softwar engin"^100.0), product of:
  100.0 = boost
  5.3680387 = idf(body: softwar=34 engin=223)
  0.0018621174 = queryNorm
0.58712924 = fieldWeight(body:"softwar engin" in 339), product of:
  1.0 = tf(phraseFreq=1.0)
  5.3680387 = idf(body: softwar=34 engin=223)
  0.109375 = fieldNorm(field=body, doc=339)



please suggest me.
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/solr-score-issue-tp2574680p2574680.html
Sent from the Solr - User mailing list archive at Nabble.com.