Different ordering of result returned by solr for capital and small letters.

2009-01-03 Thread Tushar_Gandhi

Hi,
   I am using solr 1.3. I have an issue with the ordering of results
returned by solr for capital and small letters. When I am searching for
"cats" I am getting result. Next, when I am searching for "CATS" I am
getting same result but the ordering of result is different as compare to
previous one. Is there any issue with my indexing? OR Is there any priority
for searching depending on capital and small letters in solr ?

I want same result for both cases. What should I do?
 Also, I already have "LowerCaseFactoryFilter" for both on query-time
analyzer  and index in my config.

Please, anyone can help me, to out from this problem.

Thanks,
Tushar
-- 
View this message in context: 
http://www.nabble.com/Different-ordering-of-result-returned-by-solr-for-capital-and-small-letters.-tp21262962p21262962.html
Sent from the Solr - User mailing list archive at Nabble.com.



collectionDistribution vs SolrReplication

2009-01-03 Thread Marc Sturlese

Hey there,

I would like to know the advantages of moving from:
a master-slave system using CollectionDistribution with all their .sh
scripts
http://wiki.apache.org/solr/CollectionDistribution

to:
use SolrReplication and his solrconfig.xml configuration.
http://wiki.apache.org/solr/SolrReplication


Its tecnically much better or mainly for more easy use?
Does SolrReplication do warming aswell?

Checking performance numbers is solrReplication wiki page things seem to be
similar except for the RAM, are the advantages about that?

Thanks in advance!!
-- 
View this message in context: 
http://www.nabble.com/collectionDistribution-vs-SolrReplication-tp21269112p21269112.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Different ordering of result returned by solr for capital and small letters.

2009-01-03 Thread Chris Hostetter
: returned by solr for capital and small letters. When I am searching for
: "cats" I am getting result. Next, when I am searching for "CATS" I am
: getting same result but the ordering of result is different as compare to
: previous one. Is there any issue with my indexing? OR Is there any priority
: for searching depending on capital and small letters in solr ?

no such priority is hardcoded .. but you can get weighting like that by 
searching both case sensity and case-insensitive fields in the same query 
(which is in your control using schema.xml, etc...)

: I want same result for both cases. What should I do?
:  Also, I already have "LowerCaseFactoryFilter" for both on query-time
: analyzer  and index in my config.

my best hunch is that you added LowerCaseFactoryFilter after indexing your 
documents, so some are already in the index with capitalized words ... 
either that are you aren't searching the fields you think you are (maybe 
some fields have LowerCaseFactoryFilter and some don't)

if you add debugQuery=true to your queries, and post some example score 
explanations from both queries ("cats", and "CATS") people might be 
able to help you further ... the most helpful thing is usually to see the 
explanation from both queries for the same doc ... pick the highest 
scoring doc from one query, and then use the explainOther param to get 
it's explanation on the second query (even if it's not on the first page 
of results)


-Hoss



Re: synonyms.txt file updated frequently

2009-01-03 Thread Grant Ingersoll


On Jan 2, 2009, at 10:25 AM, Alexander Ramos Jardim wrote:


Grant,

I am following your idea to write a new TokenFilter.

As long as I looked in SynonymTokenFilter and Factory code, it is the
Factory the reponsible for loading the new

Just let me make some stupid questions:

1. I will have to write a custom TokenFilter and TokenFilterFactory,  
right?


Yes.



2. SynonymTokenFilterFactory does the "synonyms.txt" parse and  
creates the
SynonymTokenFilter instance. If I want the SynonymTokenFilter to  
reload
synonyms.txt file from time to time, I will need to put the file  
load and

parsing strategies inside my new TokenFilter, right?


I think it can go in the SynonymFilterFactory.





2008/12/30 Grant Ingersoll 

I'd probably write a new TokenFilter that was aware of the reload  
policy
(in a generic way) such that I didn't have to go through a whole  
core reload

every time.  Are you just using them during query time or also during
indexing?

-Grant


On Dec 30, 2008, at 8:12 AM, Alexander Ramos Jardim wrote:

Hello guys,


As the title suggests I must update my synonyms.txt file  
frequently. What

is
the best approach? Should I send a commit after the file is  
updated? Does

Solr need to be restarted after the file changes?

--
Alexander Ramos Jardim



--
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ














--
Alexander Ramos Jardim


--
Grant Ingersoll

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ












Re: collectionDistribution vs SolrReplication

2009-01-03 Thread Shalin Shekhar Mangar
I think the main reason is ease of use. Warming is done the same way by
adding a newSearcher listener in solrconfig.xml

On Sun, Jan 4, 2009 at 2:10 AM, Marc Sturlese wrote:

>
> Hey there,
>
> I would like to know the advantages of moving from:
> a master-slave system using CollectionDistribution with all their .sh
> scripts
> http://wiki.apache.org/solr/CollectionDistribution
>
> to:
> use SolrReplication and his solrconfig.xml configuration.
> http://wiki.apache.org/solr/SolrReplication
>
>
> Its tecnically much better or mainly for more easy use?
> Does SolrReplication do warming aswell?
>
> Checking performance numbers is solrReplication wiki page things seem to be
> similar except for the RAM, are the advantages about that?
>
> Thanks in advance!!
> --
> View this message in context:
> http://www.nabble.com/collectionDistribution-vs-SolrReplication-tp21269112p21269112.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>


-- 
Regards,
Shalin Shekhar Mangar.