Different ordering of result returned by solr for capital and small letters.
Hi, I am using Solr 1.3 and have an issue with the ordering of results for uppercase and lowercase queries. When I search for "cats" I get results. When I then search for "CATS" I get the same results, but in a different order than before. Is there a problem with my indexing? Or does Solr prioritize matches depending on letter case? I want the same results, in the same order, in both cases. What should I do? I already have "LowerCaseFilterFactory" in both the index-time and query-time analyzers in my config. Can anyone help me with this problem? Thanks, Tushar -- View this message in context: http://www.nabble.com/Different-ordering-of-result-returned-by-solr-for-capital-and-small-letters.-tp21262962p21262962.html Sent from the Solr - User mailing list archive at Nabble.com.
collectionDistribution vs SolrReplication
Hey there, I would like to know the advantages of moving from a master-slave system using CollectionDistribution, with all its .sh scripts (http://wiki.apache.org/solr/CollectionDistribution), to SolrReplication and its solrconfig.xml configuration (http://wiki.apache.org/solr/SolrReplication). Is it technically much better, or mainly easier to use? Does SolrReplication do warming as well? Looking at the performance numbers on the SolrReplication wiki page, things seem similar except for RAM; is that where the advantage lies? Thanks in advance! -- View this message in context: http://www.nabble.com/collectionDistribution-vs-SolrReplication-tp21269112p21269112.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Different ordering of result returned by solr for capital and small letters.
: returned by solr for capital and small letters. When I am searching for
: "cats" I am getting result. Next, when I am searching for "CATS" I am
: getting same result but the ordering of result is different as compare to
: previous one. Is there any issue with my indexing? OR Is there any priority
: for searching depending on capital and small letters in solr ?

No such priority is hardcoded, but you can get weighting like that by searching both case-sensitive and case-insensitive fields in the same query (which is in your control using schema.xml, etc.).

: I want same result for both cases. What should I do?
: Also, I already have "LowerCaseFilterFactory" for both on query-time
: analyzer and index in my config.

My best hunch is that you added LowerCaseFilterFactory after indexing your documents, so some are already in the index with capitalized words. Either that, or you aren't searching the fields you think you are (maybe some fields have LowerCaseFilterFactory and some don't).

If you add debugQuery=true to your queries and post some example score explanations from both queries ("cats" and "CATS"), people might be able to help you further. The most helpful thing is usually to see the explanation from both queries for the same doc: pick the highest scoring doc from one query, and then use the explainOther param to get its explanation on the second query (even if it's not on the first page of results).

-Hoss
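As a rough illustration of the debugging steps Hoss describes, the following sketch builds the two request URLs. The base URL, the `id` field, and the document key `DOC42` are assumptions made for the example and are not taken from the thread; only the `q`, `debugQuery`, and `explainOther` parameters come from the discussion.

```python
from urllib.parse import urlencode

# Assumed base URL; adjust host, port, and path to match your installation.
SOLR_SELECT = "http://localhost:8983/solr/select"


def debug_url(query, explain_other=None, base=SOLR_SELECT):
    """Build a select URL with debugQuery=true and, optionally, explainOther."""
    params = [("q", query), ("debugQuery", "true")]
    if explain_other is not None:
        # explainOther takes a query that identifies the document(s) to explain.
        params.append(("explainOther", explain_other))
    return base + "?" + urlencode(params)


if __name__ == "__main__":
    # First query: note the key of the top-scoring document in the response.
    print(debug_url("cats"))
    # Second query: request the explanation for that same document.
    # "id:DOC42" is a hypothetical unique-key query.
    print(debug_url("CATS", explain_other="id:DOC42"))
```

Comparing the two explanations for the same document should show which field or term contributes the differing score.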
Re: synonyms.txt file updated frequently
On Jan 2, 2009, at 10:25 AM, Alexander Ramos Jardim wrote:

> Grant, I am following your idea to write a new TokenFilter. As far as I
> can see from the SynonymTokenFilter and Factory code, it is the Factory
> that is responsible for loading the new
>
> Just let me ask some stupid questions:
>
> 1. I will have to write a custom TokenFilter and TokenFilterFactory, right?

Yes.

> 2. SynonymTokenFilterFactory parses "synonyms.txt" and creates the
> SynonymTokenFilter instance. If I want the SynonymTokenFilter to reload
> the synonyms.txt file from time to time, I will need to put the file
> loading and parsing strategies inside my new TokenFilter, right?

I think it can go in the SynonymFilterFactory.

> 2008/12/30 Grant Ingersoll
>
>> I'd probably write a new TokenFilter that was aware of the reload policy
>> (in a generic way) such that I didn't have to go through a whole core
>> reload every time. Are you just using them during query time or also
>> during indexing?
>>
>> -Grant
>>
>> On Dec 30, 2008, at 8:12 AM, Alexander Ramos Jardim wrote:
>>
>>> Hello guys, as the title suggests, I must update my synonyms.txt file
>>> frequently. What is the best approach? Should I send a commit after the
>>> file is updated? Does Solr need to be restarted after the file changes?
>>>
>>> -- Alexander Ramos Jardim

-- Grant Ingersoll
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
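The "reload policy" idea discussed above can be sketched independently of Solr. The following is a language-neutral illustration written in Python, not Solr or Lucene code: the class name `ReloadingSynonyms` is hypothetical, the reload trigger (a change in the file's modification time) is one possible policy chosen for the example, and the parser handles only comma-separated equivalence groups and `#` comments, not `=>` mappings.

```python
import os


class ReloadingSynonyms:
    """Synonym lookup that re-reads its file when the file's mtime changes.

    Hypothetical sketch of a reload policy; not part of Solr or Lucene.
    """

    def __init__(self, path):
        self.path = path
        self._mtime = None  # mtime of the version currently loaded
        self._map = {}

    @staticmethod
    def _parse(text):
        mapping = {}
        for line in text.splitlines():
            line = line.strip()
            if not line or line.startswith("#"):
                continue  # skip blank lines and comments
            terms = [t.strip().lower() for t in line.split(",") if t.strip()]
            for term in terms:
                mapping[term] = terms
        return mapping

    def lookup(self, token):
        # Reload only when the file has changed since the last load.
        mtime = os.stat(self.path).st_mtime_ns
        if mtime != self._mtime:
            with open(self.path, encoding="utf-8") as fh:
                self._map = self._parse(fh.read())
            self._mtime = mtime
        # Unknown tokens map to themselves.
        return self._map.get(token.lower(), [token])
```

Checking the file on every lookup keeps the sketch short; a real filter would more likely check on a timer or at most once per request. If the synonyms are also applied at index time, documents indexed before a change would still reflect the old synonyms.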
Re: collectionDistribution vs SolrReplication
I think the main reason is ease of use. Warming is done the same way, by adding a newSearcher listener in solrconfig.xml.

On Sun, Jan 4, 2009 at 2:10 AM, Marc Sturlese wrote:

>
> Hey there,
>
> I would like to know the advantages of moving from:
> a master-slave system using CollectionDistribution with all their .sh
> scripts
> http://wiki.apache.org/solr/CollectionDistribution
>
> to:
> use SolrReplication and his solrconfig.xml configuration.
> http://wiki.apache.org/solr/SolrReplication
>
>
> Its tecnically much better or mainly for more easy use?
> Does SolrReplication do warming aswell?
>
> Checking performance numbers is solrReplication wiki page things seem to be
> similar except for the RAM, are the advantages about that?
>
> Thanks in advance!!
> --
> View this message in context:
> http://www.nabble.com/collectionDistribution-vs-SolrReplication-tp21269112p21269112.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>

-- Regards, Shalin Shekhar Mangar.