Very low filter cache hit ratio

2019-05-29 Thread Saurabh Sharma
Hi All, I am trying to run an index on solr cloud version 7.3.1 with 3 nodes. Planning to index the records using full index once a day and delta index every 30 minutes. Purpose to keep stale index was to utilize the cache of solr. But to my surprise, when I put real traffic on this index . cache

Re: Very low filter cache hit ratio

2019-05-29 Thread Shawn Heisey
On 5/29/2019 6:57 AM, Saurabh Sharma wrote: What can be the possible reasons for low cache usage? How can I leverage cache feature for high traffic indexes? Your usage apparently does not use the exact same query (or filter query, in the case of filterCache) very often. In order to achieve a

Re: Very low filter cache hit ratio

2019-05-29 Thread Saurabh Sharma
Hi Shwan, Many filters are common among the queries. AFAIK, filter cache are created against filters and by that logic one should get good hit ratio for those cached filter conditions.i tried to create a cache of 100K size and that too was not producing good hit ratio. Any document/suggetion about

Re: Very low filter cache hit ratio

2019-05-29 Thread Atita Arora
You can refer to this one: https://teaspoon-consulting.com/articles/solr-cache-tuning.html HTH, Atita On Wed, May 29, 2019 at 3:33 PM Saurabh Sharma wrote: > Hi Shwan, > > Many filters are common among the queries. AFAIK, filter cache are created > against filters and by that logic one should g

RE: Very low filter cache hit ratio

2019-05-29 Thread Markus Jelsma
Hello, What is missing in that article is you must never use NOW without rounding it down in a filter query. If you have it, round it down to an hour, day or minute to prevent flooding the filter cache. Regards, Markus -Original message- > From:Atita Arora > Sent: Wednesday 29th May 2

ExactSharedStatsCache vs LRUStatsCache

2019-05-29 Thread Walter Underwood
Running 6.6, why should I prefer one over the other? And what kind of cache does Exact use if it isn’t LRU? wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog)

Re: Very low filter cache hit ratio

2019-05-29 Thread Shawn Heisey
On 5/29/2019 7:33 AM, Saurabh Sharma wrote: Many filters are common among the queries. AFAIK, filter cache are created against filters and by that logic one should get good hit ratio for those cached filter conditions.i tried to create a cache of 100K size and that too was not producing good hit

Re: Very low filter cache hit ratio

2019-05-29 Thread Erick Erickson
You must show us the _exact_ filter queries you’re using, or at least a representative sample. Bumping the cache up very high is almost always the wrong thing to do. Each entry takes approximately maxDoc/8 bytes so unless your corpus is very small, you’ll eventually blow memory up. To Markus’

Re: problem indexing GPS metadata for video upload

2019-05-29 Thread Where is Where
Sorry Tim! I missed your last message about this issue! Thank you very much for the information. Is the latest 1.21 Tika Incorporated with the change already? and how about solr? Thanks! On Fri, May 3, 2019 at 11:28 AM Where is Where wrote: > Thank you very much Tim, I wonder how to make the Ti