Re: Performance problems with extremely common terms in collection (Solr 7.4)

2019-04-08 Thread Toke Eskildsen
On Mon, 2019-04-08 at 09:58 +1000, Ash Ramesh wrote: > We have a corpus of 50+ million documents in our collection. I've > noticed that some queries with specific keywords tend to be extremely > slow. > E.g. the q=`photography' or q='background'. After digging into the > raw documents, I could see

Re: Performance problems with extremely common terms in collection (Solr 7.4)

2019-04-08 Thread Ash Ramesh
Hi Toke, Thanks for the prompt reply. I'm glad to hear that this is a common problem. In regards to stop words, I've been thinking about trying that out. In our business case, most of these terms are keywords related to stock photography, therefore it's natural for 'photography' or 'background' to

Re: Solr spellcheck Collation JSON

2019-04-08 Thread Mikhail Khludnev
> > Previous Solr versions > -- > "spellcheck": { > ..., > "collations": [ > "collation":"account" <--correct format > ] However, it's not a JSON. On Mon, Apr 8, 2019 at 2:45 AM Moyer, Brett wrote: > Hello, > > Looks like a more recent Solr release introduced a bug for > collation. Doe

Solr 8.0.0 - CPU usage 100% when indexed documents

2019-04-08 Thread vishal patel
Hi I have configured 2 shards and 3 zoo keeper. When i indexed document in collection, my CPU usage becomes a full. I have attached thread dump. Is there Any changes needed in solrconfig.xml? Sent from Outlook

Solr ignores configuration file

2019-04-08 Thread jaanam
Dear recipients, Can you help me with the following issue: I should present my time stamps in solr in UTC+2 instead of UTC. How can I do it ? I've created the following question in StackOverflow https://stackoverflow.com/questions/55530142/solr-7-6-0-ignores-configuration-file-bin-solr-in-

Re: Solr 8.0.0 - CPU usage 100% when indexed documents

2019-04-08 Thread Jörn Franke
Can you please describe your scenario in detail ? How does your load process look like (custom module? How many threads?)? How many files do you try to index ? What is their format? How does your solr config look like? How many cores do you have? What else is installed on the Solr server? Whic

Solr Cloud - Data Import from Cassandra

2019-04-08 Thread Furkan Çifçi
Hello everyone, We are using Solr(7.1) on cloud mode and trying to get data from Cassandra source. Can't import data from Cassandra. In the error logs; Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to PropertyWriter implementation:SimplePropertiesWri

Re: Solr ignores configuration file

2019-04-08 Thread Nitin Kumar
One workaround is while indexing add +2 hours. On Mon 8 Apr, 2019, 4:16 PM , wrote: > > Dear recipients, > > Can you help me with the following issue: > > I should present my time stamps in solr in UTC+2 instead of UTC. How can > I do it ? > > I've created the following question in StackOverflow

Re: Solr ignores configuration file

2019-04-08 Thread Jörn Franke
If there is no daylight savings... I would not do this. Accept that Solr is in UTC and do the conversion at UI level. Otherwise in case of daylight savings introduction / removal you run into a lot of problems (reindexing etc) > Am 08.04.2019 um 13:08 schrieb Nitin Kumar : > > One workaround is

I it possible to configure solr to show time stamps without the 'Z'- character in the end

2019-04-08 Thread Miettinen Jaana (STAT)
Dear recipient, I have a problem in solr: I should add several (old) time stamps into my solr documents, but all of them are in local time (UTC+2 or UTC+3 depending on day-light-saving situation). As default solr excepts all time stamps to be in UTC-time and adds the 'Z'-character into the end

Re: Solr 8.0.0 - CPU usage 100% when indexed documents

2019-04-08 Thread vishal patel
I have created two solr shards with 3 zoo keeper. First do upconfig in zoo keeper then start the both solr with different port then create a "actionscomments" collection using API call. When I indexed one document in actionscomments, my CPU utilization go high. Note : upconfig command :: zkcli

Re: Performance problems with extremely common terms in collection (Solr 7.4)

2019-04-08 Thread Michael Gibney
In addition to Toke's suggestions (and those in the linked article), some more ideas: If single-term, bare queries are slow, it might be productive to check config/performance of your queryResultCache (I realize this doesn't directly address the concern of slow queries, but might nonetheless be hel

Moving index from stand-alone Solr 6.6.0 to 3 node Solr Cloud 6.6.0 with Zookeeper

2019-04-08 Thread kevinc
Hi all, I'm sure I've done this before but this seems to be falling down a bit and I was wondering if anyone had any helpful ideas. I have a large index (51GB) that exists in a 4 node Solr Cloud instance. The reprocessing for this takes a long time and so we normally reindex on a secondary cluste

Re: I it possible to configure solr to show time stamps without the 'Z'- character in the end

2019-04-08 Thread Erick Erickson
When you ask for a field from Solr, it returns _exactly_ what you gave it. So if you input contains the “Z”, the output will. You have to massage it however you want if you want something different. I can imagine at least 3 ways to do this: 1> create a second field with stored=“true”, indexed=“

Re: Moving index from stand-alone Solr 6.6.0 to 3 node Solr Cloud 6.6.0 with Zookeeper

2019-04-08 Thread Erick Erickson
Here’s what I’d do: 1> Just spin up a _one_ node cluster and copy the index from your offline process and start Solr. I’l probably do this with Solr down. 2> Use the ADDREPLICA command to build out that cluster. The index copy associated with ADDREPLICA is robust. I’d wait until each replica sho

Re: I it possible to configure solr to show time stamps without the 'Z'- character in the end

2019-04-08 Thread Shawn Heisey
On 4/8/2019 4:38 AM, Miettinen Jaana (STAT) wrote: I have a problem in solr: I should add several (old) time stamps into my solr documents, but all of them are in local time (UTC+2 or UTC+3 depending on day-light-saving situation). As default solr excepts all time stamps to be in UTC-time and

solr wild card search

2019-04-08 Thread Anil
Hi Team, Good Morning. I am storing url in string field. wild card search is giving following error. "error":{ "metadata":[ "error-class","org.apache.solr.common.SolrException", "root-error-class","org.apache.solr.parser.ParseException"], "msg":"org.apache.solr.search.Syn

Re: solr wild card search

2019-04-08 Thread Erick Erickson
Show us the exact search you’re using, both the failure and success case please. Most likely you need to escape things like the colon… Best, Erick > On Apr 8, 2019, at 8:19 AM, Anil wrote: > > Hi Team, > > Good Morning. > > I am storing url in string field. wild card search is giving followi

Re: Solr 8.0.0 - CPU usage 100% when indexed documents

2019-04-08 Thread Shawn Heisey
On 4/8/2019 7:22 AM, vishal patel wrote: I have created two solr shards with 3 zoo keeper. First do upconfig in zoo keeper then start the both solr with different port then create a "actionscomments" collection using API call. When I indexed one document in actionscomments, my CPU utilization

Re: Moving index from stand-alone Solr 6.6.0 to 3 node Solr Cloud 6.6.0 with Zookeeper

2019-04-08 Thread Shawn Heisey
On 4/8/2019 8:59 AM, kevinc wrote: I have reindexed to a single Solr 6.6.0 index and spun up a new 3 node Solr cluster with 1 shard and replication factor of 3. I want to copy over the index and have it replicate to the rest of the cluster. I have taken a copy of the data directory from the repr

Re: Moving index from stand-alone Solr 6.6.0 to 3 node Solr Cloud 6.6.0 with Zookeeper

2019-04-08 Thread Shawn Heisey
On 4/8/2019 10:06 AM, Shawn Heisey wrote: * Make sure you have a copy of the source index directory. * Do not copy the tlog directory from the source. * Create the collection in the target cloud. * Shut down the target cloud completely. * Delete all the index directories in the cloud. * Copy the

Re: solr wild card search

2019-04-08 Thread Anil
Hi Eric, url:"https://facebook.com/posts/123456"; is working. url:https://facebook.com/posts * is not working. i tried to escape forward slash and dot (.).. didnt help. i missed colon. let me try. Thanks. Regards, Anil On Mon, 8 Apr 2019 at 21:02, Erick Erick

Re: solr wild card search

2019-04-08 Thread Erick Erickson
See: https://lucene.apache.org/solr/guide/6_6/the-standard-query-parser.html > On Apr 8, 2019, at 9:04 AM, Anil wrote: > > Hi Eric, > > url:"https://facebook.com/posts/123456"; is working. > url:https://facebook.com/posts * is not > working. > > i tried to es

Re: SOLR Text Field

2019-04-08 Thread Dave Beckstrom
Shawn, I can't thank you enough for taking the time to reply to my question and for the info you shared. I don't believe I ever found one example by Googling of how to define a simple text field in SOLR. I saw some examples of Text_General but as you saw it wasn't what I needed. Based on the in

Re: SOLR Text Field

2019-04-08 Thread Shawn Heisey
On 4/8/2019 10:27 AM, Dave Beckstrom wrote: SOLR really should ship with a sample text field defined even if commented out and only for example purposes only. That would have been most helpful. Even a FAQ somewhere would have been helpful. There are two example configs in the latest version o

Moving index from stand-alone Solr 6.6.0 to 3 node Solr Cloud 6.6.0 with Zookeeper

2019-04-08 Thread Kevin Cunningham
Hi all, I'm sure I've done this before but this seems to be falling down a bit and I was wondering if anyone had any helpful ideas. I have a large index (51GB) that exists in a 4 node Solr Cloud instance. The reprocessing for this takes a long time and so we normally reindex on a secondary clus

Solr Cache clear

2019-04-08 Thread Lewin Joy (TMNA)
** PROTECTED 関係者外秘 How do I clear the solr caches without restarting Solr cluster? Is there a way? I tried reloading the collection. But, it did not help. Thanks, Lewin

Re: Solr Cache clear

2019-04-08 Thread Shawn Heisey
On 4/8/2019 2:14 PM, Lewin Joy (TMNA) wrote: How do I clear the solr caches without restarting Solr cluster? Is there a way? I tried reloading the collection. But, it did not help. When I reload a core on a test setup (solr 7.4.0), I see cache sizes reset. What evidence are you seeing that rel

Re: Solr Cache clear

2019-04-08 Thread Alexandre Rafalovitch
You may have warming queries to prepopulate your cache. Check your solrconfig.xml. Regards, Alex On Mon, Apr 8, 2019, 4:16 PM Lewin Joy (TMNA), wrote: > ** PROTECTED 関係者外秘 > How do I clear the solr caches without restarting Solr cluster? > Is there a way? > I tried reloading the collection.

Re: solr wild card search

2019-04-08 Thread Anil
Thanks Eric. escaped colon and it worked. my bad.. i missed it :) On Mon, 8 Apr 2019 at 21:55, Erick Erickson wrote: > See: > https://lucene.apache.org/solr/guide/6_6/the-standard-query-parser.html > > > On Apr 8, 2019, at 9:04 AM, Anil wrote: > > > > Hi Eric, > > > > url:"https://facebook.com/

Sql entity processor sortedmapbackedcache out of memory issue

2019-04-08 Thread Srinivas Kashyap
Hello, I'm using DIH to index the data and the structure of the DIH is like below for solr core: 16 child entities During indexing, since the number of requests being made to database was high(to process one document 17 queries) and was utilizing most of connections of database thereby bloc