[jira] [Commented] (SOLR-14034) remove deprecated min_rf references

2020-12-13 Thread Gaurav Agarwal (Jira)


[ 
https://issues.apache.org/jira/browse/SOLR-14034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17248530#comment-17248530
 ] 

Gaurav Agarwal commented on SOLR-14034:
---

Hello Christine, 

I am new here and looking to contribute. Just wondering if this issue is still 
open and available to be worked on?  Seems like a good place to start.

> remove deprecated min_rf references
> ---
>
> Key: SOLR-14034
> URL: https://issues.apache.org/jira/browse/SOLR-14034
> Project: Solr
>  Issue Type: Task
>Reporter: Christine Poerschke
>Priority: Blocker
>  Labels: newdev
> Fix For: master (9.0)
>
>
> * {{min_rf}} support was added under SOLR-5468 in version 4.9 
> (https://github.com/apache/lucene-solr/blob/releases/lucene-solr/4.9.0/solr/solrj/src/java/org/apache/solr/client/solrj/request/UpdateRequest.java#L50)
>  and deprecated under SOLR-12767 in version 7.6 
> (https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.6.0/solr/solrj/src/java/org/apache/solr/client/solrj/request/UpdateRequest.java#L57-L61)
> * http://lucene.apache.org/solr/7_6_0/changes/Changes.html and 
> https://lucene.apache.org/solr/guide/8_0/major-changes-in-solr-8.html#solr-7-6
>  both clearly mention the deprecation
> This ticket is to fully remove {{min_rf}} references in code, tests and 
> documentation.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Created] (SOLR-15045) Commit through curl command is causing delay in issuing commit

2020-12-13 Thread Raj Yadav (Jira)
Raj Yadav created SOLR-15045:


 Summary: Commit through curl command is causing delay in issuing 
commit
 Key: SOLR-15045
 URL: https://issues.apache.org/jira/browse/SOLR-15045
 Project: Solr
  Issue Type: Bug
  Security Level: Public (Default Security Level. Issues are Public)
  Components: SolrCloud
Affects Versions: 8.5.2
 Environment: Operating system: Linux (centos 7.7.1908)
Reporter: Raj Yadav


Hi All,

When we issue commit through curl command, not all the shards are getting 
`start commit` requests at the same time.


*Solr Setup Detail : (Running in solrCloud mode)*
It has 6 shards, and each shard has only one replica (which is also a
leader) and the replica type is NRT.
Each shards are hosted on the separate physical host.

Zookeeper => We are using external zookeeper ensemble (3 separate node
cluster)

*Shard and Host name*
shard1_0=>solr_199
shard1_1=>solr_200
shard2_0=> solr_254
shard2_1=> solr_132
shard3_0=>solr_133
shard3_1=>solr_198

*Request rate on the system is currently zero and only hourly indexing*
*running on it.*

We are using curl command to issue commit.
{code:java}
curl
"http://solr_254:8389/solr/my_collection/update?openSearcher=true&commit=true&wt=json"{code}

(Using solr_254 host to issue commit)

On using the above command all the shards have started processing commit (i.e
getting `start commit` request) except the one used in curl command (i.e
shard2_0 which is hosted on solr_254). Individually each shards takes around
10 to 12 min to process hard commit (most of this time is spent on reloading
external files).
As per logs, shard2_0 is getting `start commit` request after 10 minutes
(approx). This leads to following timeout error.



{code:java}
2020-12-06 18:47:47.013 ERROR
org.apache.solr.client.solrj.SolrServerException: Timeout occured while
waiting response from server at:
http://solr_132:9744/solr/my_collection_shard2_1_replica_n21/update?update.distrib=TOLEADER&distrib.from=http%3A%2F%2Fsolr_254%3A9744%2Fsolr%2Fmy_collection_shard2_0_replica_n11%2F
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:407)
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:753)
      at
org.apache.solr.client.solrj.impl.ConcurrentUpdateHttp2SolrClient.request(ConcurrentUpdateHttp2SolrClient.java:369)
      at
org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1290)
      at
org.apache.solr.update.SolrCmdDistributor.doRequest(SolrCmdDistributor.java:344)
      at
org.apache.solr.update.SolrCmdDistributor.lambda$submit$0(SolrCmdDistributor.java:333)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at
com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:180)
      at
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:210)
      at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      at java.lang.Thread.run(Thread.java:748)
    Caused by: java.util.concurrent.TimeoutException
      at
org.eclipse.jetty.client.util.InputStreamResponseListener.get(InputStreamResponseListener.java:216)
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:398)
      ... 13 more{code}



Above timeout error is between solr_254 and solr_132. Similar errors are
there between solr_254 and other 4 shards


Since query load is zero, mostly CPU utilization is around 3%.
After issuing curl commit command, CPU goes up to 14% on all shards except
shard2_0 (host: solr_254, the one used in curl command).
And after 10 minutes (i.e after getting the `start commit` request)  CPU  on
shard2_0 also goes up to 14%.

As I mentioned earlier each shards take around 10-12 mins to process commit
and due to delay in starting commit process on one shard (shard2_0) our
overall commit time is doubled now. (22-24 minutes approx).

*We are observing this delay in both hard and soft commit.*


In our solr-5.4.0(having similar setup), we use similar command curl command
to issue commit and there all the shards are getting `start commit` request
at same time. Including the one used in curl command.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Updated] (SOLR-15045) Commit through curl command is causing delay in issuing commit

2020-12-13 Thread Raj Yadav (Jira)


 [ 
https://issues.apache.org/jira/browse/SOLR-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raj Yadav updated SOLR-15045:
-
Description: 
Hi All,

When we issue commit through curl command, not all the shards are getting 
`start commit` requests at the same time.

*Solr Setup Detail : (Running in solrCloud mode)*
 It has 6 shards, and each shard has only one replica (which is also a
 leader) and the replica type is NRT.
 Each shards are hosted on the separate physical host.

Zookeeper => We are using external zookeeper ensemble (3 separate node
 cluster)

*Shard and Host name*
 shard1_0=>solr_199
 shard1_1=>solr_200
 shard2_0=> solr_254
 shard2_1=> solr_132
 shard3_0=>solr_133
 shard3_1=>solr_198

*Request rate on the system is currently zero and only hourly indexing*
 *running on it.*

We are using curl command to issue commit.
{code:java}
curl
"http://solr_254:8389/solr/my_collection/update?openSearcher=true&commit=true&wt=json"{code}
(Using solr_254 host to issue commit)

On using the above command all the shards have started processing commit (i.e
 getting `start commit` request) except the one used in curl command (i.e
 shard2_0 which is hosted on solr_254). Individually each shards takes around
 10 to 12 min to process hard commit (most of this time is spent on reloading
 external files).
 As per logs, shard2_0 is getting `start commit` request after 10 minutes
 (approx). This leads to following timeout error.
{code:java}
2020-12-06 18:47:47.013 ERROR
org.apache.solr.client.solrj.SolrServerException: Timeout occured while
waiting response from server at:
http://solr_132:9744/solr/my_collection_shard2_1_replica_n21/update?update.distrib=TOLEADER&distrib.from=http%3A%2F%2Fsolr_254%3A9744%2Fsolr%2Fmy_collection_shard2_0_replica_n11%2F
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:407)
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:753)
      at
org.apache.solr.client.solrj.impl.ConcurrentUpdateHttp2SolrClient.request(ConcurrentUpdateHttp2SolrClient.java:369)
      at
org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1290)
      at
org.apache.solr.update.SolrCmdDistributor.doRequest(SolrCmdDistributor.java:344)
      at
org.apache.solr.update.SolrCmdDistributor.lambda$submit$0(SolrCmdDistributor.java:333)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
      at
com.codahale.metrics.InstrumentedExecutorService$InstrumentedRunnable.run(InstrumentedExecutorService.java:180)
      at
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:210)
      at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
      at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
      at java.lang.Thread.run(Thread.java:748)
    Caused by: java.util.concurrent.TimeoutException
      at
org.eclipse.jetty.client.util.InputStreamResponseListener.get(InputStreamResponseListener.java:216)
      at
org.apache.solr.client.solrj.impl.Http2SolrClient.request(Http2SolrClient.java:398)
      ... 13 more{code}
Above timeout error is between solr_254 and solr_132. Similar errors are
 there between solr_254 and other 4 shards

Since query load is zero, mostly CPU utilization is around 3%.
 After issuing curl commit command, CPU goes up to 14% on all shards except
 shard2_0 (host: solr_254, the one used in curl command).
 And after 10 minutes (i.e after getting the `start commit` request)  CPU  on
 shard2_0 also goes up to 14%.

As I mentioned earlier each shards take around 10-12 mins to process commit
 and due to delay in starting commit process on one shard (shard2_0) our
 overall commit time is doubled now. (22-24 minutes approx).

*We are observing this delay in both hard and soft commit.*

In our solr-5.4.0(having similar setup), we use the similar curl command to 
issue commit, and there all the shards are getting `start commit` request at 
same time. Including the one used in curl command.

  was:
Hi All,

When we issue commit through curl command, not all the shards are getting 
`start commit` requests at the same time.


*Solr Setup Detail : (Running in solrCloud mode)*
It has 6 shards, and each shard has only one replica (which is also a
leader) and the replica type is NRT.
Each shards are hosted on the separate physical host.

Zookeeper => We are using external zookeeper ensemble (3 separate node
cluster)

*Shard and Host name*
shard1_0=>solr_199
shard1_1=>solr_200
shard2_0=> solr_254
shard2_1=> solr_132
shard3_0=>solr_133
shard3_1=>solr_198

*Request rate on the system is currently zero and only hourly indexing*
*running on it.*

We are using curl command to issue commit.
{code:java}
curl
"http://sol

[jira] [Comment Edited] (SOLR-14848) Demonstrate how Solr 8, master, or any version previous Solr version before pales next to the reference branch.

2020-12-13 Thread Mark Robert Miller (Jira)


[ 
https://issues.apache.org/jira/browse/SOLR-14848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17248722#comment-17248722
 ] 

Mark Robert Miller edited comment on SOLR-14848 at 12/14/20, 2:58 AM:
--

Well, I’ve slept all weekend, this kind of life style came much easier with a 
teenage body and natural inclination vs forced march. I’m essentially ready 
here, but I’m tired, blown out, etc etc. And I hate to look, but I’m guessing 
Xmas is tomorrow or something. So likely nothing to show until the new year. 


was (Author: markrmiller):
Well, I’ve slept all weekend, this kind of life style came much easier with a 
teenage body and natural inclination for forced march. I’m essentially ready 
here, but I’m tired, blown out, etc etc. And I hate to look, but I’m guessing 
Xmas is tomorrow or something. So likely nothing to show until the new year. 

> Demonstrate how Solr 8, master, or any version previous Solr version before 
> pales next to the reference branch.
> ---
>
> Key: SOLR-14848
> URL: https://issues.apache.org/jira/browse/SOLR-14848
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Mark Robert Miller
>Priority: Major
>
> I've got a lot of code here and I have and will be claiming that it's an 
> order of magnitude better than what has come before.
> I've been too busy and will be busy for a bit, so I have not been too 
> concerned about backing that up really at all. Most people have no clue what 
> I have here, some people have an inkling, some people are just totally 
> confused, some people think I  maybe have some fast tests, or a slightly more 
> stable system, or maybe some neato performance changes, or even maybe some 
> poorly coded speed hacks. Maybe one or two has a more hope filled guess.
> Almost everyone will think, "all that new code, mostly done by a single 
> person? I know a lot of smart and smarter devs, who cares what this guy is up 
> to. Why would I leave the safety of the branch I know and feel safe with? By 
> definition, the existing stuff is the battle hardened, tried and true leader, 
> and how are you going to come in here without disrupting our comfortable 
> thing?"
> Well, fair enough. I won't try to come and disrupt anything. Instead, there 
> will be benchmarks, stress tests, chaos monkeys, long term endurance tests, 
> and all sorts of fun competitions. Spy vs Spy. I mean Solr vs Solr.
> And while this vanilla version of my previous work has avoided a lot of great 
> changes and improvements I can make (a "remastered" Solr sensible, initial 
> mandate that puts a hand or two behind my back) ...
> ... The reference branch will trounce previous versions of Solr in benchmark 
> after benchmark. It will keep pumping through endurance tests and performance 
> challenges at impressive speed while Solr proper will struggle to finish in a 
> reasonable time or almost certainly, often enough, simply fail to complete 
> the task. The reference branch will devour available resources and fly 
> through work. Solr master will struggle and meander, sometimes in the wrong 
> direction, while leaving the hardware with gobs of idle cpu to chill with 
> (unless it's using most of the cpu for garbage collection at some points).
> This is not meant to brag or dis previous versions of Solr. I was heavily 
> involved in building them. This is the result of dedication and time more 
> than any of my brilliance - the above is simply meant to  state the path that 
> I see coming. As this comparison information and other experiences and 
> stories start to emerge, that master branch won't look nearly so safe or 
> comfortable anymore. And it's at that point that we will find out if anyone 
> is interested in testing our tolerance for disruption by trying to figure out 
> how to get master into the reference branch as opposed to the other way 
> around.
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Commented] (SOLR-14848) Demonstrate how Solr 8, master, or any version previous Solr version before pales next to the reference branch.

2020-12-13 Thread Mark Robert Miller (Jira)


[ 
https://issues.apache.org/jira/browse/SOLR-14848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17248722#comment-17248722
 ] 

Mark Robert Miller commented on SOLR-14848:
---

Well, I’ve slept all weekend, this kind of life style came much easier with a 
teenage body and natural inclination for forced march. I’m essentially ready 
here, but I’m tired, blown out, etc etc. And I hate to look, but I’m guessing 
Xmas is tomorrow or something. So likely nothing to show until the new year. 

> Demonstrate how Solr 8, master, or any version previous Solr version before 
> pales next to the reference branch.
> ---
>
> Key: SOLR-14848
> URL: https://issues.apache.org/jira/browse/SOLR-14848
> Project: Solr
>  Issue Type: Sub-task
>Reporter: Mark Robert Miller
>Priority: Major
>
> I've got a lot of code here and I have and will be claiming that it's an 
> order of magnitude better than what has come before.
> I've been too busy and will be busy for a bit, so I have not been too 
> concerned about backing that up really at all. Most people have no clue what 
> I have here, some people have an inkling, some people are just totally 
> confused, some people think I  maybe have some fast tests, or a slightly more 
> stable system, or maybe some neato performance changes, or even maybe some 
> poorly coded speed hacks. Maybe one or two has a more hope filled guess.
> Almost everyone will think, "all that new code, mostly done by a single 
> person? I know a lot of smart and smarter devs, who cares what this guy is up 
> to. Why would I leave the safety of the branch I know and feel safe with? By 
> definition, the existing stuff is the battle hardened, tried and true leader, 
> and how are you going to come in here without disrupting our comfortable 
> thing?"
> Well, fair enough. I won't try to come and disrupt anything. Instead, there 
> will be benchmarks, stress tests, chaos monkeys, long term endurance tests, 
> and all sorts of fun competitions. Spy vs Spy. I mean Solr vs Solr.
> And while this vanilla version of my previous work has avoided a lot of great 
> changes and improvements I can make (a "remastered" Solr sensible, initial 
> mandate that puts a hand or two behind my back) ...
> ... The reference branch will trounce previous versions of Solr in benchmark 
> after benchmark. It will keep pumping through endurance tests and performance 
> challenges at impressive speed while Solr proper will struggle to finish in a 
> reasonable time or almost certainly, often enough, simply fail to complete 
> the task. The reference branch will devour available resources and fly 
> through work. Solr master will struggle and meander, sometimes in the wrong 
> direction, while leaving the hardware with gobs of idle cpu to chill with 
> (unless it's using most of the cpu for garbage collection at some points).
> This is not meant to brag or dis previous versions of Solr. I was heavily 
> involved in building them. This is the result of dedication and time more 
> than any of my brilliance - the above is simply meant to  state the path that 
> I see coming. As this comparison information and other experiences and 
> stories start to emerge, that master branch won't look nearly so safe or 
> comfortable anymore. And it's at that point that we will find out if anyone 
> is interested in testing our tolerance for disruption by trying to figure out 
> how to get master into the reference branch as opposed to the other way 
> around.
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Commented] (SOLR-14827) Refactor schema loading to not use XPath

2020-12-13 Thread Noble Paul (Jira)


[ 
https://issues.apache.org/jira/browse/SOLR-14827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17248784#comment-17248784
 ] 

Noble Paul commented on SOLR-14827:
---

I added a simple testcase to load multiple schema objects

The time taken has come down from ~540ms to ~270ms

> Refactor schema loading to not use XPath
> 
>
> Key: SOLR-14827
> URL: https://issues.apache.org/jira/browse/SOLR-14827
> Project: Solr
>  Issue Type: Task
>Reporter: Noble Paul
>Assignee: Noble Paul
>Priority: Major
>  Labels: perfomance
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> XPath is slower compared to DOM. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org