[jira] [Comment Edited] (SOLR-13933) Cluster mode Stress test suite

Ishan Chattopadhyaya (Jira) Sat, 08 Aug 2020 04:57:22 -0700


    [ https://issues.apache.org/jira/browse/SOLR-13933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17173556#comment-17173556 ]


Ishan Chattopadhyaya edited comment on SOLR-13933 at 8/8/20, 11:56 AM:
-----------------------------------------------------------------------

I am actively working on this here: 
https://github.com/TheSearchStack/solr-bench/tree/stress-harness

Here is a sample suite: 
https://github.com/TheSearchStack/solr-bench/blob/stress-harness/rolling.json
It starts 3 Solr nodes with 1GB RAM each.
It has two tasks: 
# task1, which creates a bunch of collections and indexes some data into them.
# task2, which waits for task1 to end and then does a rolling restart 
of the cluster (i.e. it restarts a node, waits until all replicas on the 
restarted node are active, and then proceeds to the next node until all nodes 
have been restarted).

While doing this, it measures the timings on all of them.
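
For illustration, the rolling-restart loop described above could be sketched 
roughly as follows. This is not the solr-bench implementation: the function 
names, the injected restart_node and fetch_status callables, and the timeout 
values are all hypothetical. The only thing assumed about Solr is the shape of 
a Collections API CLUSTERSTATUS response (cluster -> live_nodes, and 
cluster -> collections -> shards -> replicas with "node_name" and "state").

```python
import time
from typing import Callable, Dict, Iterable


def replicas_active_on_node(cluster_status: dict, node_name: str) -> bool:
    """True if node_name is live and every replica it hosts is 'active'.

    cluster_status is assumed to have the shape of a CLUSTERSTATUS response.
    """
    cluster = cluster_status.get("cluster", {})
    # A replica can still be recorded as 'active' while its node is down,
    # so the node must also appear in live_nodes.
    if node_name not in cluster.get("live_nodes", []):
        return False
    for collection in cluster.get("collections", {}).values():
        for shard in collection.get("shards", {}).values():
            for replica in shard.get("replicas", {}).values():
                if (replica.get("node_name") == node_name
                        and replica.get("state") != "active"):
                    return False
    return True


def rolling_restart(
    nodes: Iterable[str],
    restart_node: Callable[[str], None],   # hypothetical: restarts one node
    fetch_status: Callable[[], dict],      # hypothetical: returns CLUSTERSTATUS
    poll_interval: float = 1.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, float]:
    """Restart nodes one at a time; return seconds taken per node."""
    timings: Dict[str, float] = {}
    for node in nodes:
        started = time.monotonic()
        restart_node(node)
        # Block until the restarted node has fully recovered before moving on.
        while not replicas_active_on_node(fetch_status(), node):
            if time.monotonic() - started > timeout:
                raise TimeoutError(f"replicas on {node} did not become active")
            sleep(poll_interval)
        timings[node] = time.monotonic() - started
    return timings
```

Passing the restart and status functions in as arguments keeps the loop 
independent of how nodes are actually managed (ssh, docker, local processes), 
and makes it easy to exercise with canned cluster states.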

Here is another sample suite: 
https://github.com/TheSearchStack/solr-bench/blob/stress-harness/workflow.json

It runs various tasks, task1 through task5, including indexing, creating many 
collections, shard splitting, restarting a node, and validating the number of 
documents in a collection (this one is WIP). As defined, some of these tasks 
run in parallel with each other, while others are blocking tasks.
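
As a rough illustration of mixing parallel and blocking tasks, here is a 
minimal sketch of a runner in which each task may wait for other tasks to 
finish before it starts. It does not reflect the workflow.json schema or the 
solr-bench code: run_workflow, the (action, wait_for) task format, and the 
task names in the usage example are assumptions made for this example only.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

# Hypothetical task format: name -> (action, names of tasks to wait for).
Task = Tuple[Callable[[], None], List[str]]


def run_workflow(tasks: Dict[str, Task]) -> Dict[str, float]:
    """Run all tasks concurrently, except that a task blocks until every
    task listed in its wait_for has finished. Returns seconds per task.

    Caveats of this sketch: cyclic wait_for chains deadlock, and a task
    still runs if one of the tasks it waited for raised an exception.
    """
    done = {name: threading.Event() for name in tasks}
    timings: Dict[str, float] = {}

    def run(name: str) -> None:
        action, wait_for = tasks[name]
        for dep in wait_for:
            done[dep].wait()          # blocking task: wait for predecessors
        started = time.monotonic()
        try:
            action()
        finally:
            timings[name] = time.monotonic() - started
            done[name].set()          # release any tasks waiting on this one

    # One worker per task, so a waiting task can never starve the task it
    # is waiting for.
    with ThreadPoolExecutor(max_workers=max(1, len(tasks))) as pool:
        futures = [pool.submit(run, name) for name in tasks]
        for future in futures:
            future.result()           # re-raise any task failure
    return timings


if __name__ == "__main__":
    log: List[str] = []
    timings = run_workflow({
        "task1": (lambda: log.append("index"), []),
        "task2": (lambda: log.append("restart"), ["task1"]),  # waits for task1
        "task3": (lambda: log.append("split"), []),           # runs in parallel
    })
    print(log, timings)
```

Recording a duration per task mirrors the idea of measuring timings for 
every step of a suite.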


I intend to wrap this up quickly and start running automated tests for 
medium-scale Solr clusters (say, 50-100 nodes, ~2GB RAM each) on each of our 
branches, publishing the results periodically. This will be especially useful 
for benchmarking the stability of the cluster with SOLR-13951 and SOLR-14636.


> Cluster mode Stress test suite 
> -------------------------------
>
>                 Key: SOLR-13933
>                 URL: https://issues.apache.org/jira/browse/SOLR-13933
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Ishan Chattopadhyaya
>            Assignee: Ishan Chattopadhyaya
>            Priority: Major
>
> We need a stress test harness based on 10s or 100s of nodes, 1000s of 
> collection API operations, overseer operations, etc. This suite should run 
> nightly and publish its results publicly, so as to help with:
> # Uncovering stability problems
> # Benchmarking (timings, resource metrics, etc.) of collection operations
> # Measuring indexing/querying performance
> # Validating the accuracy of potential improvements
> References:
> SOLR-10317
> https://github.com/lucidworks/solr-scale-tk
> https://github.com/shalinmangar/solr-perf-tools
> Lucene benchmarks


