It is worth doing a basic CPU speed test. Once you have enough RAM, indexing is 
mostly CPU-bound.

Try something like this. Run it once to get the tgz file cached in OS file 
buffers, then once to time it.

time gunzip < solr-6.4.1.tgz > /dev/null

I get 1.3 seconds on an Amazon c4.8xlarge and 0.8 seconds on my MacBook. A 
bigger file would be a better test, but that is the general idea.

Also, are you running 6.4.1 on both hosts? The new metrics code caused some 
slowdowns from 6.3.0 to 6.4.0.

On the other hand, I’m indexing about a million documents per minute into a 16 
node cluster (4 shards, 4-way replication factor) built with the c4.8xlarge 
instances. I’m running 64 indexing threads and 1000 doc batches. It might go a 
bit faster after we switch the cloud driver in SolrJ.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)


> On Feb 13, 2017, at 9:10 PM, Zheng Lin Edwin Yeo <edwinye...@gmail.com> wrote:
> 
> No, currently the server is slower, and my laptop is faster.
> 
> But shouldn't the server be faster, since it has a much better
> specification, like more RAM, better processor and SSD drive.
> 
> Regards,
> Edwin
> 
> 
> On 14 February 2017 at 12:26, Walter Underwood <wun...@wunderwood.org>
> wrote:
> 
>> Are you sure the server is faster? My MacBook Pro is a lot faster than
>> many of our Amazon EC2 servers.
>> 
>> wunder
>> Walter Underwood
>> wun...@wunderwood.org
>> http://observer.wunderwood.org/  (my blog)
>> 
>> 
>>> On Feb 13, 2017, at 8:12 PM, Zheng Lin Edwin Yeo <edwinye...@gmail.com>
>> wrote:
>>> 
>>> Hi,
>>> 
>>> I'm facing the issue of the indexing speed is slower is slower on a
>> server
>>> with a much better specification with Solr running on SSD, as compared
>> to a
>>> laptop with a normal hard disk.
>>> 
>>> Both the system has the exact same configurations. The configurations are
>>> first setup on the laptop, before being replicate to the server.
>>> 
>>> The setup is Solr 6.4.1, of 1 shard with 2 replica, using external
>>> ZooKeeper 3.4.8. The only difference is that in my laptop, both the
>> shards
>>> and ZooKeeper are on the same hard disk, while a the server, the
>> ZooKeeper
>>> is running on it's own hard disk, and each of the shards are also running
>>> on a separate hard disk. From what I know, this configuration should
>> result
>>> in improving the performance, instead of making it worse?
>>> 
>>> What could be the other reasons that this could happen?
>>> 
>>> I'm running on Solr 6.4.1
>>> 
>>> Regards,
>>> Edwin
>> 
>> 

Reply via email to