smarthanwang opened a new pull request #3721:
URL: https://github.com/apache/hadoop/pull/3721


   
   JIRA:  https://issues.apache.org/jira/browse/HADOOP-18023
   
   ### Description of PR
   
   Allow hadoop fs -cp command to  run with multi-thread to improve copy speed.
   
   ### How was this patch tested?
   It's useful to allow  -cp command to  run with multi-thread,  like the 
improvement we done for -put/-get commands.  It would reduce about 90% time 
cost when run with 10 threads in my test cases.
   
   #### Source dir:  1 dir  401 files  2.3G 
   ##### Test 1: run with single thread
   ```
     time hadoop fs -cp /tmp/data/test /tmp/data/t1
     real    1m9.394s
     user    0m16.688s
     sys    0m5.331s
   ```
   ##### Test 2: run with 10 threads
   ```
     time hadoop fs -cp -t 10 /tmp/data/test /tmp/data/t2
     real    0m8.217s
     user    0m19.864s
     sys    0m8.776s
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to