smarthanwang opened a new pull request #3721: URL: https://github.com/apache/hadoop/pull/3721
JIRA: https://issues.apache.org/jira/browse/HADOOP-18023

### Description of PR

Allow the hadoop fs -cp command to run with multiple threads to improve copy speed.

### How was this patch tested?

It is useful to let the -cp command run with multiple threads, similar to the improvement already made for the -put/-get commands. In my test cases, running with 10 threads reduced the time cost by about 90%.

#### Source dir: 1 dir, 401 files, 2.3G

##### Test 1: run with a single thread

```
time hadoop fs -cp /tmp/data/test /tmp/data/t1

real 1m9.394s
user 0m16.688s
sys 0m5.331s
```

##### Test 2: run with 10 threads

```
time hadoop fs -cp -t 10 /tmp/data/test /tmp/data/t2

real 0m8.217s
user 0m19.864s
sys 0m8.776s
```
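
For readers who want to check the "about 90%" figure, the following is a small sketch that derives the speedup and the time reduction from the two wall-clock ("real") times reported above. The variable names are illustrative only, and the script is not part of the patch; it assumes `awk` is available for the floating-point arithmetic.

```shell
#!/bin/sh
# Wall-clock ("real") times from the two test runs, converted to seconds.
single=69.394   # Test 1: 1m9.394s, single thread
multi=8.217     # Test 2: 0m8.217s, -t 10

# Speedup = single-thread time / multi-thread time.
speedup=$(awk -v a="$single" -v b="$multi" 'BEGIN { printf "%.2f", a / b }')

# Reduction = share of the single-thread time that was saved, in percent.
reduction=$(awk -v a="$single" -v b="$multi" 'BEGIN { printf "%.1f", (a - b) / a * 100 }')

echo "speedup: ${speedup}x, time reduction: ${reduction}%"
```

With these inputs the result is roughly an 8.4x speedup and an 88% reduction in wall-clock time, in line with the "about 90%" stated in the description. The `user` and `sys` times are higher in the 10-thread run, so the gain comes from doing the work in parallel rather than from doing less work.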
