[ 
https://issues.apache.org/jira/browse/HADOOP-18023?focusedWorklogId=686246&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-686246
 ]

ASF GitHub Bot logged work on HADOOP-18023:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 25/Nov/21 06:45
            Start Date: 25/Nov/21 06:45
    Worklog Time Spent: 10m 
      Work Description: smarthanwang opened a new pull request #3721:
URL: https://github.com/apache/hadoop/pull/3721


   
   JIRA:  https://issues.apache.org/jira/browse/HADOOP-18023
   
   ### Description of PR
   
   Allow the hadoop fs -cp command to run with multiple threads to improve copy speed.
   
   ### How was this patch tested?
   It is useful to allow the -cp command to run with multiple threads, like the 
improvement we made for the -put/-get commands. In my test case, running with 
10 threads reduced the elapsed time by about 88% (from 1m9.394s to 0m8.217s).
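
To illustrate the general idea of a thread-pooled copy, here is a minimal sketch for a local filesystem. It is not the code in this pull request (which changes Hadoop's Java shell commands); `copy_tree_parallel` and its `threads` parameter are hypothetical names chosen to mirror the `-t` option used in the tests.

```python
import shutil
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def copy_tree_parallel(src: Path, dst: Path, threads: int = 1) -> int:
    """Copy every regular file under src into dst using a pool of worker threads.

    Hypothetical helper for illustration only; returns the number of files copied.
    """
    entries = list(src.rglob("*"))
    files = [p for p in entries if p.is_file()]
    dirs = [src] + [p for p in entries if p.is_dir()]

    # Create the directory skeleton up front so workers never race on mkdir.
    for d in dirs:
        (dst / d.relative_to(src)).mkdir(parents=True, exist_ok=True)

    def copy_one(path: Path) -> None:
        shutil.copy2(path, dst / path.relative_to(src))

    with ThreadPoolExecutor(max_workers=max(1, threads)) as pool:
        # list() drains the iterator so any worker exception is raised here.
        list(pool.map(copy_one, files))
    return len(files)
```

Each file is an independent unit of work, so the copy is largely I/O bound and the pool can overlap many transfers; that is the same reason a thread count helps the -put/-get commands.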
   
   #### Source dir: 1 directory, 401 files, 2.3G
   ##### Test 1: run with single thread
   ```
     time hadoop fs -cp /tmp/data/test /tmp/data/t1
     real    1m9.394s
     user    0m16.688s
     sys    0m5.331s
   ```
   ##### Test 2: run with 10 threads
   ```
     time hadoop fs -cp -t 10 /tmp/data/test /tmp/data/t2
     real    0m8.217s
     user    0m19.864s
     sys    0m8.776s
   ```
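
The speedup implied by the two `real` timings above can be checked with a few lines of arithmetic. This is a small sketch; `parse_time` is a hypothetical helper for reading `time`-style values, and only the figures reported in Test 1 and Test 2 are used.

```python
import re


def parse_time(text: str) -> float:
    """Convert a `time`-style value such as '1m9.394s' to seconds."""
    match = re.fullmatch(r"(?:(\d+)m)?(\d+(?:\.\d+)?)s", text.strip())
    if match is None:
        raise ValueError(f"unrecognized time value: {text!r}")
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


single = parse_time("1m9.394s")  # Test 1, real (single thread)
multi = parse_time("0m8.217s")   # Test 2, real (10 threads)

speedup = single / multi         # how many times faster the 10-thread run was
reduction = 1 - multi / single   # fraction of elapsed time saved

# Total CPU time (user + sys) for each run, from the same two tests.
cpu_single = parse_time("0m16.688s") + parse_time("0m5.331s")
cpu_multi = parse_time("0m19.864s") + parse_time("0m8.776s")

print(f"speedup: {speedup:.2f}x, elapsed time reduced by {reduction:.1%}")
print(f"CPU time: {cpu_single:.3f}s single-threaded vs {cpu_multi:.3f}s with 10 threads")
```

The elapsed time drops by a bit under 90% while total CPU time rises somewhat, which is consistent with the work being overlapped across threads and not reduced.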


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

            Worklog Id:     (was: 686246)
    Remaining Estimate: 0h
            Time Spent: 10m

> Allow cp command to run with multi threads.
> -------------------------------------------
>
>                 Key: HADOOP-18023
>                 URL: https://issues.apache.org/jira/browse/HADOOP-18023
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Chengwei Wang
>            Assignee: Chengwei Wang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Allow the _*hadoop fs -cp*_ command to run with multiple threads to improve copy 
> speed.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

