[jira] [Commented] (HADOOP-13738) DiskChecker should perform some disk IO

Arpit Agarwal (JIRA) Thu, 20 Oct 2016 12:54:32 -0700

    [ https://issues.apache.org/jira/browse/HADOOP-13738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15592804#comment-15592804 ]


Arpit Agarwal commented on HADOOP-13738:
----------------------------------------

Thanks for the feedback [~kihwal].

bq. Any particular reason why it retries on FNFE? When do you think that will 
happen?
The retry on FNFE (FileNotFoundException) handles the very unlikely case of a 
file name collision while creating the FileOutputStream, e.g. due to 
simultaneous checks or a previously existing file that cannot be deleted.
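
For illustration only, and not the code in HADOOP-13738.01.patch: a minimal 
Java sketch of a disk check that writes a byte, syncs it to the device, and 
retries with a fresh file name when the FileOutputStream cannot be created. 
The class name, method names, probe file prefix, and MAX_ATTEMPTS value are 
all assumptions made up for this example.

```java
import java.io.File;
import java.io.FileNotFoundException;
import java.io.FileOutputStream;
import java.io.IOException;
import java.util.UUID;

// Hypothetical sketch; names and constants are not taken from the patch.
final class DiskIoCheckSketch {
    // Assumed retry budget for probe file creation failures.
    private static final int MAX_ATTEMPTS = 3;

    // Tries to perform real disk IO in dir, retrying on FileNotFoundException.
    static void checkDirWithDiskIo(File dir) throws IOException {
        FileNotFoundException last = null;
        for (int attempt = 0; attempt < MAX_ATTEMPTS; attempt++) {
            // A fresh random name per attempt sidesteps a colliding file.
            File probe = new File(dir, "disk-check-" + UUID.randomUUID());
            try {
                writeAndSync(probe);
                return; // the disk accepted and persisted the write
            } catch (FileNotFoundException e) {
                last = e; // file could not be created; try another name
            }
        }
        throw new IOException("Could not create a probe file in " + dir, last);
    }

    // Writes one byte, forces it to the storage device, then cleans up.
    static void writeAndSync(File probe) throws IOException {
        boolean opened = false;
        try (FileOutputStream fos = new FileOutputStream(probe)) {
            opened = true;
            fos.write(1);
            fos.getFD().sync(); // flush OS buffers to the device
        } finally {
            if (opened) {
                probe.delete(); // best-effort removal of our own probe file
            }
        }
    }
}
```

Calling DiskIoCheckSketch.checkDirWithDiskIo(new File("/some/data/dir")) either 
returns normally or throws an IOException. Other IOExceptions from write() or 
sync() are deliberately not retried here, since they are more likely to 
indicate a real disk problem than a name collision.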

> DiskChecker should perform some disk IO
> ---------------------------------------
>
>                 Key: HADOOP-13738
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13738
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Arpit Agarwal
>            Assignee: Arpit Agarwal
>         Attachments: HADOOP-13738.01.patch
>
>
> DiskChecker can fail to detect total disk/controller failures indefinitely. 
> We have seen this in real clusters. DiskChecker performs simple 
> permissions-based checks on directories which do not guarantee that any disk 
> IO will be attempted.
> A simple improvement is to write some data and flush it to the disk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
