K S created HADOOP-16378:
----------------------------
Summary: RawLocalFileStatus throws exception if a file is created
and deleted quickly
Key: HADOOP-16378
URL: https://issues.apache.org/jira/browse/HADOOP-16378
Project: Hadoop Common
Issue Type: Bug
Components: fs
Reporter: K S
Bug occurs when Hadoop creates temporary ".nfs*" files as part of file moves
and accesses. If this file is deleted very quickly after being created, a
RuntimeException is thrown. The root cause is in the loadPermissionInfo method
in org.apache.hadoop.fs.RawLocalFileSystem. To get the permission info, it
first does
{code:java}
ls -ld{code}
and then attempts to get permissions info about each file. If a file
disappears between these two steps, an exception is thrown.
*Reproduction Steps:*
An isolated way to reproduce the bug is to run FileInputFormat.listStatus over
and over on the same dir that we’re creating those temp files in. On Ubuntu or
any other Linux-based system, this should fail intermittently. On MacOS (due to
differences in how `ls` returns status codes) this should not fail.
*Fix:*
One way in which we managed to fix this was to ignore the exception being
thrown in loadPemissionInfo() if the exit code is 1 or 2.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]