Did you mean 0.20.2?
If so then Wow, that is a bit of a stumper. Line 200 of BZip2Codec.java is the
following
196: public void write(int b) throws IOException {
197: if (needsReset) {
198: internalReset();
199: }
200: this.output.write(b);
201: }
So it must be that the output stream itself(this.output) is null (or this is
null which would mean that java itself has something very wrong with it). So
it looks like for some reason the output stream for the spill file is coming
back as null, but if I look at the code for IFile, where the output stream is
created
...
this.checksumOut = new IFileOutputStream(out);
...
if (codec != null) {
this.compressor = CodecPool.getCompressor(codec);
this.compressor.reset();
this.compressedOut = codec.createOutputStream(checksumOut, compressor);
...
I don't see any way that checksumOut could be null. There may have been some
sort of an optimization with in IFileOutputStream, but I really don't see how.
You might want to look at how full the disks are on the nodes that it is
failing on. You might also want to check to see if any records were output by
these mappers at all, because this is failing on close, and it would be very
interesting to see if anything else was output to the IFile before this?
--Bobby Evans
On 11/7/11 10:36 AM, "Steve Lewis" <[email protected]> wrote:
0.202 and using that API -
On Mon, Nov 7, 2011 at 8:27 AM, Robert Evans <[email protected]> wrote:
What version of Hadoop are you using?
On 11/5/11 11:09 AM, "Steve Lewis" <[email protected]
<http://[email protected]> > wrote:
My job is dying during a map task write. This happened in enough task to kill
the job although most tasks succeeded -
Any ideas as to where to start diagnosing the problem
Caused by: java.lang.NullPointerException
at
org.apache.hadoop.io.compress.BZip2Codec$BZip2CompressionOutputStream.write(BZip2Codec.java:200)
at
org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:41)
at java.io.DataOutputStream.writeByte(DataOutputStream.java:136)
at org.apache.hadoop.io.WritableUtils.writeVLong(WritableUtils.java:263)
at org.apache.hadoop.io.WritableUtils.writeVInt(WritableUtils.java:243)
at org.apache.hadoop.mapred.IFile$Writer.close(IFile.java:126)
at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1242)
at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.access$1800(MapTask.java:648)
at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer$SpillThread.run(MapTask.java:1135)