Hi,

I had some questions about what the Aggregate Word Count example in the
hadoop-mapreduce-examples-3.3.1.jar actually does.

This is how I executed the AggregateWordCount example - hadoop jar
hadoop-3.3.1/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar
aggregatewordcount /examples-input/wordcount/ /examples-output/wordcount/ 1
textinputformat

/examples-input/wordcount/ contains 2 files - wc01.txt and wc02.txt.

These are the contents of wc01.txt:
Hello World Bye World

These are the contents of wc02.txt:
Hello Hadoop Goodbye Hadoop

The generated output file - /examples-output/wordcount/part-r-00000
contains the following line:
record_count 2

I tried adding another file - wc03.txt which changed the content of the
generated file to:
record_count 3

Does this mean that Aggregate WordCount is merely counting the number of
files in the input directory?

Regards,


-- 
Pratyush Das

Reply via email to