Hi, I had some questions about what the Aggregate Word Count example in the hadoop-mapreduce-examples-3.3.1.jar actually does.

This is how I executed the AggregateWordCount example:

    hadoop jar hadoop-3.3.1/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar aggregatewordcount /examples-input/wordcount/ /examples-output/wordcount/ 1 textinputformat

/examples-input/wordcount/ contains 2 files: wc01.txt and wc02.txt.

These are the contents of wc01.txt:

    Hello World Bye World

These are the contents of wc02.txt:

    Hello Hadoop Goodbye Hadoop

The generated output file, /examples-output/wordcount/part-r-00000, contains the following line:

    record_count 2

I tried adding another file, wc03.txt, which changed the content of the generated file to:

    record_count 3

Does this mean that AggregateWordCount is merely counting the number of files in the input directory?

Regards,
--
Pratyush Das
