Hi, The MRIT (MapReduceIndexerTool) uses NLineInputFormat for the morphline mapper. The mapper doesn't co-locate with the input data that it process. Isn't this a performance hit?
Ideally, morphline mapper should be run on those hosts that contain most data blocks for the input files it process. Regards, Tom