Corey,
My apologies for not making myself clear. The points you listed are exactly what I meant.

Joe: I did check out RSync, but we are planning to establish a continuous data flow pipeline from a wide range of servers, message buses, etc. to HDFS. We think Apache NiFi can be integrated and used as the data flow system for the Analytics as a Service platform that we are building.

Thanks for the help.

Kartik
