Hi Wei-Chiu,

Awesome question! Going in a different direction from "features": speaking as an operator, simplifying the architecture of HDFS would be helpful. As my team thinks about on-prem cloud deployments, operational simplicity is becoming increasingly important.
Being able to deploy a single type of daemon to achieve an HA, fault-tolerant cluster would be amazing (rather than the number of specialized daemons needed today).

-Clay

From: [email protected]
At: 06/10/19 13:09:00
To: [email protected], [email protected], [email protected]
Subject: [DISCUSS] HDFS roadmap/wish list

Hi!

I am soliciting feedback on HDFS roadmap items and a wish list for future Hadoop releases. A community meetup is happening soon, and perhaps we can use this thread to converge on things we should talk about there.

I am aware of several major features that were merged into trunk, such as RBF and Consistent Standby Serving Reads, as well as some recent features that were merged into the 3.2.0 release (storage policy satisfier).

What else should we be doing? I have a laundry list of supportability improvement projects, mostly about improving performance or making performance diagnostics easier. I can share the list if folks are interested.

Are there things we should do to make developers' lives easier, or things that would be nice to have for downstream applications? I know Sahil Takiar made a series of improvements in HDFS for Impala recently, and those improvements are applicable to other downstream projects such as HBase. Or would it help if we provided more Hadoop API examples?
