It is written in Java and supports all major platforms
It is a primary distributed storage used by Hadoop applications for storing a large amount of data
HDFS and YARN build the data management layer of Apache Hadoop
It employs a NameNode and DataNode architecture to implement a distributed file system that provides high-performance access to data across highly scalable Hadoop clusters.
Highly configurable with a default configuration well suited for many installations
Rack awareness
File permissions and authentication
Safe mode
Minimal data motion
Upgrade and Rollback
Operability