What is Hadoop

  • Hadoop is a distributed framework with a master/slave architecture for parallel processing of large volumes of data. It has two core components: the Hadoop Distributed File System (HDFS) for storage and MapReduce for processing. A number of machines (nodes) communicate with each other to form a Hadoop cluster, and the data is split into blocks and distributed across the nodes of that cluster. Together, the nodes present the stored data as a single logical file system.
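
To make the processing side more concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps, using word counting as the example. It is plain Python, not Hadoop's actual API: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are hypothetical and chosen only for illustration. Each string in chunks stands in for a block of data that a real cluster would store and process on a separate node.

```python
from collections import defaultdict


def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one chunk of input."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(chunks):
    # Assumption: each chunk represents a block held by a different node.
    # Here the chunks are processed one after another in a single process;
    # a real cluster would run the map tasks in parallel.
    mapped = []
    for chunk in chunks:
        mapped.extend(map_phase(chunk))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input: two "blocks" of text.
chunks = ["big data big cluster", "data node data"]
counts = word_count(chunks)
print(counts)
```

The point of the split is that map_phase only ever looks at one chunk and reduce_phase only ever looks at one key, so both steps can be spread across many nodes without the nodes needing to share state.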