What is the difference between Apache Spark and Hadoop MapReduce?

  • Apache Spark is an alternative to the MapReduce model. It can process streaming data in near real time as well as batch data, and for workloads that fit in memory it often finishes much faster than an equivalent MapReduce job. The main reason is data sharing: reading and writing memory is much faster than reading and writing disk. Spark keeps intermediate data in distributed memory as resilient distributed datasets (RDDs), stored as in-memory objects, whereas Hadoop MapReduce writes the output of each job to disk. Data sharing between MapReduce jobs is therefore slow, because every step involves HDFS read and write operations, while Spark can reuse an RDD across steps without going back to disk.
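
The difference in data sharing can be illustrated with a toy model in plain Python. This is only a sketch: it uses neither the Spark nor the Hadoop API, and the function names (run_via_disk, run_in_memory) and the three example steps are made up for illustration. The first function hands data from one step to the next through files, roughly the way chained MapReduce jobs go through HDFS; the second keeps the intermediate result in memory, roughly the way Spark reuses an RDD.

```python
import json
import os
import tempfile


def run_via_disk(data, steps, workdir):
    """MapReduce-like flow: every step reads its input from a file
    and writes its output to a new file."""
    path = os.path.join(workdir, "step0.json")
    with open(path, "w") as f:
        json.dump(data, f)
    for i, step in enumerate(steps, start=1):
        with open(path) as f:  # disk read before each step
            records = json.load(f)
        records = [step(r) for r in records]
        path = os.path.join(workdir, f"step{i}.json")
        with open(path, "w") as f:  # disk write after each step
            json.dump(records, f)
    with open(path) as f:
        return json.load(f)


def run_in_memory(data, steps):
    """Spark-like flow: the intermediate result stays in memory
    between steps, with no file I/O."""
    records = list(data)
    for step in steps:
        records = [step(r) for r in records]
    return records


# Hypothetical three-step pipeline over a tiny dataset.
steps = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
data = list(range(5))

with tempfile.TemporaryDirectory() as workdir:
    disk_result = run_via_disk(data, steps, workdir)
memory_result = run_in_memory(data, steps)

print(disk_result == memory_result)  # same answer, different I/O cost
```

Both functions return the same result; the disk version simply pays for one read and one write per step. On a real cluster that cost is larger still, since HDFS also replicates each write across nodes.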