Open In App
Related Articles

Difference Between Apache Hadoop and Apache Storm

Improve Article
Save Article
Like Article

Apache Hadoop: It is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. 

Apache Storm: It is a distributed stream processing computation framework written predominantly in the Clojure programming language. Originally created by Nathan Marz and the team at BackType, the project was open-sourced after being acquired by Twitter. Apache-Hadoop-vs-Apache-Storm 

Below is a table of differences between Apache Hadoop and Apache Storm: 

FeaturesApache HadoopApache Storm
ProcessingDistributed batch processing which uses MapReduceDistributed real-time data processing which uses DAGs
LatencyHigh Latency i.e slow computationLow Latency i.e fast computation
Written LanguageWhole frame work is written in JavaFrame work is written in Clojure and Java
Streaming processingIt is State-full streaming processingIt is State-less streaming processing
SetupEasy to setup but operating cluster is hardEasy to use
Data streamingData is dynamic and continuously streamedData is static and nonvolatile i.e data is persistence
Use casesIt is used in Twitter, Navisite, Wego etcIt is used in Black Box Data, Search Engine Data etc
ArchitectureHadoop comprises HDFS (used for data storage) and MapReduce (used for Computation) as architectural units.Storm comprises streams, spouts, and bolts as their architectural units.
Last Updated : 15 Feb, 2023
Like Article
Save Article
Similar Reads
Related Tutorials