Hadoop is an open-source software programming framework. The framework of Hadoop is based on Java Programming Language with some native code in shell script and C.
This framework is used to manage, store and process the data & computation for the different applications of big data running under clustered systems. The main components of Hadoop are HDFS, MapReduce and YARN.
Cassandra is an open-source distributed data management system with wide column store and NoSQL database. In this NoSQL database provides the capability to handle a very large amount of data across many commodity hardware with no single point of failure and high availability. The code is written in Java and developed by the Apache Software Foundation.
Difference Between Hadoop and Cassandra
|1||Hadoop is a scalable framework that is designed to be deployed on low-cost hardware.||It is deployed in a very distributed fashion as a cluster of instances that are all aware of each other.|
|2||Hadoop is a big data processing framework based on the famous MapReduce programming model.||Cassandra is mainly used for real-time data processing.|
|3||Hadoop supports a variety of formats.||Cassandra does not support images.|
|4||Hadoop follows a master slave architecture.||Cassandra follows a peer-to-peer architecture|
|5||Hadoop is deployed in a single data center.||Cassandra is deployed in a very distributed fashion.|
|6||It used map reduce to read/write.||This uses Cassandra query language.|
|7||In hadoop, data is directly written to data node.||While in Cassandra, data is first written to mem-table and then it is written to disk.|
|8||Hadoop has a fixed replication factor of 3.||Replication factor in Cassandra depend on the number of nodes.|
|9||It has high latency rate.||It has less latency rate.|
|10||Hadoop uses TCP and UDP for communication.||In Cassandra, gossip protocol is used for communication.|
|11||It is for data batch processing.||It is for real-time processing.|
- Difference Between Hadoop 2.x vs Hadoop 3.x
- Difference between Hadoop 1 and Hadoop 2
- Difference Between Hadoop and SQL
- Difference Between Hadoop and Spark
- Difference Between RDBMS and Hadoop
- Difference Between Hadoop and MongoDB
- Difference Between Hadoop and Teradata
- Difference Between Hadoop and MapReduce
- Difference Between Hadoop and Elasticsearch
- Difference Between Hadoop and HBase
- Difference Between Hadoop and Splunk
- Difference Between Hadoop and SQL Performance
- Difference Between Hadoop and Hive
- Difference Between Hadoop and Apache Spark
- Difference Between Big Data and Apache Hadoop
- Difference Between Cloud Computing and Hadoop
- Difference between Cassandra and IBM Db2
- Difference between MS SQL Server and Cassandra
- Difference between Derby and Cassandra
- Difference between SQLite and Cassandra
If you like GeeksforGeeks and would like to contribute, you can also write an article using contribute.geeksforgeeks.org or mail your article to email@example.com. See your article appearing on the GeeksforGeeks main page and help other Geeks.
Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.