1. Hive :
Hive is a data warehouse software for querying and managing large distributed datasets, built on Hadoop. It is developed by Apache Software Foundation in 2012. It contains two modules, one is MapReduce and another is Hadoop Distributed File System (HDFS). It stores schema in a database and processed data into HDFS. It resides on top of Hadoop to summarize Big Data, and makes querying and analyzing easy.
2. Cassandra :
Cassandra is a free and open-source, distributed, wide column store, NoSQL database management system. It was developed by Apache Software foundation and initially released on July 2008. Cassandra is designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.
Difference between Hive and Cassandra :
|1.||It is developed by Apache Software Foundation in 2012.||It is also developed by Apache Software Foundation in 2008.|
|2.||It is written in Java language.||It is written in Java language.|
|3.||The primary database model for Hive is Relational DBMS.||The primary database model for Cassandra is Wide Column Store.|
|4.||Server operating systems for Hive is all OS with a Java VM .||Cassandra server operating systems are BSD, Linux, OS X, Windows.|
|5.||It is an open source software.||It is also an open source software.|
|7.||The replication method that Hive supports is Selectable Replication Factor.||The replication method that Cassandra supports is Selectable Replication Factor.|
|8.||It do not support In-memory capabilities.||It support In-memory capabilities.|
|9.||No transaction concepts.||ACID properties of transaction is used.|
|10.||JDBC, ODBC, Thrift are used as APIs and other access methods.||Proprietary protocol, Thrift are used asAPIs and other access methods.|
Attention reader! Don’t stop learning now. Get hold of all the important CS Theory concepts for SDE interviews with the CS Theory Course at a student-friendly price and become industry ready.