Difference between Impala and MongoDB

  • Last Updated : 14 Jul, 2020

1. Impala :
Impala is an open-source, massively parallel processing (MPP) SQL query engine that runs on Hadoop. It processes data in memory and was an early adopter of the Parquet file format, a columnar storage layout optimized for the large-scale queries typical of data warehouse workloads. It provides high-performance, low-latency, interactive SQL queries on data stored in Hadoop file formats.
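
To see why a columnar layout suits analytical queries, the sketch below contrasts a row-oriented and a column-oriented arrangement of the same small table. The table, its column names, and the `to_columns` helper are hypothetical, and this is a toy model rather than Parquet's actual on-disk format.

```python
# Toy illustration of row-oriented vs. column-oriented layouts.
# Hypothetical data; this is not how Parquet encodes files on disk.

rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 75.5},
    {"order_id": 3, "region": "EU", "amount": 30.0},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Row layout: every full record is visited just to read one field.
total_from_rows = sum(record["amount"] for record in rows)

# Column layout: only the "amount" column needs to be read.
total_from_columns = sum(columns["amount"])

print(columns["amount"])
print(total_from_rows == total_from_columns)
```

An aggregate such as a sum touches a single contiguous column in the second layout, which is the general idea behind columnar formats for warehouse-style queries.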

2. MongoDB :
MongoDB is a cross-platform, document-oriented, non-relational (NoSQL) database program. It is an open-source document database that stores data as JSON-like documents made up of field-value pairs. It is developed by MongoDB Inc. and was initially released on 11 February 2009. It is written in C++, Go, JavaScript, and Python. MongoDB offers high speed, high availability, and high scalability.
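
As a rough picture of the document model, the sketch below builds one document as nested field-value pairs and round-trips it through JSON. The field names and values are made up for illustration, and Python's standard `json` module is used only as a stand-in: it is not MongoDB's storage format or driver.

```python
import json

# Hypothetical document; field names and values are invented for illustration.
user_document = {
    "_id": "u1001",
    "name": "Asha",
    "languages": ["Python", "Go"],           # a field may hold an array
    "address": {"city": "Pune", "country": "IN"},  # or a nested document
}

# Serialize to JSON text and parse it back (stand-in for storing and loading).
serialized = json.dumps(user_document, sort_keys=True)
restored = json.loads(serialized)

# Nested fields are reached by walking the structure, with no joins involved.
city = restored["address"]["city"]
print(city)
```

Related data such as the address lives inside the same document, which is the main contrast with a relational layout that would split it into a separate table.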


Difference between Impala and MongoDB :

S.No. | Impala | MongoDB
--- | --- | ---
1. | It was developed by Cloudera and released in 2013. | It was developed by MongoDB Inc. and released in 2009.
2. | It is open-source software. | It is also open-source software.
3. | The server operating system for Impala is Linux. | The server operating systems for MongoDB are Solaris, Linux, OS X, and Windows.
4. | It does not support in-memory capabilities. | It supports in-memory capabilities.
5. | It has no transaction concept. | It supports transactions with ACID properties.
6. | The replication method that Impala supports is a selectable replication factor. | The replication method that MongoDB supports is master-slave (primary-secondary) replication.
7. | It supports all programming languages that support JDBC/ODBC. | It supports many programming languages, such as C, C#, Java, JavaScript, PHP, Lua, Python, R, and Ruby.
8. | It supports sharding as the partitioning method for storing different data on different nodes. | It also supports sharding as the partitioning method.
9. | JDBC and ODBC are used as APIs and access methods. | A proprietary protocol using JSON is used as the API and access method.
10. | The primary database model is relational DBMS. | The primary database model is document store.
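
The difference in database model and access method can be sketched by running the same lookup two ways. This is only an illustration under stated assumptions: Python's built-in `sqlite3` stands in for a SQL engine such as Impala, plain dictionaries stand in for documents in a collection, and the `employees` data and the `find` helper are hypothetical rather than part of either product's API.

```python
import sqlite3

# Relational side: sqlite3 is only a stand-in for a SQL engine such as Impala.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Ravi", "eng", 90), ("Mina", "eng", 110), ("Tom", "ops", 70)],
)
sql_result = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM employees WHERE dept = ? AND salary > ? ORDER BY name",
        ("eng", 100),
    )
]
conn.close()

# Document side: plain dicts stand in for documents in a collection.
employees = [
    {"name": "Ravi", "dept": "eng", "salary": 90},
    {"name": "Mina", "dept": "eng", "salary": 110},
    {"name": "Tom", "dept": "ops", "salary": 70},
]


def find(documents, dept, min_salary):
    """Hypothetical helper: filter documents by field values, sorted by name."""
    return sorted(
        doc["name"]
        for doc in documents
        if doc.get("dept") == dept and doc.get("salary", 0) > min_salary
    )


doc_result = find(employees, "eng", 100)

print(sql_result == doc_result)
```

The relational version declares a fixed schema up front and queries it with SQL, while the document version filters on whatever fields each document happens to carry.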
