Similar Topics

Python

20k+ articles

Difference Between

4.3k+ articles

GBlog

3k+ articles

Misc

2.6k+ articles

Computer Subject

1.7k+ articles

BigData

49 articles

Hadoop

33 articles

MapReduce

18 articles

Apache-Hive

12 articles

Apache Pig

6 articles

Hadoop

110+ posts

Hive - Load Data Into Table

Last Updated: 24 November 2020

Hive tables provide us the schema to store data in various formats (like CSV). Hive provides multiple ways to add data to the tables. We can...read more

Hadoop Streaming Using Python - Word Count Problem

Last Updated: 19 January 2022

Hadoop Streaming is a feature that comes with Hadoop and allows users or developers to use various different languages for writing MapReduce...read more

MapReduce Architecture

Last Updated: 10 September 2020

MapReduce and HDFS are the two major components of Hadoop which makes it so powerful and efficient to use. MapReduce is a programming model ...read more

Similar Topics

Python

20k+ articles

Difference Between

4.3k+ articles

GBlog

3k+ articles

Misc

2.6k+ articles

Computer Subject

1.7k+ articles

BigData

49+ articles

Hadoop

33+ articles

MapReduce

18+ articles

Apache-Hive

12+ articles

Apache Pig

6+ articles

Different Sources of Data for Data Analysis

Last Updated: 08 July 2022

Data collection is the process of acquiring, collecting, extracting, and storing the voluminous amount of data which may be in the structure...read more

Technical Scripter

Write From Home

Hadoop - copyFromLocal Command

Last Updated: 27 December 2021

Hadoop copyFromLocal command is used to copy the file from your local file system to the HDFS(Hadoop Distributed File System). copyFromLocal...read more

Hadoop - Architecture

Last Updated: 03 January 2023

As we all know Hadoop is a framework written in Java that utilizes a large cluster of commodity hardware to maintain and store big size data...read more

Matrix Multiplication With 1 MapReduce Step

Last Updated: 11 January 2023

MapReduce is a technique in which a huge program is subdivided into small tasks and run parallelly to make computation faster, save time, an...read more

Difference Between Hadoop and Spark

Last Updated: 06 February 2023

Apache Hadoop is a platform that got its start as a Yahoo project in 2006, which became a top-level Apache open-source project afterward. Th ...read more

Computer Subject

Difference Between

Write From Home

What is Big Data?

Last Updated: 02 December 2022

Data science is the study of data analysis by advanced technology (Machine Learning, Artificial Intelligence, Big data). It processes a huge...read more

Computer Subject

Data Engineering

Applications of Big Data

Last Updated: 15 June 2022

In today's world, there are a lot of data. Big companies utilize those data for their business growth. By analyzing this data, the useful de ...read more

Computer Subject

How to Execute WordCount Program in MapReduce using Cloudera Distribution Hadoop(CDH)

Last Updated: 24 June 2021

Prerequisites: Hadoop and MapReduceCounting the number of words in any language is a piece of cake like in C, C++, Python, Java, etc. MapRed...read more

Last Updated: 07 March 2024

HDFS is the primary or major component of the Hadoop ecosystem which is responsible for storing large data sets of structured or unstructure...read more

Introduction to Apache Pig

Last Updated: 14 May 2023

Pig Represents Big Data as data flows. Pig is a high-level platform or tool which is used to process the large datasets. It provides a high- ...read more

Hadoop Ecosystem

Last Updated: 29 January 2024

Overview: Apache Hadoop is an open source framework intended to make interaction with big data easier, However, for those who are not acquai...read more

Architecture and Working of Hive

Last Updated: 25 April 2023

Prerequisite - Introduction to Hadoop, Apache HiveThe major components of Hive and its interaction with the Hadoop is demonstrated in the fi...read more

1 2 3 4 5 6 7 8 >>