Difference Between Data Mining and Data Analysis
1. Data Analysis:
Data analysis involves extracting, cleaning, transforming, modeling, and visualizing data in order to obtain useful information that supports drawing conclusions and making decisions.
The main purpose of data analysis is to find meaningful information in raw data so that the derived knowledge can be used to make important decisions.
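The cleaning, transformation, and modeling steps described above can be sketched in a few lines of Python. This is only a minimal illustration using the standard library: the `raw_orders` records, the field names, and the helper functions `clean` and `average_by_region` are hypothetical and not part of any particular tool, and the "model" here is just a descriptive summary (a mean per group).

```python
from statistics import mean

# Hypothetical raw records, made up for illustration only.
raw_orders = [
    {"region": "north", "amount": "120.50"},
    {"region": "South ", "amount": "80"},
    {"region": "north", "amount": ""},               # missing amount
    {"region": "south", "amount": "99.50"},
    {"region": "NORTH", "amount": "not available"},  # unparseable amount
]


def clean(records):
    """Cleaning and transformation: keep parseable rows, normalise labels."""
    cleaned = []
    for row in records:
        try:
            amount = float(row["amount"])
        except ValueError:
            # Cleaning: skip rows whose amount is missing or not a number.
            continue
        # Transformation: strip whitespace and lowercase the region label.
        region = row["region"].strip().lower()
        cleaned.append({"region": region, "amount": amount})
    return cleaned


def average_by_region(records):
    """Modeling (a simple descriptive summary): mean amount per region."""
    groups = {}
    for row in records:
        groups.setdefault(row["region"], []).append(row["amount"])
    return {region: mean(values) for region, values in groups.items()}


summary = average_by_region(clean(raw_orders))
print(summary)
```

A summary like this is the kind of derived information an analyst would then visualize or use to support a decision, for example comparing average order value across regions.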
2. Data Mining:
Data mining can be regarded as a subset of data analysis. It is the exploration and analysis of large volumes of data to find meaningful patterns and rules.
Data mining is a systematic, sequential process of identifying and discovering hidden patterns and information in a large dataset. It is also used to build machine learning models, which are in turn used in artificial intelligence.
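As a rough illustration of what "discovering hidden patterns" can mean in practice, the sketch below counts which pairs of items appear together often in a set of transactions. This is one simple pattern-mining idea (frequent pair counting by support), chosen here as an example rather than as the definitive data mining method. The `transactions` data, the `frequent_pairs` helper, and the `min_support` threshold are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, made up for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "jam"},
    {"bread", "milk", "jam"},
    {"milk", "butter"},
]


def frequent_pairs(baskets, min_support):
    """Return item pairs whose support (share of baskets) is >= min_support."""
    pair_counts = Counter()
    for basket in baskets:
        # Sort so that each pair is always counted under the same key.
        for pair in combinations(sorted(basket), 2):
            pair_counts[pair] += 1
    total = len(baskets)
    return {
        pair: count / total
        for pair, count in pair_counts.items()
        if count / total >= min_support
    }


patterns = frequent_pairs(transactions, min_support=0.4)
for pair, support in sorted(patterns.items()):
    print(pair, support)
```

On real data the number of candidate pairs grows quickly, which is why dedicated algorithms and large, structured datasets are typical of data mining work.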
Below is a table of differences between Data Mining and Data Analysis:
| Basis | Data Mining | Data Analysis |
|---|---|---|
| Definition | The process of extracting important patterns from large datasets. | The process of analysing and organizing raw data in order to derive useful information and support decisions. |
| Function | Used to discover hidden patterns in raw datasets. | Covers all the operations involved in examining datasets to draw conclusions. |
| Dataset | Datasets are generally large and structured. | Datasets can be large, medium, or small, and structured, semi-structured, or unstructured. |
| Models | Often requires mathematical and statistical models. | Uses analytical and business intelligence models. |
| Visualization | Generally does not require visualization. | Definitely requires data visualization. |
| Goal | The prime goal is to make data usable. | Used to make data-driven decisions. |
| Required knowledge | Involves the intersection of machine learning, statistics, and databases. | Requires knowledge of computer science, statistics, mathematics, the subject domain, and AI/machine learning. |
| Also known as | Also known as knowledge discovery in databases. | Can be divided into descriptive statistics, exploratory data analysis, and confirmatory data analysis. |
| Output | Shows data trends and patterns. | The output is a verified or discarded hypothesis. |