A data warehouse is built to support management functions whereas data mining is used to extract useful information and patterns from data. Data warehousing is the process of compiling information into a data warehouse.
It is a technology that aggregates structured data from one or more sources so that it can be compared and analyzed rather than transaction processing. A data warehouse is designed to support management decision-making process by providing a platform for data cleaning, data integration and data consolidation. A data warehouse contains subject-oriented, integrated, time-variant and non-volatile data.
Data warehouse consolidates data from many sources while ensuring data quality, consistency and accuracy. Data warehouse improves system performance by separating analytics processing from transnational databases. Data flows into a data warehouse from the various databases. A data warehouse works by organizing data into a schema which describes the layout and type of data. Query tools analyze the data tables using schema.
It is the process of finding patterns and correlations within large data sets to identify relationships between data. Data mining tools allow a business organization to predict customer behavior. Data mining tools are used to build risk models and detect fraud. Data mining is used in market analysis and management, fraud detection, corporate analysis and risk management.
Comparison between data mining and data warehousing:
|Data Warehousing||Data Mining|
|A data warehouse is database system which is designed for analytical analysis instead of transactional work.||Data mining is the process of analyzing data patterns.|
|Data is stored periodically.||Data is analyzed regularly.|
|Data warehousing is the process of extracting and storing data to allow easier reporting.||Data mining is the use of pattern recognition logic to identify patterns|
|Data warehousing is solely carried out by engineers.||Data mining is carried by business users with the help of engineers.|
|Data warehousing is the process of pooling all relevant data together.||Data mining is considered as a process of extracting data from large data sets.|
Attention reader! Don’t stop learning now. Get hold of all the important CS Theory concepts for SDE interviews with the CS Theory Course at a student-friendly price and become industry ready.
- Difference Between Data Mining and Text Mining
- Difference Between Data Mining and Web Mining
- Difference between Data Warehousing and Online transaction processing (OLTP)
- Data Warehousing
- Difference Between Data Science and Data Mining
- Difference Between Big Data and Data Mining
- Difference Between Data Mining and Data Visualization
- Difference Between Data Mining and Data Analysis
- Difference Between Data mining and Machine learning
- Difference Between Data Mining and Statistics
- Difference between Business Intelligence and Data Mining
- Difference between Data Mining and OLAP
- Difference between Spatial and Temporal Data Mining
- Data Mining: Data Attributes and Quality
- Relationship between Data Mining and Machine Learning
- Difference between Web Content, Web Structure, and Web Usage Mining
- Difference between Text Mining and Natural Language Processing
- Types of Sources of Data in Data Mining
- Data Normalization in Data Mining
- Data Preprocessing in Data Mining
If you like GeeksforGeeks and would like to contribute, you can also write an article using contribute.geeksforgeeks.org or mail your article to firstname.lastname@example.org. See your article appearing on the GeeksforGeeks main page and help other Geeks.
Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.