## ML | R-squared in Regression Analysis

R-squared is a statistical measure that represents the goodness of fit of a regression model. The ideal value for R-squared is 1. The closer the…

Attribute subset selection is a technique used for data reduction in the data mining process. Data reduction reduces the size of data so that…

Preprocessing in Data Mining: Data preprocessing is a data mining technique used to transform raw data into a useful and efficient format.…

Fact Constellation is a schema for representing a multidimensional model. It is a collection of multiple fact tables that share some common dimension tables. It can be…

A data warehouse is built to support management functions whereas data mining is used to extract useful information and patterns from data. Data warehousing is…

Analytics is the discovery and communication of meaningful patterns in data. Especially valuable in areas rich with recorded information, analytics relies on the simultaneous application…

Association rule mining finds interesting associations and relationships among large sets of data items. An association rule shows how frequently an itemset occurs in a transaction.…

Prerequisite – Frequent Item set in Data set (Association Rule Mining). The Apriori algorithm was proposed by R. Agrawal and R. Srikant in 1994 for finding…

Association mining searches for frequent items in a data set. In frequent mining, usually the interesting associations and correlations between itemsets in transactional and relational…

In this post, we will discuss the different sources of data that are used in the data mining process. The data from multiple sources are…

Data Mining – Knowledge Discovery in Databases (KDD). Why do we need data mining? The volume of information is increasing every day that we can handle from business transactions,…

Data science is an interdisciplinary field of scientific methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured…

Database systems, like any other computer system, are subject to failures, but the data stored in them must be available as and when required. When…

In general terms, “Mining” is the process of extracting some valuable material from the earth, e.g. coal mining, diamond mining, etc. In the context…

Quoting the words of Pat Gelsinger, the CEO of VMware: “Data is the new science, Big Data holds the answers”. Going by this statement, data…