Skip to content
Related Articles

Related Articles

Market Basket Analysis in Data Mining

View Discussion
Improve Article
Save Article
  • Difficulty Level : Easy
  • Last Updated : 15 Mar, 2022

A data mining technique that is used to uncover purchase patterns in any retail setting is known as Market Basket Analysis. In simple terms Basically, Market basket analysis in data mining is to analyze the combination of products which been bought together.

This is a technique that gives the careful study of purchases done by a customer in a supermarket. This concept identifies the pattern of frequent purchase items by customers. This analysis can help to promote deals, offers, sale by the companies, and data mining techniques helps to achieve this analysis task. Example:

  • Data mining concepts are in use for Sales and marketing to provide better customer service, to improve cross-selling opportunities, to increase direct mail response rates.
  • Customer Retention in the form of pattern identification and prediction of likely defections is possible by Data mining.
  • Risk Assessment and Fraud area also use the data-mining concept for identifying inappropriate or unusual behavior etc.

Market basket analysis mainly works with the ASSOCIATION RULE {IF} -> {THEN}.

  • IF means Antecedent: An antecedent is an item found within the data
  • THEN means Consequent: A consequent is an item found in combination with the antecedent. 

Let’s see ASSOCIATION RULE {IF} -> {THEN} rules used in Market Basket Analysis in Data Mining. For example, customers buying a domain means they definitely need extra plugins/extensions to make it easier for the users.

Like we said above Antecedent is the item sets that are available in data. By formulating from the rules means {if} component and from the example is the domain.

Same as Consequent is the item that is found with the combination of Antecedents. By formulating from the rules means {THEN} component and from the example is extra plugins/extensions.

With the help of these, we are able to predict customer behavioral patterns. From this, we are able to make certain combinations with offers that customers will probably buy those products. That will automatically increase the sales and revenue of the company.

With the help of the Apriori Algorithm, we can further classify and simplify the item sets which are frequently bought by the consumer.

There are three components in APRIORI ALGORITHM:

  • LIFT

Now take an example, suppose 5000 transactions have been made through a popular eCommerce website. Now they want to calculate the support, confidence, and lift for the two products, let’s say pen and notebook for example out of 5000 transactions, 500 transactions for pen, 700 transactions for notebook, and  1000 transactions for both.

SUPPORT: It is been calculated with the number of transactions divided by the total number of transactions made,

Support = freq(A,B)/N

support(pen) = transactions related to pen/total transactions

i.e support -> 500/5000=10 percent

CONFIDENCE: It is been calculated for whether the product sales are popular on individual sales or through combined sales. That is calculated with combined transactions/individual transactions.

Confidence = freq(A,B)/freq(A)

Confidence =   combine transactions/individual transactions

i.e confidence-> 1000/500=20 percent

LIFT: Lift is calculated for knowing the ratio for the sales.

Lift= confidence percent/ support percent

Lift-> 20/10=2 

When the Lift value is below 1 means the combination is not so frequently bought by consumers. But in this case, it shows that the probability of buying both the things together is high when compared to the transaction for the individual items sold. 

With this, we come to an overall view of the Market Basket Analysis in Data Mining and how to calculate the sales for combination products.

My Personal Notes arrow_drop_up
Recommended Articles
Page :

Start Your Coding Journey Now!