Open In App

Different Phases of Projected Clustering in Data Analytics

We know Projected clustering is a typical dimension reduction subspace clustering method which instead of initiating from single dimensional spaces, proceeds by identifying an initial approximation of the clusters in high dimensional attribute space. But to do this projected clustering algorithm goes through different phases. In this article, we are going to discuss these different phases of projected clustering in data analytics in detail. 

Prequisite – Projected clustering



Three Phases for Projected Clustering 

  1. Initialization Phase
  2. Iterative Phase
  3. Refinement Phase

These are explained as following below. 

1. Initialization Phase: 

This phase comprises two steps to select the superset.



                            S= random sample size A.k,

This set is designated as M where the hill climbing technique is put in during the next phase.

2. Iterative Phase: 

From the initialization phase, we got a set of data points that should hold the medoids. In this phase, we will find the best medoids from M. Randomly picks up the set of points M current, and restore the “bad” medoids from other points in M if required by which cluster quality is upgraded. The freshly formed meaningful medoid set is designated as M best. For the medoids, the following will be done as follows.

3. Refinement Phase -Handle Outliers :

Major Drawback of Projected Clustering Algorithm :

Article Tags :