CLIQUE Algorithm
CLIQUE Algorithm
For example
A B is the minimum cluster description of the shaded region.
C D E is a non-minimal cluster description of the same region.
28
Clique Working
2 Step Process
Dense units found with respect to age for the dimensions salary
and vacation are intersected in order to provide a candidate
search space for dense units of higher dimensionality.
2nd stage- Minimal Description
For each cluster, Clique determines the maximal
region that covers the cluster of connected dense units.
It then determines a minimal cover (logic description)
foreach cluster.
Effectiveness of Clique-
CLIQUE automatically finds subspaces of the highest
dimensionalitysuch that high-density clusters exist in
thosesubspaces.
It is insensitive to the orderof input objects
It scales linearly with the size of input
Easily scalable with the number of dimensions in the
data
GRID-BASED CLUSTERING
METHODS
7 plane
20 30 40 50 60
Age
Techniques for Grid-Based Clustering
age
salary
Example (Cont.)
0 1 2 3 4 5 6 7
20
30
40
50
Vacation
60
age
30
Vacation
(week)
50
0 1 2 3 4 5 6 7
20
30
40
age
50
60
age
CLIQUE
Strength
It automatically finds subspaces of the highest
dimensionality such that high density clusters exist in those
subspaces
It is insensitive to the order of records in input and does not
presume some canonical data distribution
It scales linearly with the size of input and has good
scalability as the number of dimensions in the data
increases
Weakness
The accuracy of the clustering result may be degraded at
the expense of simplicity of the method