Introduction To Data Analytics MCA-3282 Open Elective - 6 Sem B.Tech Topic - Grouping
MCA-3282
Open Elective – 6th sem B.Tech
Topic - Grouping
Rohini R. Rao
Dept of Computer Applications
Jan 2020
(Slide set 3 out of 5)
Contents
Introduction to grouping
Clustering
Categories of clustering algorithms
Distance measures
K-means clustering
Agglomerative clustering
Frequent Patterns
Association Rules
Measures of Pattern Interestingness
Apriori
Types of Association Rules
Case Studies
Case study 3: Crime & hot spot analysis
Case study 4: Amazon – recommender systems
References
Problem Definition
● Deliverables
● Roles and responsibilities
● Current situation
● Timeline
● Costs and benefits
Data Preparation
● Pull together data table
● Categorize the data
● Clean the data
● Remove unnecessary data
● Transform the data
● Partition the data
Implementation of the Analysis
● Summarizing the data
● Finding hidden relationships
● Making predictions
Deployment
● Measure and monitor performance
● Review the project
Minkowski Distance
$$\mathrm{dist}(i,j) = \left( |x_{i1} - x_{j1}|^p + |x_{i2} - x_{j2}|^p + \dots + |x_{in} - x_{jn}|^p \right)^{1/p}$$
Example: three observations, each described by 5 variables
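The Minkowski formula above can be sketched in a few lines of Python. This is a minimal illustration, not taken from the slides: the function name `minkowski_distance` and the values of the three observations are hypothetical, chosen only to match the "three observations, 5 variables" setting. Setting p = 1 gives the Manhattan distance and p = 2 the Euclidean distance.

```python
def minkowski_distance(x, y, p):
    """Minkowski distance of order p between two equal-length numeric vectors."""
    if len(x) != len(y):
        raise ValueError("observations must have the same number of variables")
    if p < 1:
        raise ValueError("p must be at least 1")
    # Sum |x_k - y_k|^p over all variables, then take the p-th root.
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


# Hypothetical data: three observations, each with 5 variables.
observations = {
    "A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "B": [2.0, 4.0, 3.0, 1.0, 5.0],
    "C": [1.0, 2.0, 3.0, 4.0, 9.0],
}

# p = 1: Manhattan distance between A and B.
manhattan_ab = minkowski_distance(observations["A"], observations["B"], p=1)
# p = 2: Euclidean distance between A and C.
euclidean_ac = minkowski_distance(observations["A"], observations["C"], p=2)

print(manhattan_ab, euclidean_ac)
```

In practice the variables are usually normalized first, since a variable with a large range would otherwise dominate the distance.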
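$$d(i,j) = \frac{p - m}{p}$$
where m is the number of variables on which observations i and j match and p is the total number of variables.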
Content-based approach
Recommends items that are similar to items the user preferred or queried in the past.
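One possible way to sketch the content-based idea is to describe each item by a feature vector and rank the other items by their similarity to an item the user liked. The catalogue, the feature vectors, and the helper names (`cosine_similarity`, `recommend_similar_items`) below are hypothetical, and cosine similarity is only one of several measures that could be used here.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical catalogue: item -> feature vector (for example, genre flags).
item_features = {
    "book_a": [1, 0, 1],
    "book_b": [1, 0, 0],
    "book_c": [0, 1, 0],
    "book_d": [1, 1, 1],
}


def recommend_similar_items(liked_item, item_features, top_n=2):
    """Rank all other items by feature similarity to an item the user liked."""
    target = item_features[liked_item]
    scored = [
        (cosine_similarity(target, feats), name)
        for name, feats in item_features.items()
        if name != liked_item
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:top_n]]


content_based_picks = recommend_similar_items("book_a", item_features)
print(content_based_picks)
```

Only item descriptions and the user's own history are needed here; no information about other users is used.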
Collaborative approach
Considers a user's social environment.
Recommends items based on the opinions of other customers who have similar tastes or preferences as the user.
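A minimal sketch of the collaborative idea, assuming a small table of user ratings: users who rated shared items similarly are treated as having similar tastes, and items they rated that the target user has not seen are scored by similarity-weighted ratings. The ratings, user names, and function names are hypothetical, and this simple user-based scheme is an illustration, not a description of any production recommender.

```python
import math

# Hypothetical ratings: user -> {item: rating on a 1-5 scale}.
ratings = {
    "alice": {"item1": 5, "item2": 4, "item3": 1},
    "bob": {"item1": 5, "item2": 5, "item4": 4},
    "carol": {"item1": 1, "item3": 5, "item5": 4},
}


def user_similarity(a, b):
    """Cosine similarity computed over the items both users have rated."""
    shared = set(a) & set(b)
    if not shared:
        return 0.0
    dot = sum(a[i] * b[i] for i in shared)
    norm_a = math.sqrt(sum(a[i] ** 2 for i in shared))
    norm_b = math.sqrt(sum(b[i] ** 2 for i in shared))
    return dot / (norm_a * norm_b)


def recommend_for(user, ratings, top_n=1):
    """Score unseen items by the similarity-weighted ratings of other users."""
    own = ratings[user]
    scores = {}
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        sim = user_similarity(own, other_ratings)
        if sim <= 0:
            continue
        for item, rating in other_ratings.items():
            if item in own:
                continue  # skip items the user has already rated
            scores[item] = scores.get(item, 0.0) + sim * rating
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


collaborative_picks = recommend_for("alice", ratings)
print(collaborative_picks)
```

Here bob's ratings agree with alice's far more closely than carol's do, so the item bob liked is ranked first for alice.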