ho
ho
Course No(s)
Credit Units 5
Course Objectives
No Course Objective
CO1 Gain basic understanding of the role of Data Science in various scenarios in the real-world
of business, industry and government.
CO2 Understand various roles and stages in a Data Science Project and ethical issues to
be considered.
CO3 Explore the processes, tools and technologies for collection and analysis of structured
and unstructured data.
CO4 Appreciate the importance of techniques like data visualization, storytelling with data
for the effective presentations of the outcomes with the stakeholders
CO6 Implement data analytic techniques for discovering interesting patterns from data.
Text Book(s)
T1 Introduction to Data Mining, by Tan, Steinbach and Vipin Kumar 2nd Ed, Pearson
2021
T4 Data Mining: Concepts and Techniques, 4th Edition by Jiawei Han and others
Morgan Kaufmann Publishers, 2023
R3 Python Data Science Handbook: Essential tools for working with data by Jake
VanderPlas
R4 KDD, SEMMA and CRISP-DM: A Parallel Overview , Ana Azevedo and M.F. Santos ,
IADS-DM, 2008
Content Structure
1 Fundamentals of Data Science (2 hrs)
1.1 Real World applications
1.2 Data Science Challenges
1.3 Data Science Teams and Roles
1.4 Data Science Process
a) CRISP-DM Methodology
b) SEMMA
c) BIG DATA LIFE CYCLE
d) SMAM
1.5 Software Engineering for Data Science
1.5.1 DataOps
1.5.2 MLOps
6. Clustering (6 hrs)
6.1. Cluster analysis concepts.
6.2. Partitioning methods – k-Means algorithm
6.3. Hierarchical methods for cluster analysis
6.4. Density based methods for cluster analysis - DBSCAN
6.5. Evaluation of clustering algorithms
Course No
Lead Instructor
Sessio
n No. Topic Title Resource Reference
Clustering
12
• Density based methods for cluster T1 – Ch 5
analysis – DBSCAN T4 – Ch 8
• Hierarchical methods for cluster
analysis
Clustering
13 T1 – Ch 5
• Evaluation of clustering
algorithms
Anomaly Detection
14
• Concepts of Outliers
• Statistical approaches T1 – Ch 9
• Proximity and Density based T4 – Ch 11
outlier detection
Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
Note:
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 8
Syllabus for Comprehensive Exam (Open Book): All topics (Session Nos. 1 to 16)
Important links and information:
Contact sessions: Students should attend the online lectures as per the schedule provided
on the Elearn portal.
Evaluation Guidelines:
1 EC-1 consists of two Quizzes. Students will attempt them through the course pages
on the Elearn portal. Announcements will be made on the portal, in a timely manner.
2 EC-2 consists of either one or two Assignments. Students will attempt them through
the course pages on the Elearn portal. Announcements will be made on the portal, in
a timely manner.
3 For Closed Book tests: No books or reference material of any kind will be permitted.
4 For Open Book exams: Use of books and any printed / written reference material
(filed or bound) is permitted. However, loose sheets of paper will not be allowed.
Use of calculators is permitted in all exams. Laptops/Mobiles of any kind are not
allowed. Exchange of any material is not allowed.
5 If a student is unable to appear for the Regular Test/Exam due to genuine exigencies,
the student should follow the procedure to apply for the Make-Up Test/Exam which
will be made available on the Elearn portal. The Make-Up Test/Exam will be
conducted only at selected exam centres on the dates to be announced later.
It shall be the responsibility of the individual student to be regular in maintaining the self-
study schedule as given in the course hand-out, attend the online lectures, and take all the
prescribed evaluation components such as Assignment/Quiz, Mid-Semester Test and
Comprehensive Exam according to the evaluation scheme provided in the hand-out.