0% found this document useful (0 votes)

19 views

M.Sc. Data Science Syllabus Final draft for submission (1)

Uploaded by

Ankam Yashwanth Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views

M.Sc. Data Science Syllabus Final draft for submission (1)

Uploaded by

Ankam Yashwanth Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 33

TELANGANA STATE COUNCIL OF HIGHER EDUCATION

(TSCHE)

Scheme and Syllabus for

M.Sc. (Data Science) Two years Course

(w.e.f. the Academic year 2023-24)

Page 1 of 33
Syllabus Drafting Committee Constituted by TSCHE

S.No. Name of the Member with Designation

1 Prof. S. Ramachandram, Vice-Chancellor, Anurag University, Hyderabad

2 Prof. P.V. Sudha, Dean, Informatics, Osmania University, Hyd

3 Prof. N.Ch. Bhatracharyulu, Head, Department of Statistics, Osmania University

4 Prof. G. Jayasree, Department of Statistics, Osmania University, Hyd

5 Prof. N. Kishan, Department of Mathematics, Osmania University, Hyd

6 Prof. B. Surender Reddy, Department of Mathematics, Osmania University, Hyd

7 Mr. Satish, Devarakonda, Regional Head TCSiON

8 Mr. Krishna, TCSiON

9 Mr. Alok Ranjan, Head, Accenture Research

10 Mr. Ugamurthy Duraiswamy, MassMutual India

11 Mr. Sathsih Thipparapu, COO, Swinfy Solutions Pvt. Ltd.

12 Dr. A.S.N. Pavani, OSD, Academics, Higher Education, TSWREIS

Page 2 of 33
M.Sc. Data Science Course Curriculum
(w.e.f. Academic Year 2023-24)
M.SC. (Data Science) FIRST YEAR
I-SEMESTER
Paper Max.
Title of the paper Credits # Hours
Code Marks
MDS-101 Mathematical Foundations for Data Science 4 4 100

MDS-102 Design and Analysis of Algorithms 4 4 100

MDS-103 Software Engineering 4 4 100

A: Principles of Data Science

MDS-104 4 4 100
B: Java Programming

MDS-105 Design and Analysis of Algorithms Lab 2 4 50

MDS-106 A: Principle of Data Science Lab 2 4 50

B: Java Programming Lab

Total 20 24 500

II-SEMESTER

Paper # Max.
Title of the paper Credits
Code Hours Marks
MDS-201 Statistical Inference 4 4 100

MDS-202 Data Visualization Techniques 4 4 100

MDS-203 Cloud Computing 4 4 100

MDS-204 Artificial Intelligence 4 4 100

MDS-205 Advanced Machine Learning Techniques 4 4 100

MDS-206 Statistical Inference & Data Visualisation Lab 2 4 50

MDS-207 Advanced Machine Learning Lab 2 4 50

Total 24 28 500

Page 3 of 33
M.SC. (Data Science) SECOND YEAR

III-SEMESTER

Paper Code Title of the paper Credits # Hours Max.

Marks
MDS-301 Deep Learning Techniques 4 4 100
MDS-302 Machine Learning Operations 4 4 100
a) Data Mining
b) Text Data Analytics
MDS-303
4 4 100
(E-I) c) Enterprise Architecture
d) Business Intelligence
a) Data Stream Mining
MDS-304 b) Sentimental Analysis 4 4 100
(E-II)
c) Scalable Architecture
d) Computer Vision
MDS-305 Deep Learning Techniques Lab 2
MDS-306 Capstone Project-I 2 4 50
MDS-307 Seminar 2 4 50
No of Credits 22 24 500

Elective Streams E-I & E-II:

A: Data Mining and Data stream mining
B: Text Data Analytics and Sentimental Analysis
C: Architecting Applications and Scalable Architecture
D: Business Intelligence and Computer Vision

IV-SEMESTER

Paper Max.
Title of the paper Credits # Hours
Code Marks
MDS-401 Research Methodology 2 2 100

MDS-402 Capstone Project-II 12 24 100

No of Credits 24 26 500

Page 4 of 33
M.SC. (DATA SCIENCE) I-SEMESTER
MDS-101: PAPER- I: MATHEMATICAL FOUNDATIONS FOR DATA SCIENCE
SYLLABUS
UNIT – I
Vector Spaces: Vector spaces, Subspaces, Basis and dimension of a vector space, linear
dependence and independence, spanning set. Linear transformation, kernel, range, Matrix
Representation of a linear transformation, rank- nullity theorem, change of basis and similar
matrices Inner-product spaces, orthogonal sets and bases, Orthogonal Projection, Gram-
Schmidt orthogonalization process
UNIT-II
Matrices: Trace and Rank of a Matrix and their properties, Determinants, Inverse, symmetric,
orthogonal and idempotent matrices and their properties, Gauss elimination, row canonical
form, diagonal form, triangular form,
UNIT – III
Eigenvalue Problems: Characteristic roots and vectors, Caley-Hamilton theorem,
Diagonalization of a Matrix, algebraic and geometric multiplicity of a characteristic root and
spectral decomposition of a real symmetric matrix., Singular value Decomposition. Gauss-
Jordan-LU decomposition, Singular Value Decomposition,
UNIT – IV
Quadratic forms: Real quadratic forms (QFs), reduction and classification of QFs, index and
signature. Simultaneous reduction of two QFs. Extreme form of a QF. Moore-Penrose and
generalized inverses and their properties.

REFERENCE BOOKS
1. Gilbert Strang (2016): Introduction to linear algebra, 5/e., Wellesley-Cambridge.
2. David C. Lay (2019): Linear Algebra and Its Applications, Pearson, 5/e.
3. Graybill, F.A. (1983) : Matrices with applications in statistics, 2 ed, Wadsworth.
nd

4. Rao,C.R., Mithra, S.K. (1971): Generalised inverse of matrices and its applications, John
Wiley & Sons Inc.
5. Rao, C.R. and Bhimasankaram, P. (1992): Linear algebra, TMH.

Page 5 of 33
M.SC. (DATASCIENCE) I-SEMESTER
MDS-102: PAPER- II: DESIGN AND ANALYSIS OF ALGORITHMS
SYLLABUS
UNIT I
Introduction to Algorithms: Algorithm Specification, Performance Analysis, Randomized
Algorithms.
Elementary Data Structures: Stacks and Queues, Trees, Dictionaries, Priority Queues, Sets
and Disjoint Set Union, Graphs.
Divide and Conquer: Binary Search, Finding the Maximum and Minimum, Merge Sort; Quick
Sort, Selection sort, Strassen's Matrix Multiplication, Convex Hull.
UNIT-II
Greedy Method: Knapsack Problem, Job Sequencing with Deadlines, Minimum-Cost
Spanning Trees (Kruskal’s & Prim’s), Single Source Shortest Paths (Dijkstra’s).
Dynamic Programming: General Method, Multistage Graphs, All-Pairs Shortest Paths,
Single-Source Shortest Paths, Optimal Binary Search Trees, 0/1 Knapsack, Traveling
Salesperson Problem.
UNIT-III
Basic Traversal and Search Techniques: Techniques for Binary Trees, Techniques for
Graphs, Connected Components and Spanning Trees, Biconnected Components and DFS.
Back Tracking: General Method, 8-Queens Problem, Sum of Subsets, Graph Colouring,
Hamiltonian Cycles, Knapsack Problem.
Branch-Bound: General Method, 0/1 Knapsack Problem, Traveling Sales Person problem.
UNIT -IV
NP-Hard and NP-Complete Problems: Basic Concepts, Cook's Theorem, NP-Hard. Graph
Problems, NP-Hard Scheduling Problems, NP-Hard Code Generation, Some Simplified NP-
Hard Problems.
REFERENCE BOOKS
1. E Horowitz, S Sahni, S Rajasekaran, (2007): Fundamentals of Computer Algorithms, 2/e,
Universities Press.
2. T.H. Cormen, CE Leiserson, R.L Rivert, C Stein, (2010): Introduction to Algorithms, 3/e,
PHI.
3. R. Pannerselvam (2007): Design and Analysis of Algorithms, PHI.
4. Hari Mohan Pandey, (2009): Design, Analysis and Algorithm, University Science Press.

Page 6 of 33
M.SC. (DATA SCIENCE) I-SEMESTER
MDS-103: PAPER- III: SOFTWARE ENGINEERING
SYLLABUS
UNIT – I
Software Engineering: The Nature of Software, Changing Nature of Software, Defining the
Discipline, Software Process, Software Engineering Practice. The Software Process: A Generic
Process Model, Defining a Framework Activity, Process Assessment and Improvement,
Prescriptive Process Models, Specialized Process Models, Unified Process, Personal and Team
Process Models. Defining Agility, Agile Process, Extreme Programming, Psychology of
Software Engineering, Software Team Structures, Software Engineering Using the Cloud,
Global Teams.
UNIT – II
Requirements: Core Principles of Modeling, Requirements Engineering, Establishing the
Groundwork, Eliciting Requirements, Developing Use Cases, Building the Analysis Model,
Requirements Analysis, UML Models That Supplement the Use Case, Identifying Analysis
Classes, Specifying Attributes, Defining Operations, Class Responsibility-Collaborator
Modeling, Associations and Dependencies, Analysis Packages. Design Concepts: Design
within the Context of SE, Design Process, Design Concepts, Design Model, Software
Architecture, Architectural Styles, Architectural Considerations, Architectural Design,
Component, Designing Class-Based Components, Conducting Component-Level Design,
Component-Based Development, User Interface Design Rules.
UNIT – III
Quality Management: Quality, Software Quality, Software Quality Dilemma, Achieving
Software Quality, Defect Amplification and Removal, Reviews, Informal Reviews, Formal
Technical Reviews, Elements of Software Quality Assurance, SQA Tasks, Goals, and Metrics,
Software Reliability, A Strategic Approach to Software Testing, Test Validation Testing, System
Testing, Debugging, Software Testing Fundamentals, White-Box Testing, Black-Box Testing,
Path Testing, Control Structure Testing, Object-Oriented Testing Strategies& Methods,
Security Engineering Analysis, Security Assurance, Security Risk Analysis.
UNIT – IV
Software Configuration Management, SCM Process, Product Metrics for Requirements
Model, Design Model, Source Code, Testing and Maintenance. Managing Software Projects:
The Project Management Spectrum, W5HH Principle, Metrics in the Process and Project
Domains, Software Measurement, Metrics for Software Quality, Integrating Metrics within the
Software Process, Software Project Estimation, Decomposition Techniques, Project
Scheduling – basics, scheduling, Software Risks, Risk Mitigation, Monitoring, and
Management, Software Maintenance, Software Reengineering, Reverse Engineering, Forward
Engineering.
REFERENCE BOOKS
1. Roger S Pressman, B R Maxim, Software Engineering – A Practitioner’s Approach (8e)
References
2. Ian Sommerville, Software Engineering
3. Hans Van Vliet, Software Engineering
4. D. Bell, Software Engineering for Students
5. K.K. Aggarwal, Y. Singh, Software Engineering
6. R. Mall, Fundamentals of Software Engineering

Page 7 of 33
M.SC. (DATASCIENCE) I-SEMESTER
MDS-104(A): PAPER- IV(A): PRINCIPLES OF DATA SCIENCE
(Mandatory for B.Sc. Computer Science Stream)
SYLLABUS
UNIT – I
Introduction to Data: Data types, Measurement of scales, understanding data with
descriptive statistics and with data visualization and data pre-processing steps. Data
transformations (Standardize, Normalize, converting data from one scale to other scales).
Parametric & Non-Parametric tests (z-, χ , t-, F-tests; Sign test, Median, Wilcoxon sign rank,
2

Mann-Whitney U, Run test) and Feature selection methods.

UNIT – II
Introduction to Data Modelling: Review of the modelling process, Concepts of Model
evolution, over fitting, under fitting, cross validation concepts, (train/test, K fold and Leave out
one approaches), Supervised and Un-supervised Modelling, Model Performance for
classification techniques for qualitative and Quantitative data, Model improvement and saving
models for future use.
UNIT-III
Supervised learning algorithms: Classification algorithms, Multiple Linear regression,
Multinomial logistic, Support vector Machine, Random-forest, Ada Boost, Ensemble methods.
Decision Tree, Perceptron learning, Naïve Bayes classifier.
UNIT-IV
Unsupervised learning Algorithms: Introduction to Cluster analysis, Clustering techniques,
Agglomerative Hierarchical cluster techniques, K-means, K-medoid, KNN.
REFERENCE BOOKS
1. Foster Provost and Tom Fawcett (2021): Data Science for Business, O’REILLY
Publications
2. Henrik Brink, Joseph W. Richards. Mark Fetherolf (2012): Real World Machine
Learning, Manning Publications.

Page 8 of 33
M.SC. (DATASCIENCE) I-SEMESTER
MDS-104(B): PAPER- IV(B): JAVA PROGRAMMING
(Mandatory for B.Sc. Data Science stream)
SYLLABUS
UNIT – I
Core Java: Class Object, Object Oriented Concepts with respect to Java, Interfaces, Packages
and Exception Handling, Applets, Overview of Collection Framework (No question to be set
from above topics). AWT: Introduction, AWT Class Hierarchy, Creating Container, Adding
Components, Layout, Using Panel, Text Field, Text Area, List, Checkbox, Check Box Group,
Choice, Event Handling, Dialog Boxes, ScrollBar, Menu. Swing: Containment Hierarchy,
Adding Components, JTextField, JPasswordField, JTable, JComboBox, JProgressBar, JList,
JTree, JColorChooser, Dialogs. Remote Method Invocation (RMI): Introduction, Remote
Method Invocation, Java RMI Interfaces and Classes, an Application, Compiling the Program,
Generating Stub Classes, Running the Program, Callback with an Application.
UNIT – II
Servlet: Server-Side Java, Servlet Alternatives, Servlet Strengths, Servlet Architecture, Servlet
Life Cycle, GenericServlet, HttpServlet, Servlet Example, Passing Parameters to Servlets,
Retrieving Parameters, Cookies, Filters. Java Server Pages (JSP): Introduction, JSP Engines,
How JSP Works, JSP and Servlet, Anatomy of a JSP Page, JSP Syntax, JSP Components, Beans,
Session Tracking, Users Passing Control and Data between Pages, Sharing Session and
Application Data.
UNIT – III
Java Database Connectivity (JDBC): Introduction, JDBC Drivers, JDBC Architecture,
JDBC Classes and Interfaces, Loading a Driver, Making a Connection, Execute SQL
Statement, SQL Statements, Retrieving Result, Getting Database Information, Scrollable and
Updatable Resultset, Result Set Metadata. Hibernate: Introduction, Writing POJO Class,
Creating a Table, Writing a Hibernate Application, Compiling and Running Application, Book
Application Using Annotation, Object Life Cycle, HQL, Using Native SQL Query, Named
Queries, Generating DDL, Generator Class, Hibernate Tools.
UNIT – IV
Java Naming and Directory Interface (JNDI): Naming Concepts, Directory Concepts, Java
Naming and Directory Interface, Specifying JNDI Properties, Name Servers, Naming
Operations, Working with Directory. Overview of J2EE, Introduction to JavaBeans,
Advantages of JavaBeans, Properties of JavaBeans with examples, JavaBeans API, Enterprise
JavaBeans (EJB), Applications using Session Beans and Entity Beans, Introduction to Struts
Framework. Java Server Faces (JSF): Introduction, Simple Application, Request Processing
Life-Cycle, Tracing Phases, Managed Bean, Basic JSF Tags, Expression Language, Event
Handling with Example, Page Navigation.
REFERENCE BOOKS
1. Uttam K. Roy, Advanced Java programming
2. Herbertt Schildt, Java Complete Reference
3. Cay S. Horstmans, Gray Coronell, Core Java Vol. II – Advanced Features
4. Sharanam Shah, Vaishali Shah, Java EE 7 for Beginners

Page 9 of 33
M.SC. (DATASCIENCE) I-SEMESTER
MDS-105 (LAB-1): PAPER- V: DESIGN & ANALYSIS OF ALGORITHMS LAB
List of Experiments (Using Python)

1. Write a program for sorting the given list using: Insertion Sort, Selection Sort, Merge
Sort, Quick Sort, Heap Sort.
2. Write a program to find the given number in a list using Sequential Search, Binary
Search.
3. Write a program to find the minimal spanning tree for a weighted graph using Kruskal’s
and prims Algorithms.
4. Write a program to find the shortest path in a weighted graph using Dijkstra’s Algorithm.
5. Write a program to solve using dynamic programming technique for Travelling sales man
problem. Multistage Graph problem, All-Pairs Shortest Paths (Warshal), Single-Source
Shortest Paths (Bellman ford), Optimal Binary Search Trees.
6. Write a program to solve Knapsack problem using Back tracking

M.SC. (DATASCIENCE) I-SEMESTER

MDS-106(A) (LAB-2): PAPER- VI: PRINCIPLES OF DATA SCIENCE LAB
List of Experiments (Using Python)
1. Data Preprocessing
2. Write a Python code for the implementation of Classification techniques for the data
sets:
a) Multiple Linear regression,
b) Multinomial logistic,
c) Support vector Machine,
d) Random-forest,
e) Ada Boost/ XG Boost.
f) Decision Tree,
g) Naïve Bayes classifier.
3. Write a Python code for the implementation of Cluster analysis for the data sets using:
a) Agglomerative Hierarchical cluster techniques,
b) K-means,
c) KNN.

Page 10 of 33
M.SC. (DATASCIENCE) I-SEMESTER
MDS-106(B) (LAB-2): PAPER- VI: JAVA PROGRAMMING LAB
List of Experiments
1. Create GUI to present a set of choices for a user to select stationary products and display
the price of Product after selection from the list.
2. Create GUI to demonstrate typical Editable Table which describing Employee for a
software company.
3. Create GUI to demonstrate swing components using student registration form.
4. Create a Remote Object for simple arithmetic operators. Use AWT/SWING to create
user interface.
5. Write an RMI application using call back mechanism
6. Develop Servlet Question-Answer Application using HttpServletRequest and
HttpServletRequest interfaces.
7. Develop Servlet application to accept HTNO of a student from client and display the
memorandum of marks from the server
8. JSP Programs a. Create a JSP page that prints temperature conversion (from Celsius to
Fahrenheit) chart b. Create a JSP page to print current date and time c. Create a JSP
page to print number of times page is referred after the page is loaded.
9. Write a simple JSP application to demonstrate the use of implicit object (at least 5).
10. Develop a Hibernate application to Store Feedback of Website Visitors in MySQL
Database.
11. Develop a JSP Application to accept Registration Details from the user and store
database table.
12. Develop a JSP Application to Authenticate User Login as per the Registration Details.
If Login Success then forward User to Index Page otherwise show Login failure
Message.
13. Develop a web Application to add items in the inventory using JSF.
14. Write EJB applications using stateless session beans and state-full session beans.
15. Develop a Room Reservation System Application using Entity Beans.
16. Create Three-tire application using Servlets, JSP, EJB.

Page 11 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-201: PAPER- I: STATISTICAL INFERENCE

SYLLABUS
UNIT-I
Estimation Theory: Desirable properties of a good estimator: Unbiasedness, consistency,
efficiency and sufficiency - examples. Cramer-Rao inequality, Rao-Blackwell theorem, Fisher
Information, and Lehmann-Scheffe theorem. Simple Problems on UMVUE.
UNIT-II
Methods of Estimation: Method of maximum likelihood estimation and its Properties, Simple
problems on MLE. Jackknife, Bootstrap resampling methods. Estimation of bias and standard
deviation of point estimation by the Jackknife & bootstrap methods with examples, U-statistic,
Kernal and examples. Interval estimation, confidence level CI using pivots and shortest length
CI and example problems.
UNIT-III
Testing of Hypotheses: Neyman-Pearson Lemma, Most Powerful tests, Uniformly Most
Powerful tests, Likelihood ratio tests, Sequential Probability Ratio Tests.
Non parametric tests: One and two sample tests (Kolmogorov Smirnov, Kruskal Wallis &
Friedman test, Kendal’s tau, Ansari broadly tests.)
UNIT-IV
Non-parametric Density Estimation: Rosenblatt’s naïve density estimator, its bias and
variance. Consistency of Kernel density estimators and its MSE.
Simulation: Introduction, generation of random numbers for Uniform, Normal, Exponential,
Cauchy and Poisson Distributions. Estimating the reliability of the random numbers. Priori and
Posteriori distributions, conjugate families, Bayesian estimation of parameters, MCMC
algorithms: Metropolis Hasting and Gibbs Sampler.
REFERENCES
1. Rohatgi, V.K.: An Introduction to Probability Theory and Mathematical Statistics (Wiley)
2. Gibbons: Non-Parametric Statistical Inference, (TMH)
3. Lehman, E. L.: Testing of hypothesis, John Wiley
4. Goon, Gupta and Das Gupta: Outlines of Statistics, Vol. II, World Press.
5. C.R. Rao – Linear Statistical Inference (John Wiley)

Page 12 of 33
M.SC. (DATA SCIENCE) II-SEMESTER
MDS-202: PAPER- II: DATA VISUALIZATION TECHNIQUES
SYLLABUS

UNIT-I
Introduction to Data Visualization: Introduction, importance of Data visualization, Human
perception – Methodology – Seven Stages of Data Visualization - Data Visualization Tools.
Visualizing Data: Mapping Data onto Aesthetics – Visualizing Amounts - Visualizing
Distributions: Histograms and Density Plots – Visualizing Propositions: – Visualizing
Associations: Among Two or More Quantitative Variables – Visualizing Time Series and
Other Functions of an Independent Variable – Trends – Visualizing Geospatial Data.
UNIT-II
Basic Applied Visualizations: Data Visualization: One dimensional (Pictogram, Pie Chart,
Bar Chart,), two-dimensional (Histogram, Line plot, frequency curves& polygons, ogive
curves, Scatter Plot,) and other data visualization techniques. Gantt Chart, Heat Map, Box and
Whisker Plot, Waterfall Chart, Area Chart, Stacked Bar Charts - Sub Plots – Matplotlib,
Seaborn Styles, Box plot - Density Plot - - Tree map - Graph Networks.
UNIT-III
Interactive Visualisation: Introduction to D3 - Fundamental Technology: The Web – HTML
– DOM – CSS – JavaScript – SVG. D3 Setup – Generating Page Elements – Binding Data -
Drawing with data – Scales: Domains and Ranges – Normalization – Creating a Scale – Scaling
the Scatter Plot – Other Methods and Other Scales. Axes – Modernizing the Chart – Update
the Data – Transition – Updates – Interactivity.
Timeline, Highlight Table, Bullet Graph, Choropleth Map, Word Cloud, Path diagram,
Network Diagram, Correlation Matrices.
UNIT-IV
Principles of Information Visualization: Visual Perception and Cognition, Gestalt's
Principles, Tufte's Principles, Applications of Principles of Information Visualization,
Dashboard Design.
REFERENCE BOOKS
1. Julie Steele, Noah Iliinsky (2010); Beautiful Visualization Publisher(s): O'Reilly Media
2. Edward R. Tufte (20 16): The Visual Display of Quantitative Information” by
3. Kieran Healy(20160: Data Visualization: A Practical Introduction,
4. Claus O. Wilke (2020) Fundamentals of Data Visualization: A Primer on Making
Informative and Compelling Figures:, O’relly
5. Stephanie D.H. Evergreen(2020): Effective Data Visualization: The Right Chart for the
Right Data

Page 13 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-203: PAPER- III: CLOUD COMPUTING

SYLLABUS

UNIT- I
Introduction, Benefits and challenges, Cloud computing services, Resource Virtualization,
Resource pooling sharing and provisioning, Case study of Iaas, Paas and Saas
UNIT -II
Scaling in the Cloud, Capacity Planning, Load Balancing, File System and Storage,
Containers. Multi-tenant Software, Data in Cloud , Database Technology.
UNIT-III
Content Delivery Network, Security Reference Model , Security Issues, Privacy and
Compliance Issues, Portability and Interoperability Issues, Cloud Management and a
Programming Model Case Study, Popular Cloud Services.
UNIT- V
Enterprise architecture and SOA, Enterprise Software, Enterprise Custom Applications,
Workflow and Business Processes, Enterprise Analytics and Search, Enterprise Cloud
Computing Ecosystem.

REFERENCE BOOKS
1. Sandeep Bhowmik (2017): Cloud Computing, Cambridge University Press.
2. Gautam Shroff (2016): Enterprise Cloud Computing - Technology, Architecture,
Applications by Cambridge University Press.
3. Kai Hwang, Geoffrey C. Fox, Jack J. Dongarra (2012): Distributed and Cloud
Computing From Parallel Processing to the Internet of Things, Elsevier.

Page 14 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-204: PAPER- IV: ARTIFICIAL INTELLIGENCE

SYLLABUS

UNIT - 1
Introduction: History Intelligent Systems, Foundations of Artificial Intelligence, Sub areas of
Al, Applications. Problem Solving - State - Space Search and Control Strategies: Introduction,
General Problem-Solving Characteristics of problem, Exhaustive Searches, Heuristic Search
Techniques, Iterative - Deepening A*, Constraint Satisfaction. Game Playing, Bounded Look
- ahead Strategy and use of Evaluation Functions, Alpha Beta Pruning.
UNIT – II
Logic Concepts and Logic Programming: Introduction, Propositional Calculus Propositional
Logic, Natural Deduction System, Axiomatic System, Semantic Table, A System in
Propositional Logic, Resolution, Refutation in Propositional Logic, Predicate Logic, Logic
Programming. Knowledge Representation: Introduction, Approaches to knowledge
Representation, Knowledge Representation using Semantic Network, Extended Semantic
Networks for KR, Knowledge Representation using Frames.
UNIT - III
Expert System and Applications: Introduction, Phases in Building Expert Systems Expert
System Architecture, Expert Systems Vs Traditional Systems, Truth Maintenance Systems,
Application of Expert Systems, List of Shells and tools. Uncertainty Measure - Probability
Theory: Introduction, Probability Theory, Bayesian Belief Networks, Certainty Factor Theory,
Dempster - Shafer Theory.
UNIT – IV
Advanced Knowledge Representation Techniques: Case Grammars, Semantic Web. Natural
Language Processing: Introduction, Sentence Analysis Phases, Grammars and Parsers, Types
of Parsers, Semantic Analysis, Universal Networking Knowledge.
REFERENCE BOOKS
1. Saroj Kaushik, Artificial Intelligence, Cengage Learning India, First Edition, 2011.
2. Russell, Norvig, Artificial Intelligence: A Modern Approach, Pearson Education, 2nd
Edition, 2004.
3. Rich, Knight, Nair, Artificial Intelligence, Tata McGraw Hill, 3rd Edition 2009.

Page 15 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-205: PAPER- V: ADVANCED MACHINE LEARNING TECHNIQUES

SYLLABUS

UNIT – I
Multivariate Data Analysis Techniques: Path analysis, Correspondence analysis.
Multidimensional Scaling, Feature extraction and Feature selection techniques, Inter and intra
class distance measures, Probabilistic distance measures, Principal component analysis,
Factor analysis, Canonical Correlations and Canonical Variables. Conjoint Analysis.
UNIT - II
Classification Techniques-I: Linear classifies, Multiple Linear regression, Logistic
regression, Linear Discriminant Function (for binary outputs) with minimum squared error,
Linear discriminant function using likelihood ratios based on Multivariate normal populations,
Bayes Mis-classification; Naïve Bayes classifier, Support Vector Machines,
UNIT – III
Classification Techniques-II: Decision Tree algorithms, Random Forest algorithm, Bagging,
Gradient boosting, Ada-Boosting and XG-Boosting algorithm, KNN algorithm, Market-Basket
Analysis.
UNIT – IV
Cluster Analysis: Introduction, similarities and dissimilarities, Hierarchical clustering,
Single linkage method, k-means and k-Nearest Neighbourhood (KNN) clustering,
Categorical Data Analysis techniques and their applications; Time series forecasting: ARIMA
model, ARCH & GARCH model.

REFERENCE BOOKS
1. Johnson, R.A, and Dean W. Wichern: Applied Multivariate Statistical Analysis.
2. Morrison, D: An Introduction to Multivariate Analysis.
3. Seber: Multivariate Observations
4. Anderson: An Introduction to Multivariate Analysis.
5. Bishop: Analysis of Categorical data.

Page 16 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-206 (LAB-1): PAPER- VI: DATA VISUALISATION & INFERENCE LAB

List of Experiments (using Python)

1. Drawing One dimensional diagrams (Pictogram, Pie Chart, Bar Chart,).

2. Drawing two-dimensional (Histogram, Line plot, frequency curves & polygons, ogive
curves, Scatter Plot)
3. Drawing 3D and other data visualization techniques.
4. Drawing Gantt Chart, Heat Map, Box - Whisker Plot, Waterfall Chart, Area Chart, Stacked
Bar Charts –
5. Drawing Density Plot, Bullet Graph, Choropleth Map, Tree map, Path diagram, Network
Diagram, Correlation Matrices.
6. Generation of Jackknife and Bootstrap samples and its parameter estimation.
7. Computation of Confidence Interval estimation of parameters.
8. Small and Large sample tests (for Mean(s), Standard deviation/variance(s), Proportion(s)).
9. Non parametric tests: One and two sample tests (Kolmogorov Smirnov, Kruskal Wallis
& Friedman test, Kendal’s tau, Ansari broadly tests.)
10. Generation of random numbers for Uniform, Normal, Exponential, Cauchy and Poisson
Distributions.
11. Bayesian estimation of parameters, (using Metropolis Hasting and Gibbs Sampler).

Page 17 of 33
M.SC. (DATASCIENCE) II-SEMESTER

MDS-207: (LAB-2): PAPER- VII: ADVANCED MACHINE LEARNING LAB

List of Experiments

1. Implement and demonstrate the use of set of training data samples. Read the training data
from a .CSV file.
2. Write a program to demonstrate the working of the decision tree-based ID-3 algorithm.
Use an appropriate data set for building the decision tree and apply this knowledge to
classify a new sample.
3. Build an Artificial Neural Network by implementing the Backpropagation algorithm and
test the same using appropriate data sets.
4. Write a program to implement the naïve Bayesian classifier for a sample training data set
stored as a .CSV file. Compute the accuracy of the classifier, considering few test data
sets.
5. Assuming a set of documents that need to be classified, use the naïve Bayesian Classifier
model to perform this task. Built-in Java classes/API can be used to write the program.
Calculate the accuracy, precision, and recall for your data set.
6. Write a program to construct a Bayesian network considering medical data. Use this
model to demonstrate the diagnosis of heart patients using standard Heart Disease Data
Set. You can use Java/Python ML library classes/API.
7. Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set
for clustering using k-Means algorithm. Compare the results of these two algorithms and
comment on the quality of clustering. You can add Java/Python ML library classes/API
in the program.
8. Write a program to implement k-Nearest Neighbour algorithm to classify the iris data
set. Print both correct and wrong predictions. Java/Python ML library classes can be used
for this problem.
9. Implement the non-parametric Locally Weighted Regression algorithm in order to fit data
points. Select appropriate data set for your experiment and draw graphs.

Page 18 of 33
M.SC. (DATASCIENCE) III-SEMESTER

MDS-301: PAPER- I: DEEP LEARNING TECHNIQUES

SYLLABUS

UNIT – I
Artificial Neural Networks: Introduction, Biological Activations of Neuron; Artificial
Neuron Models: McCulloch-Pitts, Perceptron, Adaline, Hebbian Models; Characteristics of
ANN, Types of Neuron Activation Function, Signal functions and their properties,
monotonicity, ANN Architecture, Classification Taxonomy of ANN, Supervised, Un-
supervised and Reinforcement learning; Learning tasks, Memory, Adaptation, Statistical nature
of the learning process. Statistical learning theory. Gathering and partitioning of data for ANN
and its pre and post processing.
UNIT – II
Supervised learning algorithms: Perceptron Learning Algorithm, Derivation, Perceptron
convergence theorem (statement); Multi-layer Perceptron Learning rule, limitations.
Applications of the Perceptron learning. Gradient Descent Learning, Least Mean Square
learning, Widrow-Hoff Learning. Feed-forward and Feed-back Back-Propagation Algorithms
and derivation.
UNIT – III
Unsupervised learning Algorithms: Hebbian Learning, Competitive learning. Self-
Organizing Maps, SOM algorithm, properties of feature map, computer simulations, Vector
quantization, Learning vector quantization.
Radial Basis Function Networks, Approximation properties of Radial Basis Function
Networks. Boltzman Machine, Hopfield model.

UNIT – IV
Reinforcement learning, Markov Decision Process, Hidden Markov Model, Convolutional
Neural Networks, Recurrent Neural Networks, Long-Short Term Memory Networks,
Generative Adversarial Networks, Deep belief Networks.
REFERENCES
1. Haykin, S. (1994). Neural Networks: A Comprehensive Foundation. New York: Macmillan
Publishing. A comprehensive book and contains a great deal of background theory
2. Yagnanarayana, B. (1999): “Artificial Neural Networks” PHI
3. Bart Kosko(1997): Neural Networks and Fuzzy systems, PHI
4. Jacek M. Zurada(1992): Artificial Neural Systems, West Publishing Company.
5. Carling, A. (1992). Introducing Neural Networks. Wilmslow, UK: Sigma Press.
6. Fausett, L. (1994). Fundamentals of Neural Networks. New York: Prentice Hall.
7. Box and Jenkins: Time Series analysis, Springer
8. Brockwell,P.J., and Davis,R.A.: Time Series : Theory and Methods (Second Edition).
Springer–Verlag.

Page 19 of 33
M.Sc. (DATA SCIENCE) III-SEMESTER

MDS-302: PAPER- II: MACHINE LEARNING OPERATIONS

SYLLABUS

UNIT – I
Introduction to ML Operations: Overview of ML Ops and its significance in the
industry, Challenges in deploying and managing machine learning models, Principles of
DevOps and its application to ML Ops, Introduction to cloud platforms and tools for ML Ops
UNIT – II
Data Management and Version Control: Data versioning and management in ML projects,
Implementing reproducibility and data lineage, Git and version control for ML models and
pipelines, Collaborative development workflows for ML Ops
UNIT – III
Model Deployment and Monitoring: Model packaging and containerization, Deploying models
to production environments, Infrastructure orchestration and scaling for ML workloads,
Monitoring model performance and managing drift
UNIT – IV
Continuous Integration and Delivery for ML: Automated testing and validation of ML models,
Continuous integration pipelines for ML Ops, Continuous deployment and release strategies,
Feedback loops and continuous improvement in ML Ops

REFERENCE BOOKS

1. Building Machine Learning Pipelines by Hannes Hapke and Catherine Nelson

2. Data Science on AWS: Implementing End-to-End, Continuous AI and Machine
Learning Pipelines by Chris Fregly and Antje Barth
3. MLOps with Azure: Machine Learning Operations on Azure by Srinivasa Rao and
Amit Kumar Saha

Page 20 of 33
M.SC. (DATASCIENCE) III-SEMESTER

MDS-303 (E-I): PAPER- III (A): DATA MINING

SYLLABUS
UNIT-I
Introduction: Why Data Mining? What is Data Mining? What kinds of data can be mined?
What kinds of patterns can be mined? Which technologies are used ? Which kinds of
applications are Targeted? Major issues in Data Mining. Getting to know your data: Data
objects and attributed types. Basic statistical descriptions of data. Data visualization,
Measuring data similarity and dissimilarity.
UNIT-II
Mining frequent patterns, Associations and correlations: Basic concepts and methods,
Frequent Item set Mining Methods, Which patterns are interesting? Pattern evaluation methods.

UNIT-III
Classification: Basic concepts, Decision tree induction, Bayes classification methods,
Classification: Advance methods, Bayesian Belief Network, Classification by back
propagation, Support vector machine,
UNIT-IV
Cluster Analysis: Concepts and Methods: Cluster Analysis, Partitioning Methods,
Hierarchical Methods, Density-Based Methods, Grid-Based Methods, Evaluation of clustering.
UNIT-V
Data Mining Trends and Research Frontiers: Mining Complex Data Types, Other
Methodologies of Data Mining, Data Mining Applications, Data Mining and Society, Data
Mining trends.
REFERENCE BOOKS

1. Jiawei Han, Micheline Kamber, Jin Pei, Data Mining: Concepts & Techniques, 3rd
Edition., Morgon Koffman ,2011
2. Vikram Pudi P.Radha Krishna, Data Mining, Oxford University Press, 1st Edition, 2009.
3. Pang-Ning Tan, Michael Steinbach, Vipin kumar, Introduction to Data Mining, Pearson
Education, 2008.

Page 21 of 33
M.SC. (DATASCIENCE) III-SEMESTER

MDS-303 (E-I): PAPER- III (B): TEXT DATA ANALYTICS

SYLLABUS

UNIT - I
Introduction to Natural Language Processing Basics, Language Syntax and Structure (Words,
Phrases, Clauses, & Grammar), Language Semantics Processing, (Lexical Semantic Relations,
Homonyms, Homographs, and Homophones, Capitonyms, Hyponyms and Hypernyms), Text
Corpora (Corpora Annotation and Utilities), Accessing Text Corpora (Brown Corpus,
WordNet Corpus) and NLP Applications (Machine Translation, Text Summarization and
Text categorization)
UNIT – II
Concept of the Tokenization, Sentence Tokenization, Word Tokenization, Concept of the Text
Normalization, (Cleaning Text, Removing Special characters, Removing stop
words,..etc) correcting words using stemming and Lemmatization and Understanding text
syntax and structure (POS tagging and Parsing)
UNIT – III
Concepts of feature extraction, Methods of Feature extraction (Bag of words Model, TF-IDF
Models, Advanced word Factorization Models likes Word2vec), Strengths and weakness of
models and Word cloud. etc, Concepts of Document term matrix, Term Document Matrix
UNIT – IV
Concepts of Topic Modelling, Algorithms of Topic Modelling (Latent Semantic Indexing
(LSI), Latent Dirichlet Allocation (LDA), Non Negative Matrix Factorization (NMF) and
Similarity based text clustering models), Text Classification using supervised methods
(Like Multinomial Naïve Bayes, Support vector machines, Random Forest …), concept of
Sentiment Analysis and its applications. Sentence Subjectivity and Sentiment Classification;
Sentiment Lexicon generation and Summarization;

REFERENCE BOOKS
1. Chapman & Hall: Handbook of Natural Language Processing, Second Edition
2. CRC: Machine Learning & Pattern Recognition, 2nd Edition
3. Christopher Manning and Hinrich Schuetze: Foundations of Statistical Natural Language
Processing
4. Dipanjan Sarkar: Text Analytics with Python, A press Publication
5. Julia Silge: Text Mining with R: A Tidy Approach, 1st Edition.

Page 22 of 33
M.SC. (DATASCIENCE) III-SEMESTER

MDS-303 (E-I): PAPER- III (C): ENTERPRISE ARCHITECTURE

SYLLABUS
UNIT-I
Introduction to EA -System analysis, general system theory, definitions and objectives of
considerations, Properties of EA, system approach to EA development, principle definitions
Business architecture, definition and features- BSC – balanced score card basics and its
reflection in EA, Strategic governance, Event Causality effects in EA under scope of BSC
UNIT-II
Organizational structure of EA and basic models- Information and technology
architecture basics, Introduction to EA structuring and modeling, Business architecture (inc.
business process modeling, IBM Component business model), Information architecture,
Technology architecture and integration between the layers model.
UNIT-III
Introduction to enterprise Engineering (EE)- Enterprise transformations (waterfall and
agile), EAP, EA methodologies: PRISM, ARIS Framework, Zachmann Framework , FEAF,
DODAF and TOGAF, Introduction to Service orientation in Enterprise Engineering (SOA,
SoEA), Technological infrastructure for Big Data handling in EA.
UNIT-IV
Cloud Computing Opportunities for EA- Flexible (agile) business and information
architectures (SoEA). Introduction to Spark, Spark Data Frames, SQL, Datasets through
worked examples. Spark’s low level APIs, RDDS, execution of SQL & Data Frames. How
Spark Runs on a Cluster. Structured Streaming, Spark’s Stream – Processing Engine.

REFERENCE BOOKS
1. Designing Enterprise Architecture Frameworks: Integrating Business Processes with
IT Infrastructure by N Zarvić, R Wieringa. Apple Academic Press (19 April 2016).
2. Neubauer M., Stary CH., S-BPM in the Production Industry. Stakeholder approach,
Springer Open, 2017.
3. A systematic literature review on Enterprise Architecture Implementation
Methodologies by Babak D., Mohd N. Elsevier (June 2015).
4. Spark : The Definite Guide – Bill Chambers, Matei Zaharia, 2018

Page 23 of 33
M.Sc. (DATA SCIENCE) III-SEMESTER

MDS-303 (E-I): PAPER- III (D): BUSINESS INTELLIGENCE

SYLLABUS

UNIT – I
Business Intelligence: Introduction – Definition, Leveraging Data and Knowledge for BI, BI
Components, BI Dimensions, Information Hierarchy, Business Intelligence and Business
Analytics. BI Life Cycle. Data for BI - Data Issues and Data Quality for BI.
UNIT - II
BI Implementation - Key Drivers, Key Performance Indicators and Performance Metrics, BI
Architecture/Framework, Best Practices, Business Decision Making, Styles of BI-vent-Driven
alerts - A cyclic process of Intelligence Creation. The value of Business Intelligence-Value
driven & Information use.
UNIT - III
Advanced BI – Big Data and BI, Social Networks, Mobile BI, emerging trends, Description of
different BI-Tools (Pentaho, KNIME)
UNIT-IV
Business intelligence implementation-Business Intelligence and integration implementation-
connecting in BI systems- Issues of legality- Privacy and ethics- Social networking and BI.

REFERENCE BOOKS

1. Rajiv Sabherwal “Business Intelligence” Wiley Publications, 2012.

2. Efraim Turban, Ramesh Sharda, Jay Aronson, David King, Decision Support and Business
Intelligence Systems, 9th Edition, Pearson Education, 2009.
3. David Loshin, Business Intelligence - The Savy Manager's Guide Getting Onboard with
Emerging IT, Morgan Kaufmann Publishers, 2009.
4. Philo Janus, Stacia Misner, Building Integrated Business Intelligence Solutions with SQL
Server, 2008 R2 & Office 2010, TMH, 2011.
5. Business Intelligence Data Mining and Optimization for decision making [Author: Carlo-
Verellis] [Publication: (Wiley)].

Page 24 of 33
M.SC. (DATASCIENCE) III-SEMESTER
MDS-304 (E-II): PAPER- IV (A): DATA STREAM MINING
SYLLABUS
UNIT-I
MOA Stream Mining, Assumptions, Requirements, Mining Strategies, Change Detection
Strategies, MOA Experimental Settings, Previous Evaluation Practices, Evaluation Procedures
for Data Streams, Testing Framework, Environments, Data Sources, Generation Speed and
Data Size, Evolving Stream Experimental Setting.
UNIT-II
Hoeffding Trees, The Hoeffding Bound for Tree Induction, The Basic Algorithm, Memory
Management, Numeric Attributes, Batch Setting Approaches, Data Stream Approaches.
UNIT-III
Prediction Strategies, Majority Class, Naïve Bayes Leaves, Adaptive Hybrid, Hoeffding Tree
Ensembles, Data Stream Setting, Realistic Ensemble Sizes. Evolving Data Streams, Algorithms
for Mining with Change, A Methodology for Adaptive Stream Mining, Optimal Change
Detector and Predictor, Adaptive Sliding Windows, Introduction, Maintaining Updated
Windows of Varying Length.
UNIT-IV
Adaptive Hoeffding Trees, Introduction, Decision Trees on Sliding Windows, Hoeffding
Adaptive Trees, Adaptive Ensemble Methods, New methods of Bagging using trees of different
size, New method of bagging using ADWIN, Adaptive Hoeffding Option Trees, Method
performance.
REFERENCE BOOKS
1. Data Stream Mining: A Practical Approach by Albert Bifet and Richard Kirkby.
2. Knowledge discovery from data streams by Gama João. ISBN: 978-1-4398-2611-9.
3. Machine Learning for Data Streams by Albert Bifet, Ricard Gavalda; MIT Press, 2017.

Page 25 of 33
M.Sc. (DATA SCIENCE) III-SEMESTER

MDS-304 (E-II): PAPER- IV (B): SENTIMENT ANALYSIS

SYLLABUS

UNIT-I
Introduction to Sentiment Analysis Introduction: Sentiment Analysis Applications -
Sentiment Analysis Research - Sentiment Analysis as Mini NLP. The Problem of Sentiment
Analysis: Definition of Opinion - Definition of Opinion Summary - Affect, Emotion, and Mood
- Different Types of Opinions - Author and Reader Standpoint. Document Sentiment
Classification: Supervised Sentiment Classification - Unsupervised Sentiment Classification -
Sentiment Rating Prediction - Cross-Domain Sentiment Classification - Cross-Language
Sentiment Classification - Emotion Classification of Documents.
UNIT-II
Subjectivity Classification and Challenges: Sentence Subjectivity and Sentiment
Classification: Subjectivity - Sentence Subjectivity Classification - Sentence Sentiment
Classification - Dealing with Conditional Sentences - Dealing with Sarcastic Sentences - Cross-
Language Subjectivity and Sentiment Classification - Using Discourse Information for
Sentiment Classification - Emotion Classification of Sentences.
UNIT-III
Sentiment Lexicon generation and Summarization: Sentiment Lexicon Generation:
Dictionary-Based Approach - Corpus-Based Approach - Desirable and Undesirable Facts.
Analysis of Comparative Opinions: Problem Definition - Identify Comparative Sentences -
Identifying the Preferred Entity Set - Special Types of Comparison - Entity and Aspect
Extraction. Opinion Summarization and Search: Aspect-Based Opinion Summarization -
Enhancements to Aspect-Based Summary - Contrastive View Summarization - Traditional
Summarization - Summarization of Comparative Opinions - Opinion Search - Existing Opinion
Retrieval Techniques. Mining Intentions: Problem of Intention Mining - Intention
Classification - Fine-Grained Mining of Intentions.
UNIT-IV
Identifying intention, fake and quality of opinion: Detecting Fake or Deceptive Opinions:
Different Types of Spam - Supervised Fake Review. Detection - Supervised Yelp Data
Experiment - Automated Discovery of Abnormal Patterns – Model
Based Behavioral Analysis - Group Spam Detection - Identifying Reviewers with Multiple
User ids - Exploiting Business in Reviews - Some Future Research Directions. Quality of
Reviews: Quality Prediction as a Regression Problem - Other Methods - Some New Frontiers.

REFERENCE BOOK
1. Bing Liu “Sentiment Analysis: Mining Opinions, Sentiments and Emotions, Cambridge
University Press, 2015.

Page 26 of 33
M.Sc. (DATA SCIENCE) III-SEMESTER

MDS-304 (E-II): PAPER- IV (C): SCALABLE ARCHETECTURE

SYLLABUS
UNIT–I
Introduction to Scalable applications, Challenges with running applications using Machine
Learning with scaling, Algorithms for Large scale Learning, Overview of Hadoop and Current
Big Data Systems. How Programming for Data Flow Differs, Basic Spark, Working with
Vectors and Matrices in Spark, Brief tour of Spark ML, Beyond parallelization, Practical Big
Data.
UNIT–II
Anatomy of Fast Data Applications, SMACK Stack – Functional Decomposition, Message
Backbone- Understanding messaging requirements, Data ingestion, Fast data& low latency,
Message Delivery Semantics, Distributing Messages.
UNIT – III
Compute Engines- Micro Batch Processing, One-at-a time Processing, Choice of processing
engine, Storage as the Fast Data Borders, The message backbone as Transition Point
UNIT-IV
Sharing Stateful Streaming State, Data Driven Micro-services, State and Micro-services.
Deployment environments for Fast Data Applications, Application containerization, resource
scheduling, Apache Mesos, Kubernetes, Cloud Deployments.
REFERENCE BOOKS
1. Jan Kunigk, Ian Buss, Paul Wilkinson & Lars George,” Architecting Modern Data
Platforms”, O‟reilly, 2019.
2. Gerard Maas, Stavros Kontopoulos, Sean Glover, “Designing Fast Data Application
Architectures”, O'Reilly Media, Inc., June 2018.
3. Bill Chambers, Matei Zaharia “Spark- The definitive Guide”, O'Reilly Media, Inc., June
2019.

Page 27 of 33
M.SC. (DATASCIENCE) III-SEMESTER

MDS-304 (E-II): PAPER- IV (D): COMPUTER VISION

SYLLABUS

UNIT I

Computer Vision Introduction: Computer vision-Image Formation: Geometric primitives and

transformation-Photometric image formation-The digital camera.

UNIT: II

Image Processing: Point Operation-Linear filtering-More neighbourhood operators, Fourier

Transforms-Pyramids and wavelets – Geometric Transformations – Global optimization.

UNIT III

Feature detection and matching: Points and patches – Edges – Lines. Segmentation: Active

Contours – Split and merge – Mean shift and mode finding – Normalized cuts –Graph cuts

and energy –based methods

UNIT-IV

Recognition: Object detection – Face recognition – Instance recognition – Category

Recognition – Context and scene understanding –Recognition datasets and test sets

REFERENCE BOOKS

1. Richard Szeliski (2011): Computer Vision-Algorithms and Applications, Springer

verlang London Limited.
2. Deep Learning, by Goodfellow, Bengio, and Courville.
3. Dictionary of Computer Vision and Image Processing, by Fisher et al.

Page 28 of 33
M.SC. (DATASCIENCE) III-SEMESTER
MDS-305 (LAB-I): PAPER- V: DEEP LEARNING TECHNIQUES LAB
SYLLABUS
LIST OF EXPERIMENTS (Using Python / R)
1. Implementation of Perceptron Learning Algorithm.
2. Implementation of, Derivation, Multi-layer Perceptron Learning
3. Implementation of Gradient Descent Learning,
4. Implementation of Least Mean Square learning,
5. Implementation of Widrow-Hoff Learning.
6. Implementation of Back-Propagation Algorithms.
7. Implementation of Hebbian Learning,
8. Implementation of Competitive learning
9. Implementation of Markov Decision Process,
10. Implementation of Hidden Markov Model,
11. Implementation of Convolutional Neural Networks,
12. Implementation of Recurrent Neural Networks,
13. Implementation of Long-Short Term Memory.

M.SC. (DATASCIENCE) III-SEMESTER

MDS-306 (LAB-2): PAPER- VI: CAPSTONE PROJECT PHASE -I

Project is of two-semesters duration (III and IV Semesters). Each student carries out a
project individually. A student with the help of the guide formulates the problem during the
III semester and implements it during the IV semester. A student can carry out the project at
industry for the entire IV Semester.
Note: Objectives, Outcomes and Guidelines are given under Capstone Project Phase-II.

Page 29 of 33
M.SC. (DATASCIENCE) III-SEMESTER
MDS-308 (LAB-3): PAPER- V: SEMINAR
(Guidelines For Seminar)

Objectives:
1. To familiarize tools and techniques and content for presentation
2. To enhance practical presentation, effective communication and professional skills
3. To expose the students to answer the queries raised on the topic of presentation.
4. To encourage students to work with innovative and entrepreneurial ideas

The objective of the seminar is to prepare the student for a systematic and independent
study of state-of-the-art topics in a broad area of his / her specialization. Seminar topics may
be chosen by the students with advice from the faculty members. Students are to be exposed
to following aspects of seminar presentations; Literature survey, Organization of material,
power point presentation, technical writing Each student will be required to
1. Each student has to give minimum two seminars.
2. Submit a one page synopsis of the seminar talk for display on the notice board.
3. Give a 20 minutes presentation through power point presentation followed by 10
minutes of discussion.
4. Submit a report on the seminar topic with list of references.
Seminars are to be scheduled from the 2nd week to the last week of the semester and
any change in schedule should be discouraged. The sessional marks will be awarded to the
students by at least 2 faculty members on the basis of an oral and a written presentation as
well as their involvement in the discussions.

Page 30 of 33
M.SC. (DATA SCIENCE) IV-SEMESTER
MDS/401: PAPER- I: RESEARCH METHODOLOGY
SYLLABUS

UNIT - I
Research Methodology: Objectives and Motivation of Research, Types of Research, research
approaches, Significance of Research, Research Methods Verses Methodology, Research
Process, Criteria of Good Research, Problems Encountered by Researchers in India, Benefits
to the society in general. Defining the Research Problem: Selection of Research Problem,
Necessity of Defining the Problem, Research ethics
Literature Survey: Importance and purpose of Literature Survey, Sources of Information,
Assessment of Quality of Journals and Articles, Need of Review, Guidelines for Review,
Record of Research Review.
UNIT - II
Research Design: Meaning of Research Design, Need of Research Design, Feature of a Good
Design, Important Concepts Related to Research Design, Different Research Designs
(exploratory, descriptive, experimental), Basic Principles of Experimental Design, Developing
a Research Plan, Steps in sample design, types of sample designs.
Report writing: Meaning of interpretation, layout of research report, Types of reports,
Mechanism of writing a report. Research Proposal Preparation: Writing a Research Proposal
and Research Report, Writing Research Grant Proposal. Paraphrasing & Plagiarism. Process of
Writing a research paper; Indexing, Citation of sources; Writing first draft of thesis, Revising
/ Editing - The final draft and proof reading; Research Paper Publication: Reputed Journals –
National/International – ISSN No, No. of volumes, Scopus Index/UGC Journals – Free
publications;
REFERENCE BOOKS
1. C.R Kothari, Research Methodology, Methods & Technique‖; New Age International
Publishers, 2004
2. R. Ganesan, Research Methodology for Engineers‖, MJP Publishers, 2011
3. Y.P. Agarwal, Statistical Methods: Concepts, Application and Computation‖, Sterling
Publications Pvt. Ltd., New Delhi, 2004.

Page 31 of 33
M.SC. (DATA SCIENCE) IV-SEMESTER
MDS-402: PAPER- II: CAPSTONE PROJECT PHASE -II
(Guidelines continuation of Phase-I of III-Semester)

Objectives:
1. To enhance practical and professional skills.
2. To familiarize tools and techniques of systematic Literature survey and documentation
3. To expose the students to industry practices and team work.
4. To encourage students to work with innovative and entrepreneurial ideas
Outcomes: Student will be able to:
1. Demonstrate the ability to synthesize and apply the knowledge and skills
acquired in theacademic program to real-world problems
2. Evaluate different solutions based on economic and technical feasibility
3. Effectively plan a project and confidently perform all aspects of project management
4. Demonstrate effective written and oral communication skills
Project is of 2-semester duration (III and IV Semesters). Each student carries out a
project individually. A student with the help of the guide formulates the problem during the
III semester and implements it during the IV semester. A student can carry out the project at
industry for the entire IV Semester. The aim of project work is to develop solutions to realistic
problems (related to Data Science) applying the knowledge and skills obtained in different
courses, new technologies and current industry practices.
1. The institution / Department shall network with the industry to get internships for
students.
2. The department will appoint a project coordinator and Internal supervisor who will
coordinate the following:
• Collection of project topics/ descriptions from faculty members (Problems can
also beinvited from the industries allocating the students to industry for full period
of 4-5 months)
• Allotment of projects and project guides to students
• Conducting seminar presentations (minimum three)
• Coordinating viva-voce exam
The above tasks should be completed within the first two weeks of III semester.
To get awareness on current problems and solution techniques, Project coordinator
shall arrange special lectures during the first 2 weeks of III semester by inviting faculty

Page 32 of 33
members, professionals from industries and R&D institutions. Further, these lectures may be
conducted anytime during the semester to enable the students to gather information on
problems and industry practices. At the end of 2nd week, each student with the help of guide
shall formalise the project proposal with problem definition, scope, literature survey, probable
solution etc. The coordinator shall prepare seminar schedule for all the students (batch wise)
from the 5th week to the last week of the semester which should be strictly adhered to.
The coordinator will prepare seminar schedule for all the students from the 5th
week to the lastweek of the semester, which should be strictly adhered to.
Each student will be required to:
1. Submit a one-page synopsis before the seminar for display on notice board.
2. Give a 30 minutes presentation followed by 10 minutes discussion.
3. Submit a technical write-up on the talk.

Page 33 of 33

M.sc Data Science23,24 Syllabus (1)
No ratings yet
M.sc Data Science23,24 Syllabus (1)
31 pages
5. M.sc. Data Science Syllabus I and II Sem Final
No ratings yet
5. M.sc. Data Science Syllabus I and II Sem Final
21 pages
Syllabuspg
No ratings yet
Syllabuspg
62 pages
IV Sem BTech Data Science230222083656
No ratings yet
IV Sem BTech Data Science230222083656
10 pages
MTech DS Curriculum 2023
No ratings yet
MTech DS Curriculum 2023
16 pages
Course Title Course Number
No ratings yet
Course Title Course Number
15 pages
IV Sem BTech AIML230222083345
No ratings yet
IV Sem BTech AIML230222083345
10 pages
Curriculum and Syllabus For The PG Course Executive Mtech Programme in Artificial Intelligence & Data Science
No ratings yet
Curriculum and Syllabus For The PG Course Executive Mtech Programme in Artificial Intelligence & Data Science
20 pages
Detailed Syllabus M. Tech (CS)
No ratings yet
Detailed Syllabus M. Tech (CS)
24 pages
Mtechds 2021
No ratings yet
Mtechds 2021
17 pages
CSE R22 08 NOV 2022-Course Structure
No ratings yet
CSE R22 08 NOV 2022-Course Structure
52 pages
MT Cse Ai
No ratings yet
MT Cse Ai
8 pages
msc-syllabus-2017-18_21
No ratings yet
msc-syllabus-2017-18_21
11 pages
ME Computer Science and Engineering - Syllabi
No ratings yet
ME Computer Science and Engineering - Syllabi
17 pages
R 2008 Cse Syllabus
No ratings yet
R 2008 Cse Syllabus
89 pages
Syllabus New Data Science (See Page 32)
No ratings yet
Syllabus New Data Science (See Page 32)
34 pages
M.SC (Data Science) 28.02.2018
No ratings yet
M.SC (Data Science) 28.02.2018
16 pages
53_Information Technology_UG
No ratings yet
53_Information Technology_UG
7 pages
(ME Computer Science and Engineering) : Structure of Post Graduate
No ratings yet
(ME Computer Science and Engineering) : Structure of Post Graduate
43 pages
Curriculum MTechDE
No ratings yet
Curriculum MTechDE
61 pages
Courses Scheme & Syllabus: M.E. (Software Engineering)
No ratings yet
Courses Scheme & Syllabus: M.E. (Software Engineering)
48 pages
B.Tech. CSE Syllabus 3rd-4th
No ratings yet
B.Tech. CSE Syllabus 3rd-4th
26 pages
MTech AI Curriculam
No ratings yet
MTech AI Curriculam
20 pages
ME
No ratings yet
ME
37 pages
PG M.E CSE CC With Syllabus 27.12.2024
No ratings yet
PG M.E CSE CC With Syllabus 27.12.2024
157 pages
R22M.tech - CSE Syllabus
No ratings yet
R22M.tech - CSE Syllabus
50 pages
Syllabus
No ratings yet
Syllabus
16 pages
Screenshot 2024-12-08 at 1.23.08 pm
No ratings yet
Screenshot 2024-12-08 at 1.23.08 pm
72 pages
CSE Courses Details
No ratings yet
CSE Courses Details
123 pages
5th Sem Syllabi
No ratings yet
5th Sem Syllabi
12 pages
CSD - III & IV Sem MR-21 Syllabus
No ratings yet
CSD - III & IV Sem MR-21 Syllabus
48 pages
2nd Year Cirriculum
No ratings yet
2nd Year Cirriculum
11 pages
PG Diploma in Data Analytics2024
No ratings yet
PG Diploma in Data Analytics2024
15 pages
R22M.Tech - CSE CSSyllabus
No ratings yet
R22M.Tech - CSE CSSyllabus
49 pages
MCA Syllabus
0% (2)
MCA Syllabus
45 pages
ME Computer Engineering Syllabus
No ratings yet
ME Computer Engineering Syllabus
37 pages
Course Structure & Syllabus of Master of Technology (M.Tech)
No ratings yet
Course Structure & Syllabus of Master of Technology (M.Tech)
43 pages
Cse (Ai & ML)
No ratings yet
Cse (Ai & ML)
41 pages
R22M.Tech - CSE - IYear - I Sem - Syllabus
No ratings yet
R22M.Tech - CSE - IYear - I Sem - Syllabus
12 pages
MSC Computer Science Updated On 12 06 2017 PDF
No ratings yet
MSC Computer Science Updated On 12 06 2017 PDF
33 pages
J 8707 Paper Ii
No ratings yet
J 8707 Paper Ii
33 pages
Jawaharlal Nehru Technological University Anantapur: (9A05301) Mathematical Foundations of Computer Science
No ratings yet
Jawaharlal Nehru Technological University Anantapur: (9A05301) Mathematical Foundations of Computer Science
3 pages
MTech_CSE_Syllabus_June2019
No ratings yet
MTech_CSE_Syllabus_June2019
60 pages
MR22 3-1 and 3-2
No ratings yet
MR22 3-1 and 3-2
68 pages
M.tech. Computer Science & Engineering (Effective From The Session - 2016-17)
No ratings yet
M.tech. Computer Science & Engineering (Effective From The Session - 2016-17)
39 pages
Mca 6
No ratings yet
Mca 6
42 pages
MSCCS Merged Merged
No ratings yet
MSCCS Merged Merged
57 pages
M.tech Computer Science and Engineering Full Sylabi
No ratings yet
M.tech Computer Science and Engineering Full Sylabi
78 pages
Network Engineering 05 It 60xx
No ratings yet
Network Engineering 05 It 60xx
62 pages
67-CSD-528
No ratings yet
67-CSD-528
76 pages
St. Peter's University: 2008 Scheme of Examinations
No ratings yet
St. Peter's University: 2008 Scheme of Examinations
50 pages
Course Code:22DS102
No ratings yet
Course Code:22DS102
2 pages
SYLLABUS
No ratings yet
SYLLABUS
13 pages
frmCourseSyllabusIPDownload (13)
No ratings yet
frmCourseSyllabusIPDownload (13)
2 pages
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
Data Science, AI, and Blockchain: Integrated Approaches
From Everand
Data Science, AI, and Blockchain: Integrated Approaches
Ekaaksh Deshpande
No ratings yet
Mastering Vector Databases: The Future of Data Retrieval and AI
From Everand
Mastering Vector Databases: The Future of Data Retrieval and AI
Robert Johnson
No ratings yet
Data Science Unveiled: A Practical Guide to Key Techniques
From Everand
Data Science Unveiled: A Practical Guide to Key Techniques
Ed A Norex
No ratings yet
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Cloud-Based Multi-Modal Information Analytics
From Everand
Cloud-Based Multi-Modal Information Analytics
Tanushri Kaniyar
No ratings yet
Numerical Linear Algebra by SUNDARAPANDIANV
No ratings yet
Numerical Linear Algebra by SUNDARAPANDIANV
3 pages
DM3
No ratings yet
DM3
3 pages
408 Note
No ratings yet
408 Note
57 pages
Point Cloud Densification
No ratings yet
Point Cloud Densification
72 pages
Zam 05 A
No ratings yet
Zam 05 A
2 pages
Linear Algebra Done Right Third Edition 3rd Edition Sheldon Axler download
No ratings yet
Linear Algebra Done Right Third Edition 3rd Edition Sheldon Axler download
76 pages
Analysis of ECG Signals by Dynamic Mode Decomposition
No ratings yet
Analysis of ECG Signals by Dynamic Mode Decomposition
12 pages
Ex 11
No ratings yet
Ex 11
11 pages
8 - The Singular Value Decomposition: Cmda 3606 Mark Embree
No ratings yet
8 - The Singular Value Decomposition: Cmda 3606 Mark Embree
24 pages
A Literature Survey On Various Approaches On Content Based Image Search
No ratings yet
A Literature Survey On Various Approaches On Content Based Image Search
6 pages
Zacharaki - A Digital Subtraction
No ratings yet
Zacharaki - A Digital Subtraction
14 pages
Applied Mechmatreianics of Solids (A.f
No ratings yet
Applied Mechmatreianics of Solids (A.f
22 pages
ml notes question bank exstraction from notes
No ratings yet
ml notes question bank exstraction from notes
163 pages
Jordan's Principal Angles in Complex Vector Spaces: A. Gal Antai and Cs. J. Heged Us
No ratings yet
Jordan's Principal Angles in Complex Vector Spaces: A. Gal Antai and Cs. J. Heged Us
10 pages
K K Singh
No ratings yet
K K Singh
18 pages
Quantitative Mueller Matrix Polarimetry With Diverse Applications
No ratings yet
Quantitative Mueller Matrix Polarimetry With Diverse Applications
41 pages
Gmres Fom Versus QMR Bicg
No ratings yet
Gmres Fom Versus QMR Bicg
24 pages
Lyapunov Exponents, Barreira
No ratings yet
Lyapunov Exponents, Barreira
273 pages
CS168: The Modern Algorithmic Toolbox Lecture #9: The Singular Value Decomposition (SVD) and Low-Rank Matrix Approximations
No ratings yet
CS168: The Modern Algorithmic Toolbox Lecture #9: The Singular Value Decomposition (SVD) and Low-Rank Matrix Approximations
11 pages
MATLAB Linear Algebra
100% (3)
MATLAB Linear Algebra
9 pages
Bourlard, Kamp - 1988 - Auto-Association by Multilayer Perceptrons and Singular Value Decomposition
No ratings yet
Bourlard, Kamp - 1988 - Auto-Association by Multilayer Perceptrons and Singular Value Decomposition
4 pages
Survey of Clustering Data Mining Techniques: Pavel Berkhin
100% (1)
Survey of Clustering Data Mining Techniques: Pavel Berkhin
56 pages
1 s2.0 S0378437116303144 Main
No ratings yet
1 s2.0 S0378437116303144 Main
11 pages
Performance Analysis of Downlink NOMA Multi-User With Multiple Antennas Beamfoming
No ratings yet
Performance Analysis of Downlink NOMA Multi-User With Multiple Antennas Beamfoming
6 pages
Students' Course Results Prediction Based On Data Processing and Machine Learning Methods
No ratings yet
Students' Course Results Prediction Based On Data Processing and Machine Learning Methods
13 pages
Random Matrix Theory
No ratings yet
Random Matrix Theory
65 pages
Instant ebooks textbook Fundamentals of Linear Algebra and Optimization Jean Gallier And Jocelyn Quaintance download all chapters
100% (1)
Instant ebooks textbook Fundamentals of Linear Algebra and Optimization Jean Gallier And Jocelyn Quaintance download all chapters
55 pages
Nash J.C. Compact Numerical Methods for Computers.. Lin. Algebra and Function Minimisation (2ed., IOP, 1990)(288s)_MN_ (1)
No ratings yet
Nash J.C. Compact Numerical Methods for Computers.. Lin. Algebra and Function Minimisation (2ed., IOP, 1990)(288s)_MN_ (1)
288 pages
Assignment1 2014
No ratings yet
Assignment1 2014
2 pages
Download Full Numerical methods for inverse problems 1st Edition Kern PDF All Chapters
100% (4)
Download Full Numerical methods for inverse problems 1st Edition Kern PDF All Chapters
61 pages