Data Science Using Python
Module 1: Introduction to Data Science
o What is Data Science?
o What is Machine Learning?
o What is Deep Learning?
o What is AI?
o Data Analytics & its types
Module 2: Introduction to Python
o What is Python?
o Why Python?
o Installing Python
o Python IDEs
o Jupyter Notebook Overview
Module 3: Python Basics
o Basic Python Data Types
o Lists
o Slicing
o IF statements
o Loops
o Dictionaries
o Tuples
o Functions
o Arrays
o Selection by position & Labels
Module 4: Python Packages
o Pandas
o Data Manipulation
o NumPy
o scikit-learn
o Matplotlib
Module 5: Importing data
o Reading CSV files
o Saving Python data objects
o Loading Python data objects
o Writing data to a CSV file
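
The Module 5 topics can be sketched with the Python standard library alone. This is a minimal illustration, not the course's own material: the sample rows, the file names scores.csv and scores.pkl, and the temporary directory are all assumptions made for the example.

```python
import csv
import pickle
import tempfile
from pathlib import Path

# Hypothetical sample rows, used only for illustration.
rows = [
    {"name": "Asha", "score": "91"},
    {"name": "Ben", "score": "78"},
]

# Work in a temporary directory so the example leaves no files behind in the project.
workdir = Path(tempfile.mkdtemp())
csv_path = workdir / "scores.csv"
pkl_path = workdir / "scores.pkl"

# Writing data to a CSV file.
with open(csv_path, "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=["name", "score"])
    writer.writeheader()
    writer.writerows(rows)

# Reading the CSV file back; csv returns every value as a string.
with open(csv_path, newline="", encoding="utf-8") as f:
    loaded_rows = list(csv.DictReader(f))

# Saving a Python data object with pickle.
with open(pkl_path, "wb") as f:
    pickle.dump(loaded_rows, f)

# Loading the Python data object again.
with open(pkl_path, "rb") as f:
    restored_rows = pickle.load(f)

print(restored_rows == rows)
```

The same round trip is commonly done with pandas (read_csv and DataFrame.to_csv) once Module 4 has been covered; the standard-library version is shown here only because it needs no installation.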
Module 7: Statistics Basics
o Central Tendency
    Mean
    Median
    Mode
o Normal Distribution
o Probability and Inferential Statistics-I
o Probability and Inferential Statistics-II
o Basics of Linear Algebra
o Gradient Descent
o Calculus for Data Science
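
Two of the topics in this module, central tendency and gradient descent, can be sketched in a few lines. The sample data, the function name gradient_descent, the learning rate, and the toy objective f(x) = (x - 3)^2 are assumptions chosen for illustration, not part of the syllabus.

```python
import statistics

# Hypothetical sample, used only for illustration.
data = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(data)      # arithmetic average
median_value = statistics.median(data)  # middle of the sorted values
mode_value = statistics.mode(data)      # most frequent value


def gradient_descent(grad, start, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient to approach a minimum."""
    x = start
    for _ in range(steps):
        x -= learning_rate * grad(x)
    return x


# Minimise f(x) = (x - 3)^2, whose derivative is 2 * (x - 3).
minimum = gradient_descent(lambda x: 2 * (x - 3), start=0.0)

print(mean_value, median_value, mode_value)
print(round(minimum, 4))
```

With a learning rate of 0.1 each step shrinks the distance to the minimum by a factor of 0.8, so the estimate settles close to x = 3; a much larger rate can make the same loop diverge.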
Module 8: Machine Learning I (Supervised Learning)
o Introduction to Linear Regression
o Linear Regression-Important Terms
o Regression: Statistical View
o Linear Regression
o Multiple Linear Regression
o Real Time Example
o Logistic Regression
o Hypothesis testing
o Introduction to Supervised Learning
o Introduction to data and data distribution
o Data Visualization
o Introductory Concepts of Supervised Learning
o Data Pre-processing
o Data Transformation
o Data Reduction and Multivariate Analysis
o Feature Extraction
o Introduction to Classification and regression
o Classification of Supervised Algorithms
o Bias Variance Trade-off
o Under-fitting and Over-fitting
o Evaluation Methods: Hold Out Method
o Evaluation Methods: Cross-validation & Bootstrapping Method
o Introduction to Performance Metrics
o Application of Performance Metrics I & II
o Performance Metrics for Regression
o Introduction to Naïve Bayes
o Naïve Bayes Classifier for Labelled Data Type
o Naïve Bayes Classifier for Mixed Data Types
o Naïve Bayes Classifier
o K Nearest Neighbours
o Introduction to Support Vector Machine
    SVM for Linearly Separable Problems
    SVM for Non-Linearly Separable Problems
o Decision Tree I & II
o Random Forest
o Introduction to Random Forest Classifier
o Random Forest Technique and Construction
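
The performance metrics listed in this module can be sketched for a binary classifier as follows. The labels, the predictions, and the helper name confusion_counts are hypothetical; in practice scikit-learn offers equivalent functions, but plain Python keeps the arithmetic visible.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary classification problem."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Hypothetical true labels and model predictions.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)

accuracy = (tp + tn) / len(y_true)  # share of all predictions that are correct
precision = tp / (tp + fp)          # share of predicted positives that are correct
recall = tp / (tp + fn)             # share of actual positives that are found

print(tp, fp, fn, tn)
print(accuracy, precision, recall)
```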
Module 9: Machine Learning II (Unsupervised Learning)
o Introduction to Clustering
o Evaluation of Clustering
o Proximity Measures in Machine Learning
o Understanding Clustering Types
o Hierarchical Clustering
o DBSCAN
o Anomaly Detection
o K-Means Clustering
o Association Rule Mining
o Principal Component Analysis
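
As a rough illustration of the K-Means Clustering topic, the sketch below clusters one-dimensional points. The function name k_means_1d, the sample points, and the fixed starting centroids are assumptions; library implementations choose starting centroids more carefully and work in any number of dimensions.

```python
def k_means_1d(points, centroids, iterations=10):
    """Alternate between assigning points and recomputing centroids."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid moves to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical data with two visible groups.
points = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
centroids, clusters = k_means_1d(points, centroids=[1.0, 2.0])

print(centroids)
print(clusters)
```

Even from poorly placed starting centroids, the two steps pull one centroid toward each group within a few iterations on this data.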