0% found this document useful (0 votes)

50 views

Busiess Analytics Data Mining Lecture 7

This document discusses predictive modeling and various machine learning techniques. It introduces neural networks and their basic elements like processing elements, network architecture, and information processing. Common neural network types like feedforward and recurrent networks are described. The document also discusses supervised learning using backpropagation and evaluating neural network models. Finally, it introduces other techniques like support vector machines and their use of kernels and hyperplanes for classification.

Uploaded by

utkarsh bhargava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

Busiess Analytics Data Mining Lecture 7

Uploaded by

utkarsh bhargava

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

Business analytics

Lecture 7- Predictive modelling

Why is it important to study
medical procedures
▪Clinical decision support systems that
use the outcome of data mining
studies can support healthcare
managers and/or medical
professionals in making accurate and
timely decisions to optimally allocate
resources in order to increase the
quantity and quality of medical
services
▪Healthcare systems Effectiveness is
probably more of a clinical concern,
while efficiency is more of a
managerial concern.
▪Clinical decision support systems that
use the outcome of data mining
studies are shown to be useful and
reasonably accurate predictors,
especially if used in combination
Neural Network Concepts
▪Neural networks (NN): a brain metaphor for
information processing
▪Neural computing
▪Artificial neural network (ANN)
▪Many uses for ANN for
▪ pattern recognition, forecasting, prediction, and
classification
▪Many application areas
▪ finance, marketing, manufacturing, operations,
information systems, and so on
Biological Neural Networks

▪Two interconnected brain cells (neurons)

Processing Information in ANN
Inputs Weights Outputs

x1
w1 Y1

x2 w2 Neuron (or PE) f (S )

. S = 
n
X iW
Y
. Y2
. i =1
i

.
. Summation
Transfer
.
Function
wn Yn
xn

▪A single neuron (processing element – PE) with

inputs and outputs
Biology Analogy
Elements of ANN
▪Processing element (PE)
▪Network architecture
▪Hidden layers
▪Parallel processing
▪Network information processing
▪Inputs
▪Outputs
▪Connection weights
▪Summation function
Elements of ANN

Neural Network with

One Hidden Layer
Elements of ANN
(a) Single neuron (b) Multiple neurons

x1 x1 w11 (PE) Y1
w1
w21
(PE) Y

w1 w12
x2 Y = X 1W1 + X 2W2
x2 w22 (PE) Y2
PE: Processing Element (or neuron)

Y1 = X1W11 + X 2W21
Summation Function for a Single w23
Y2 = X1W12 + X2W22
Neuron (a), and
Y3 = X 2W 23 (PE) Y3
Several Neurons (b)
Elements of ANN
▪Transformation (Transfer) Function
▪ Linear function
▪ Sigmoid (logical activation) function [0 1]
▪ Tangent Hyperbolic function [-1 1]

Summation function: Y = 3(0.2) + 1(0.4) + 2(0.1) = 1.2

X1 = 3 Transfer function: YT = 1/(1 + e-1.2) = 0.77
W
1 =0
.2

W2 = 0.4 Processing Y = 1.2

X2 = 1 YT = 0.77
element (PE)
.1
3
=0
W

X3 = 2
❖ Threshold value?
Neural Network Architectures
▪Architecture of a neural network is driven by
the task it is intended to address
▪ Classification, regression, clustering, general
optimization, association, ….
▪Most popular architecture: Feedforward, multi-
layered perceptron with backpropagation
learning algorithm
▪ Used for both classification and regression type
problems
▪Others – Recurrent, self-organizing feature
maps, Hopfield networks, …
Neural Network Architectures
Feed-Forward Neural Networks
Feed-forward MLP with 1 Hidden Layer
Neural Network Architectures
Recurrent Neural Networks
Other Popular ANN Paradigms
Self-Organizing Maps (SOM)

Input 1 ▪ First introduced

by the Finnish
Professor Teuvo
Input 2
Kohonen
▪ Applies to
clustering type
problems
Input 3
Development Process of an ANN
An MLP ANN Structure for
the Box-Office Prediction Problem
Testing a Trained ANN Model
▪Data is split into three parts
▪Training (~60%)
▪Validation (~20%)
▪Testing (~20%)

▪k-fold cross validation

▪Less bias
▪Time consuming
AN Learning Process
A Supervised Learning Process
ANN
Model
Three-step process:
1. Compute temporary
Compute
output outputs.
2. Compare outputs with
desired targets.
3. Adjust the weights and
Is desired
Adjust
weights
No
output repeat the process.
achieved?

Yes

Stop
learning
Backpropagation Learning
a(Zi – Yi)
x1 error
w1

x2 w2 Neuron (or PE) f (S )

Y = f (S )
. S = 
n
X iW i
Yi
. i =1

. Summation
Transfer
Function
wn
xn

▪Backpropagation of Error for a Single Neuron

Backpropagation Learning
▪The learning algorithm procedure
1. Initialize weights with random values and set other
network parameters
2. Read in the inputs and the desired outputs
3. Compute the actual output (by working forward
through the layers)
4. Compute the error (difference between the actual and
desired output)
5. Change the weights by working backward through the
hidden layers
6. Repeat steps 2-5 until weights stabilize
Illuminating The Black Box
Sensitivity Analysis on ANN
▪A common criticism for ANN: The lack of
transparency/explainability
▪The black-box syndrome!
▪Answer: sensitivity analysis
▪Conducted on a trained ANN
▪The inputs are perturbed while the relative
change on the output is measured /
recorded
▪Results illustrate the relative importance of
input variables
Sensitivity Analysis on ANN
Models
Systematically Trained ANN
Perturbed “the black-box” Observed
Inputs Change in
Outputs

▪For a good example, see Application Case 6.3

▪ Sensitivity analysis reveals the most important injury
severity factors in traffic accidents
Support Vector Machines (SVM)
▪SVM are among the most popular machine-
learning techniques.
▪SVM belong to the family of generalized
linear models… (capable of representing
non-linear relationships in a linear fashion).
▪SVM achieve a classification or regression
decision based on the value of the linear
combination of input features.
▪Because of their architectural similarities,
SVM are also closely associated with ANN.
Support Vector Machines (SVM)
▪Goal of SVM: to generate mathematical
functions that map input variables to desired
outputs for classification or regression type
prediction problems.
▪ First, SVM uses nonlinear kernel functions to
transform non-linear relationships among the
variables into linearly separable feature spaces.
▪ Then, the maximum-margin hyperplanes are
constructed to optimally separate different classes
from each other based on the training dataset.
▪SVM has solid mathematical foundation!
Support Vector Machines (SVM)
▪A hyperplane is a geometric concept used to
describe the separation surface between
different classes of things.
▪ In SVM, two parallel hyperplanes are constructed on
each side of the separation space with the aim of
maximizing the distance between them.
▪A kernel function in SVM uses the kernel trick
(a method for using a linear classifier algorithm
to solve a nonlinear problem)
▪ The most commonly used kernel function is the radial
basis function (RBF).
Support Vector Machines (SVM)
L1

M
X2 X2

ar
gi
L2

n
e
an
L3

l
rp
pe
hy
n
gi
ar
-m
um
im
ax
M
X1 X1

➢ Many linear classifiers (hyperplanes) may separate the data

Application Case 6.4
Managing Student Retention with
Predictive Modeling
Questions for Discussion
1. Why is attrition one of the most important issues in
higher education?
2. How can predictive analytics (ANN, SVM, and so
forth) be used to better manage student retention?
3. What are the main challenges and potential
solutions to the use of analytics in retention
management?
Application
Case 6.4

Managing Student
Retention with
Predictive Modeling
How Does an SVM Work?
▪Following a machine-learning process, an
SVM learns from the historic cases.
▪The Process of Building SVM
1. Preprocess the data
▪ Scrub and transform the data.
2. Develop the model.
▪ Select the kernel type (RBF is often a natural choice---Radial Basis
function)
▪ Determine the kernel parameters for the selected kernel type.
▪ If the results are satisfactory, finalize the model; otherwise change
the kernel type and/or kernel parameters to achieve the desired
accuracy level.
3. Extract and deploy the model.
The Process of Building an SVM
Pre-Process the Data
Training
ü Scrub the data
data
“Identify and handle missing,
incorrect, and noisy”
ü Transform the data
“Numerisize, normalize and
standardize the data”

Pre-processed data

Develop the Model

Experimentation
ü Select the kernel type “Training/Testing”
“Choose from RBF, Sigmoid
or Polynomial kernel types”
ü Determine the kernel values
“Use v-fold cross validation or
employ ‘grid-search’”

Validated SVM model

Deploy the Model

Prediction
ü Extract the model coefficients Model
ü Code the trained model into
the decision support system
ü Monitor and maintain the
model
SVM Applications
▪SVMs are the most widely used kernel-learning
algorithms for wide range of classification and
regression problems
▪SVMs represent the state-of-the-art by virtue of
their excellent generalization performance, superior
prediction power, ease of use, and rigorous
theoretical foundation
▪Most comparative studies show its superiority in
both regression and classification type prediction
problems.
▪SVM versus ANN?
k-Nearest Neighbor Method (k-NN)
▪ANNs and SVMs → time-demanding,
computationally intensive iterative derivations
▪k-NN is a simplistic and logical prediction
method, that produces very competitive results
▪k-NN is a prediction method for classification
as well as regression types (similar to ANN &
SVM)
▪k-NN is a type of instance-based learning (or
lazy learning) – most of the work takes place at
the time of prediction (not at modeling)
▪k : the number of neighbors used
k-Nearest Neighbor Method (k-NN)
Y

k=3

k=5
Yi

The answer depends on

the value of k

Xi X
The Process of k-NN Method

Training Set
Parameter Setting

Historic Data ü Distance measure

ü Value of “k”

Validation Set

Predicting
Classify (or Forecast)
new cases using k
number of most
similar cases

New Data
k-NN Model Parameter
1. Similarity Measure: The Distance Metric

▪Numeric versus nominal values?

k-NN Model Parameter
2. Number of Neighbors (the value of k)
▪The best value depends on the data
▪Larger values reduce the effect of noise but
also make boundaries between classes less
distinct
▪An “optimal” value can be found heuristically
▪Cross Validation is often used to determine
the best value for k and the distance measure

Operations Research I: Dr. Bill Corley
No ratings yet
Operations Research I: Dr. Bill Corley
147 pages
BFS and DFS
75% (4)
BFS and DFS
22 pages
Predictive Modeling BI 4
No ratings yet
Predictive Modeling BI 4
28 pages
Sharda dss10 PPT 06
No ratings yet
Sharda dss10 PPT 06
43 pages
Lecture 4-Machine Learning Applications
No ratings yet
Lecture 4-Machine Learning Applications
52 pages
Chapter 05 - In Class
No ratings yet
Chapter 05 - In Class
39 pages
Chapter 05 - Sharda 11e Full Accessible PPT 05
No ratings yet
Chapter 05 - Sharda 11e Full Accessible PPT 05
31 pages
Week 5 Slides
No ratings yet
Week 5 Slides
25 pages
Chapter 9. Classification: Advanced Methods
No ratings yet
Chapter 9. Classification: Advanced Methods
39 pages
Lecture_3_Machine_learning_Techniques_For_Predictive_Analytics
No ratings yet
Lecture_3_Machine_learning_Techniques_For_Predictive_Analytics
40 pages
Machine-Learning Techniques for Predictive Analytics
No ratings yet
Machine-Learning Techniques for Predictive Analytics
53 pages
Sharda dss10 PPT 06
No ratings yet
Sharda dss10 PPT 06
48 pages
Data analysis ch1
No ratings yet
Data analysis ch1
13 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
3 pages
12 Advanced Machine Learning Algorithms
No ratings yet
12 Advanced Machine Learning Algorithms
41 pages
IIS Lecture 3
No ratings yet
IIS Lecture 3
21 pages
Techniques For Predictive Modeling: Learning Objectives For Chapter 6
No ratings yet
Techniques For Predictive Modeling: Learning Objectives For Chapter 6
19 pages
06-Classification_Part2
No ratings yet
06-Classification_Part2
34 pages
Stock Trend Prediction With Neural Network Techniques
No ratings yet
Stock Trend Prediction With Neural Network Techniques
61 pages
DWDM
No ratings yet
DWDM
20 pages
This Is
No ratings yet
This Is
7 pages
Ann PDF
No ratings yet
Ann PDF
129 pages
SVM, Neural Network and Random Forest in R
No ratings yet
SVM, Neural Network and Random Forest in R
45 pages
Week 3 - Demand Forecasting by Artificial Neural Networks
No ratings yet
Week 3 - Demand Forecasting by Artificial Neural Networks
19 pages
Unec 1705121586
No ratings yet
Unec 1705121586
33 pages
1
No ratings yet
1
42 pages
Machine Learning Concept1
No ratings yet
Machine Learning Concept1
16 pages
A Seminar Report On NEURAL NETWORK PDF
No ratings yet
A Seminar Report On NEURAL NETWORK PDF
26 pages
AIYA SESSION 4
No ratings yet
AIYA SESSION 4
42 pages
Artificial Neural Networks in Bi: Information System Dept ITS Surabaya 2009
No ratings yet
Artificial Neural Networks in Bi: Information System Dept ITS Surabaya 2009
42 pages
Questions and Answers
No ratings yet
Questions and Answers
33 pages
Ml2 Script v2
No ratings yet
Ml2 Script v2
123 pages
Unit Iv DM
No ratings yet
Unit Iv DM
58 pages
Topic: Machine Learning
No ratings yet
Topic: Machine Learning
35 pages
AP for NLP-LO2
No ratings yet
AP for NLP-LO2
38 pages
Machine Learning-Gkouzionis
No ratings yet
Machine Learning-Gkouzionis
14 pages
aimlmid2notes
No ratings yet
aimlmid2notes
4 pages
Deep Learning
No ratings yet
Deep Learning
13 pages
DSS08 - CLS-ANN, SVM, Ensemble-Vn
No ratings yet
DSS08 - CLS-ANN, SVM, Ensemble-Vn
44 pages
Deep Learning
No ratings yet
Deep Learning
68 pages
Deep Learning 1
No ratings yet
Deep Learning 1
48 pages
DWDM Rit-E22 Unit4
No ratings yet
DWDM Rit-E22 Unit4
139 pages
Summer of Science-Final Report
100% (1)
Summer of Science-Final Report
7 pages
Neural Network
No ratings yet
Neural Network
58 pages
Lecture 9
No ratings yet
Lecture 9
27 pages
CV Lec5
No ratings yet
CV Lec5
54 pages
ML RUSA Module 6 Probablistic EM KNN SVM
No ratings yet
ML RUSA Module 6 Probablistic EM KNN SVM
51 pages
lec08_Classification_kNN_ANN
No ratings yet
lec08_Classification_kNN_ANN
39 pages
Machine Learning - Unit - 1
100% (1)
Machine Learning - Unit - 1
58 pages
Prediction in Data Mining
No ratings yet
Prediction in Data Mining
12 pages
MachineLearning Lecture 2
No ratings yet
MachineLearning Lecture 2
23 pages
ML 7th Sem AIML ITE Notes Complete LONG
No ratings yet
ML 7th Sem AIML ITE Notes Complete LONG
202 pages
Term Paper: Dept of CSE, GMRIT
No ratings yet
Term Paper: Dept of CSE, GMRIT
16 pages
Business Intelligence and Decision Support Systems (9 Ed., Prentice Hall)
No ratings yet
Business Intelligence and Decision Support Systems (9 Ed., Prentice Hall)
41 pages
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
No ratings yet
Sales Forecasting Using Kernel Based Support Vector Machine Algorithm
6 pages
Super Cheatsheet Machine Learning
100% (1)
Super Cheatsheet Machine Learning
15 pages
CS215 LectureSlidesSet2 IntroductionToMachineLearning AI
No ratings yet
CS215 LectureSlidesSet2 IntroductionToMachineLearning AI
112 pages
UNIT 1,2,3
No ratings yet
UNIT 1,2,3
17 pages
MLSM Lecture1 050923
No ratings yet
MLSM Lecture1 050923
37 pages
Artificial Neural Network Bao
No ratings yet
Artificial Neural Network Bao
26 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Mathematics 1St First Order Linear Differential Equations 2Nd Second Order Linear Differential Equations Laplace Fourier Bessel Mathematics
From Everand
Mathematics 1St First Order Linear Differential Equations 2Nd Second Order Linear Differential Equations Laplace Fourier Bessel Mathematics
Andrew Igla
No ratings yet
Busiess Analytics Data Modeling Lecture 2
No ratings yet
Busiess Analytics Data Modeling Lecture 2
24 pages
Rebalancing
No ratings yet
Rebalancing
4 pages
Busiess Analytics Data Mining Lecture 3
No ratings yet
Busiess Analytics Data Mining Lecture 3
52 pages
Flipkart Wired PDF
No ratings yet
Flipkart Wired PDF
4 pages
A Whole New Ball Game: Navigating Digital Change in The Sports Industry
No ratings yet
A Whole New Ball Game: Navigating Digital Change in The Sports Industry
8 pages
Sbi Po Detailed Syllabus 2020: Useful Links
No ratings yet
Sbi Po Detailed Syllabus 2020: Useful Links
13 pages
Instructor - Dr. Preeti Khanna (Only For Academic Purpose)
No ratings yet
Instructor - Dr. Preeti Khanna (Only For Academic Purpose)
21 pages
AD-Times Amazon - Updated
No ratings yet
AD-Times Amazon - Updated
9 pages
Psychological Impact of COVID-19 Pandemic On General Population in West Bengal: A Cross-Sectional Study
No ratings yet
Psychological Impact of COVID-19 Pandemic On General Population in West Bengal: A Cross-Sectional Study
7 pages
Name Email Roll Number Contact Number Mentor Name Todo: Duplicate This File, Upload It On Your Drive and Share The Link With Your Mentor
No ratings yet
Name Email Roll Number Contact Number Mentor Name Todo: Duplicate This File, Upload It On Your Drive and Share The Link With Your Mentor
5 pages
Study Id67390 Logistics-Industry-Worldwide
No ratings yet
Study Id67390 Logistics-Industry-Worldwide
62 pages
The Effects of Covid-19 and Its Psychological Impact On People From Different Strata in India
No ratings yet
The Effects of Covid-19 and Its Psychological Impact On People From Different Strata in India
6 pages
Holistic Segmentation: Clive Brand and Sue Jarvis Source: Admap Magazine, June 1998
No ratings yet
Holistic Segmentation: Clive Brand and Sue Jarvis Source: Admap Magazine, June 1998
8 pages
Study Id70013 Tiktok
No ratings yet
Study Id70013 Tiktok
114 pages
Handout 2: Elasticity and Price Controls
No ratings yet
Handout 2: Elasticity and Price Controls
6 pages
Supply-Demand Diagrams
No ratings yet
Supply-Demand Diagrams
10 pages
Coursera - Marketing Research
No ratings yet
Coursera - Marketing Research
1 page
Consumers in 2016 Generation SW
No ratings yet
Consumers in 2016 Generation SW
14 pages
Flipkart Wired PDF
No ratings yet
Flipkart Wired PDF
4 pages
How John F. Kennedy Changed Decision Making For Us All
No ratings yet
How John F. Kennedy Changed Decision Making For Us All
4 pages
Enigma Case Study
No ratings yet
Enigma Case Study
7 pages
Apple and Samsung
No ratings yet
Apple and Samsung
5 pages
Corporate Sales - English - Complete PDF
67% (3)
Corporate Sales - English - Complete PDF
363 pages
Friedrich Nietzsche - Why Life Isn't Meaningless - Zat Rana - Medium
No ratings yet
Friedrich Nietzsche - Why Life Isn't Meaningless - Zat Rana - Medium
6 pages
Accurate Spectral Testing With Impure Source and Noncoherent Sampling
No ratings yet
Accurate Spectral Testing With Impure Source and Noncoherent Sampling
10 pages
Exercises For Graph Theory
No ratings yet
Exercises For Graph Theory
6 pages
0 Case Media Selection (L2) - DM
No ratings yet
0 Case Media Selection (L2) - DM
5 pages
Neil - Bernardo@eee - Upd.edu - PH Bernalyn - Decena@eee - Upd.edu - PH Ephraim - Lizardo@eee - Upd.edu - PH
No ratings yet
Neil - Bernardo@eee - Upd.edu - PH Bernalyn - Decena@eee - Upd.edu - PH Ephraim - Lizardo@eee - Upd.edu - PH
3 pages
Numerical Evaluation of Dynamic Response - Interpolation Method
No ratings yet
Numerical Evaluation of Dynamic Response - Interpolation Method
5 pages
Lab Manual Daa Ad3351 Aids III Sem Regulation 2021
No ratings yet
Lab Manual Daa Ad3351 Aids III Sem Regulation 2021
48 pages
Self Organizing Map
No ratings yet
Self Organizing Map
4 pages
DAA Module - 2
No ratings yet
DAA Module - 2
28 pages
Chapter 12 - Solutions: Problem 12.1. If The Sampling Frequency Is 1 (HZ), Then The Sampling Pe
No ratings yet
Chapter 12 - Solutions: Problem 12.1. If The Sampling Frequency Is 1 (HZ), Then The Sampling Pe
58 pages
Candidates Are Required To Answer Group A and Any 5 (Five) From Group B To E, Taking at Least One From Each Group
No ratings yet
Candidates Are Required To Answer Group A and Any 5 (Five) From Group B To E, Taking at Least One From Each Group
4 pages
Electronic Device Floyd Filters
No ratings yet
Electronic Device Floyd Filters
25 pages
19me21p1 PDF
No ratings yet
19me21p1 PDF
2 pages
AI
No ratings yet
AI
15 pages
The Lagrangian Relaxation Method For Solving Integer Programming Problems
No ratings yet
The Lagrangian Relaxation Method For Solving Integer Programming Problems
12 pages
Decision Tree
No ratings yet
Decision Tree
6 pages
Aut7.1.4 Recognise The Difference Between Linear and Non Linear Sequences
No ratings yet
Aut7.1.4 Recognise The Difference Between Linear and Non Linear Sequences
39 pages
Ma214 S23 Part08
No ratings yet
Ma214 S23 Part08
15 pages
AI and ML Lab Manual
No ratings yet
AI and ML Lab Manual
29 pages
Sampling and Aliasing: CSE 421 Digital Control
No ratings yet
Sampling and Aliasing: CSE 421 Digital Control
12 pages
Cohen-Sutherland Line Clipping Algorithm
No ratings yet
Cohen-Sutherland Line Clipping Algorithm
5 pages
KNN Presentation
No ratings yet
KNN Presentation
19 pages
Cep
No ratings yet
Cep
2 pages
LZW Encoding and Decoding
No ratings yet
LZW Encoding and Decoding
18 pages
Simplex Method
No ratings yet
Simplex Method
19 pages
Numerical Methods: Session 1: Principles of Numerical Mathematics
No ratings yet
Numerical Methods: Session 1: Principles of Numerical Mathematics
24 pages
4 Introduction To Systems
No ratings yet
4 Introduction To Systems
16 pages
OBJECTIVE
No ratings yet
OBJECTIVE
8 pages
Bilge Günsel TEL531E Detection and Estimation Theory W #1-2
No ratings yet
Bilge Günsel TEL531E Detection and Estimation Theory W #1-2
25 pages