Logistic Regression
Outline
1. The classification problem
2. Why not linear regression?
3. Logistic regression formulation
4. Logistic regression cost function
5. Worked examples
The classification problem
The linear regression model discussed in the previous lesson assumes that the
response variable 𝑦 is quantitative (metrical)
• in many situations, the response variable is instead qualitative (categorical)
Qualitative variables take values in an unordered set 𝒞 = {c₁, …, c_K}, such as:
• eye color ∈ {brown, blue, green}
• email ∈ {spam, not spam}

Metric data:
• Describe a quantity
• An ordering is defined
• A distance is defined

Categorical data:
• Describe membership in categories
• It is not meaningful to apply an ordering
• It is not meaningful to compute distances
The classification problem
The process of estimating categorical outcomes using a set of regressors 𝝋 is called
classification
Estimating a categorical response for an observation 𝝋 can be referred to as classifying
that observation, since it involves assigning the observation to a category, or class
Often we are more interested in estimating the probabilities that 𝝋 belongs to each
category in 𝒞
The most probable category is then chosen as the class for the observation 𝝋
Examples of classification problems
• A person arrives at the emergency room with a set of symptoms that could possibly
be attributed to one of three medical conditions
Which of the three conditions does the individual have?
• An online banking system manages transactions, storing the user's IP address, past transaction history, and so forth
Is the transaction fraudulent or not?
• A biologist collects DNA sequence data for a number of patients with and without
a given disease
Which of the DNA mutations are deleterious (disease-causing) and which are not?
Example: cat vs dog classification
Suppose that we measure the weight and height of some dogs and cats

We want to learn the classifier function 𝑓(⋅) that can tell us if a given input vector 𝝋 = (𝜑₁, 𝜑₂) is a dog or a cat
• 𝜑₁: weight [kg]
• 𝜑₂: height [cm]

[Figure: cats and dogs plotted in the (weight, height) plane, separated by the classifier function 𝑓(⋅)]

QUIZ: The point in the figure is classified by the model as a ?
The classification problem
QUIZ: Consider a company that produces sliding gates. The gates can have four weights: {300 kg, 400 kg, 500 kg, 600 kg}. We want to detect the weight of the gate. This is a:
☐ A regression problem
☐ A classification problem
☐ Both a regression and a classification problem
Why not linear regression?
Suppose that we are trying to estimate the medical condition of a patient in the
emergency room based on her symptoms
There are three possibilities: stroke, drug overdose and epileptic seizure
We could consider encoding these values as a quantitative response variable, 𝑦, as
𝑦 = 1 if stroke,
    2 if drug overdose,
    3 if epileptic seizure

However, we are implicitly saying that the «difference» between drug overdose and stroke is the same as the «difference» between epileptic seizure and drug overdose, which does not make much sense
Why not linear regression?
We can also change the encoding to

𝑦 = 1 if epileptic seizure,
    2 if stroke,
    3 if drug overdose

This would imply a totally different relationship among the three conditions
• each of these codings would produce fundamentally different linear models…
• …that would ultimately lead to different sets of estimates on test observations

In general, there is no natural way to convert a qualitative response variable with more than two levels into a quantitative response that is ready for linear regression
Why not linear regression?
With two levels, the situation is better. For instance, perhaps there are only two
possibilities for the patient’s medical condition: stroke and drug overdose
𝑦 = 0 if stroke,
    1 if drug overdose

We can fit a linear regression to this binary response, and classify as drug overdose if 𝑦̂ > 0.5 and stroke otherwise, interpreting 𝑦̂ as the probability of drug overdose

However, if we use linear regression, some of our estimates might be outside the [0, 1] interval, which does not make sense as a probability: there is nothing that "saturates" the output between 0 and 1
Logistic regression
Purpose: Estimate the probability that a set of input regressors 𝝋 ∈ ℝ^(d×1) belongs to one of two classes 𝑦 ∈ {0, 1}

Define the linear combination

𝑎 = Σ_{j=0}^{d−1} 𝜑_j ⋅ 𝜃_j = 𝝋ᵀ𝜽

The function 𝑠(𝑎) is the logistic function (sigmoid):

𝑠(𝑎) = 1 / (1 + e^(−𝑎)) = e^𝑎 / (1 + e^𝑎)

• 𝑎 ≫ 0 ⇒ 𝑠(𝑎) ≈ 1
• 𝑎 ≪ 0 ⇒ 𝑠(𝑎) ≈ 0

[Figure: the logistic (sigmoid) function, rising from 0 through 0.5 at 𝑎 = 0 towards 1]
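As a concrete illustration, the logistic function and its saturation behaviour can be sketched in a few lines of Python (a minimal sketch, using only the standard library):

```python
import math

def sigmoid(a):
    """Logistic (sigmoid) function s(a) = 1 / (1 + e^(-a))."""
    return 1.0 / (1.0 + math.exp(-a))

# Saturation behaviour: large positive a gives s(a) close to 1,
# large negative a gives s(a) close to 0, and s(0) = 0.5.
print(sigmoid(0.0))    # 0.5
print(sigmoid(10.0))   # close to 1
print(sigmoid(-10.0))  # close to 0
```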
Logistic regression
Purpose: Estimate the probability that a set of input regressors 𝝋 ∈ ℝ^(d×1) belongs to one of two classes 𝑦 ∈ {0, 1}

𝑃(𝑦 = 1 | 𝝋) = 𝑠(𝑎) = 𝑠(𝝋ᵀ𝜽) = 1 / (1 + e^(−𝝋ᵀ𝜽))

The output of 𝑠(𝝋ᵀ𝜽) is interpreted as a probability:
• 𝝋ᵀ𝜽 ≫ 0 ⇒ 𝑠(𝝋ᵀ𝜽) ≫ 0.5 ⇒ 𝑃(𝑦 = 1 | 𝝋) ≈ 1 ⇒ 𝝋 is classified to class 1
• 𝝋ᵀ𝜽 ≪ 0 ⇒ 𝑠(𝝋ᵀ𝜽) ≪ 0.5 ⇒ 𝑃(𝑦 = 1 | 𝝋) ≈ 0 ⇒ 𝝋 is classified to class 0
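A minimal Python sketch of this classification rule (the parameter values below are hypothetical, chosen only to illustrate the rule):

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def predict(phi, theta):
    """Return (probability of class 1, predicted class) for regressors phi.

    phi and theta are plain lists; phi[0] is the intercept regressor (1.0).
    """
    a = sum(p * t for p, t in zip(phi, theta))  # a = phi^T theta
    prob = sigmoid(a)
    return prob, 1 if prob >= 0.5 else 0

# Hypothetical parameters, for illustration only
theta = [-1.0, 2.0]
print(predict([1.0, 1.5], theta))  # a = 2 > 0  -> class 1
print(predict([1.0, 0.0], theta))  # a = -1 < 0 -> class 0
```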
Logistic regression cost function
Suppose we have at our disposal a dataset 𝒟 = {(𝝋(1), 𝑦(1)), …, (𝝋(𝑁), 𝑦(𝑁))}, where 𝝋(𝑖) ∈ ℝ^(d×1) and 𝑦(𝑖) ∈ {0, 1}, 𝑖 = 1, …, 𝑁, i.i.d.

Estimate a logistic regression model 𝑃(𝑦(𝑖) = 1 | 𝝋(𝑖)) = 1 / (1 + e^(−𝝋(𝑖)ᵀ𝜽)) ≡ 𝜋(𝑖)

The logistic regression cost function 𝐽(𝜽) is defined as:

𝐽(𝜽) = − Σ_{𝑖=1}^{𝑁} [ 𝑦(𝑖) ⋅ ln 𝜋(𝑖) + (1 − 𝑦(𝑖)) ⋅ ln(1 − 𝜋(𝑖)) ]
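The cost function can be evaluated directly from this definition. A minimal Python sketch on a toy dataset (the data values are made up for illustration):

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def cost(Phi, y, theta):
    """Cross-entropy cost J(theta) = -sum_i [y ln(pi) + (1-y) ln(1-pi)]."""
    J = 0.0
    for phi_i, y_i in zip(Phi, y):
        a = sum(p * t for p, t in zip(phi_i, theta))
        pi = sigmoid(a)
        J -= y_i * math.log(pi) + (1 - y_i) * math.log(1 - pi)
    return J

# Toy dataset: each row is [1 (intercept), regressor]; labels in {0, 1}
Phi = [[1.0, 0.5], [1.0, 2.0], [1.0, -1.0]]
y = [0, 1, 0]
# At theta = 0 every pi(i) = 0.5, so each term contributes ln 2
print(cost(Phi, y, [0.0, 0.0]))  # 3*ln(2), about 2.079
```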
Logistic regression cost function
QUIZ: In the logistic regression cost function, where are the parameters 𝜽 that we
want to estimate?
𝐽(𝜽) = − Σ_{𝑖=1}^{𝑁} [ 𝑦(𝑖) ⋅ ln 𝜋(𝑖) + (1 − 𝑦(𝑖)) ⋅ ln(1 − 𝜋(𝑖)) ]

☐ In the 𝑦(𝑖) terms
☐ In the ln terms
☐ In the 𝜋(𝑖) terms
Logistic regression cost function
Cost function interpretation
Suppose there is only one datum, 𝒟 = {(𝝋, 𝑦)}

⇒ 𝐽(𝜽) = −ln 𝜋 if 𝑦 = 1,
         −ln(1 − 𝜋) if 𝑦 = 0

Case 𝑦 = 1: 𝐽(𝜽) = −ln 𝜋
• 𝐽(𝜽) ≈ 0 if 𝑦 = 1 and 𝜋 ≈ 1
• 𝐽(𝜽) ≈ +∞ if 𝑦 = 1 and 𝜋 ≈ 0
Logistic regression cost function
Cost function interpretation
Suppose there is only one datum, 𝒟 = {(𝝋, 𝑦)}

⇒ 𝐽(𝜽) = −ln 𝜋 if 𝑦 = 1,
         −ln(1 − 𝜋) if 𝑦 = 0

Case 𝑦 = 0: 𝐽(𝜽) = −ln(1 − 𝜋)
• 𝐽(𝜽) ≈ 0 if 𝑦 = 0 and 𝜋 ≈ 0
• 𝐽(𝜽) ≈ +∞ if 𝑦 = 0 and 𝜋 ≈ 1
IN-DEPTH ANALYSIS
Computation of the minimum of 𝐽(𝜽)
We have to compute the gradient of 𝐽(𝜽) with respect to 𝜽 ∈ ℝ^(d×1). First, compute the derivative of 𝑠(𝑎) = 1 / (1 + e^(−𝑎)):

∂𝑠(𝑎)/∂𝑎 = ∂/∂𝑎 [ (1 + e^(−𝑎))^(−1) ] = −(1 + e^(−𝑎))^(−2) ⋅ (−e^(−𝑎)) = e^(−𝑎) / (1 + e^(−𝑎))²
= 1/(1 + e^(−𝑎)) ⋅ e^(−𝑎)/(1 + e^(−𝑎)) = 1/(1 + e^(−𝑎)) ⋅ (1 + e^(−𝑎) − 1)/(1 + e^(−𝑎))
= 1/(1 + e^(−𝑎)) ⋅ [ 1 − 1/(1 + e^(−𝑎)) ] = 𝑠(𝑎) ⋅ (1 − 𝑠(𝑎))

In the case where 𝑎 = 𝝋ᵀ𝜽, we have that

∇𝜽 𝑠(𝝋ᵀ𝜽) = 𝝋 ⋅ 𝑠(𝝋ᵀ𝜽) ⋅ (1 − 𝑠(𝝋ᵀ𝜽)) = 𝝋 ⋅ 𝜋 ⋅ (1 − 𝜋)

(dimensions: 𝝋 is d×1, the two scalar factors are 1×1, so the gradient is d×1)
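The identity s′(a) = s(a)(1 − s(a)) derived above can be checked numerically against a finite-difference approximation; a minimal Python sketch:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def sigmoid_derivative(a):
    """Closed form derived above: s'(a) = s(a) * (1 - s(a))."""
    s = sigmoid(a)
    return s * (1.0 - s)

# Check the closed form against a central finite difference
h = 1e-6
for a in [-3.0, -0.5, 0.0, 1.2, 4.0]:
    numeric = (sigmoid(a + h) - sigmoid(a - h)) / (2 * h)
    assert abs(numeric - sigmoid_derivative(a)) < 1e-8
print("s'(a) = s(a)(1 - s(a)) verified numerically")
```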
IN-DEPTH ANALYSIS
Computation of the minimum of 𝐽(𝜽)
We can now compute the gradient of 𝐽(𝜽), where 𝜋(𝑖) = 1 / (1 + e^(−𝝋(𝑖)ᵀ𝜽)):

𝐽(𝜽) = − Σ_{𝑖=1}^{𝑁} [ 𝑦(𝑖) ln 𝜋(𝑖) + (1 − 𝑦(𝑖)) ln(1 − 𝜋(𝑖)) ]

Using ∇𝜽 𝜋(𝑖) = 𝝋(𝑖) 𝜋(𝑖) (1 − 𝜋(𝑖)):

∇𝜽 𝐽(𝜽) = − Σ_{𝑖=1}^{𝑁} [ 𝑦(𝑖) ⋅ 𝝋(𝑖) 𝜋(𝑖)(1 − 𝜋(𝑖)) / 𝜋(𝑖) − (1 − 𝑦(𝑖)) ⋅ 𝝋(𝑖) 𝜋(𝑖)(1 − 𝜋(𝑖)) / (1 − 𝜋(𝑖)) ]
= Σ_{𝑖=1}^{𝑁} [ −𝑦(𝑖) 𝝋(𝑖) (1 − 𝜋(𝑖)) + (1 − 𝑦(𝑖)) 𝝋(𝑖) 𝜋(𝑖) ]
= Σ_{𝑖=1}^{𝑁} 𝝋(𝑖) ⋅ [ −𝑦(𝑖) + 𝑦(𝑖) 𝜋(𝑖) + 𝜋(𝑖) − 𝑦(𝑖) 𝜋(𝑖) ]
= Σ_{𝑖=1}^{𝑁} 𝝋(𝑖) ⋅ (𝜋(𝑖) − 𝑦(𝑖))

(a d×1 vector: 𝝋(𝑖) is d×1 and (𝜋(𝑖) − 𝑦(𝑖)) is a scalar)
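The closed-form gradient Σᵢ 𝝋(i)(π(i) − y(i)) can likewise be verified against finite differences of J(𝜽); a minimal Python sketch on a made-up toy dataset:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def cost(Phi, y, theta):
    J = 0.0
    for phi_i, y_i in zip(Phi, y):
        pi = sigmoid(sum(p * t for p, t in zip(phi_i, theta)))
        J -= y_i * math.log(pi) + (1 - y_i) * math.log(1 - pi)
    return J

def gradient(Phi, y, theta):
    """Closed form derived above: grad J = sum_i phi(i) * (pi(i) - y(i))."""
    g = [0.0] * len(theta)
    for phi_i, y_i in zip(Phi, y):
        pi = sigmoid(sum(p * t for p, t in zip(phi_i, theta)))
        for j in range(len(theta)):
            g[j] += phi_i[j] * (pi - y_i)
    return g

# Check against central finite differences of J
Phi = [[1.0, 0.5], [1.0, 2.0], [1.0, -1.0]]
y = [0, 1, 0]
theta = [0.3, -0.7]
h = 1e-6
for j in range(len(theta)):
    tp = list(theta); tp[j] += h
    tm = list(theta); tm[j] -= h
    numeric = (cost(Phi, y, tp) - cost(Phi, y, tm)) / (2 * h)
    assert abs(numeric - gradient(Phi, y, theta)[j]) < 1e-6
print("gradient formula verified numerically")
```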
Gradient descent
It can be shown that:
• The cost function 𝐽(𝜽) is convex and admits a unique minimum
• The equations found by setting ∇𝜽𝐽(𝜽) = 𝟎 are nonlinear in 𝜽, and it is not possible to find a solution in closed form

✓ For this reason, we need to resort to iterative optimization algorithms

Use gradient descent:

𝜽(𝑘+1) = 𝜽(𝑘) − 𝛼 ⋅ ∇𝐽(𝜽)|𝜽=𝜽(𝑘),    𝛼 > 0: learning rate

(𝜽(𝑘+1), 𝜽(𝑘) and the gradient are d×1 vectors; 𝛼 is a scalar)
Gradient descent
𝐽(𝜽) = − Σ_{𝑖=1}^{𝑁} [ 𝑦(𝑖) ⋅ ln 𝜋(𝑖) + (1 − 𝑦(𝑖)) ⋅ ln(1 − 𝜋(𝑖)) ]

Repeat {
  𝜃₀ = 𝜃₀ − 𝛼 ⋅ Σ_{𝑖=1}^{𝑁} (𝜋(𝑖) − 𝑦(𝑖))
  𝜃₁ = 𝜃₁ − 𝛼 ⋅ Σ_{𝑖=1}^{𝑁} (𝜋(𝑖) − 𝑦(𝑖)) ⋅ 𝜑₁(𝑖)
  ⋮
  𝜃_{d−1} = 𝜃_{d−1} − 𝛼 ⋅ Σ_{𝑖=1}^{𝑁} (𝜋(𝑖) − 𝑦(𝑖)) ⋅ 𝜑_{d−1}(𝑖)
}
Logistic regression recap
The logistic regression model, despite its name, is not used for regression, but for
classification
Once the model estimates the probability of a class, we can classify a point to a particular
class if the probability for that class is above a threshold (usually 0.5)
The function that we are now trying to estimate is: 𝑓(𝝋) = 𝑃(𝑦 = 1 | 𝝋)

Logistic regression tries to model 𝑓 by using:

𝑠(𝝋ᵀ𝜽) = 1 / (1 + e^(−𝝋ᵀ𝜽))

The point 𝝋 can then be classified to class 𝑦 = 1 if 𝑠(𝝋ᵀ𝜽) ≥ 0.5
Logistic regression recap
The classification boundary found by logistic regression is linear

In fact, classifying with the rule
𝑦 = 1 if 𝑠(𝝋ᵀ𝜽) ≥ 0.5
is the same as saying
𝑦 = 1 if 𝝋ᵀ𝜽 ≥ 0

[Figure: a linear classifier separating cats and dogs in the (weight [kg], height [cm]) plane]
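The equivalence of the two rules follows from s being monotonically increasing with s(0) = 0.5; a quick numerical check in Python:

```python
import math

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

# The two rules agree at every test point: s is monotonic and s(0) = 0.5,
# so s(phi^T theta) >= 0.5 exactly when phi^T theta >= 0.
for a in [-5.0, -0.1, 0.0, 0.1, 5.0]:
    assert (sigmoid(a) >= 0.5) == (a >= 0)
print("s(a) >= 0.5  <=>  a >= 0")
```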
Students admissions classification

We want to estimate if a student will get admitted to a university given the results on two exams (𝜑₁, 𝜑₂)
• The training set consists of 𝑁 = 100 students with 𝜑₁(𝑖), 𝜑₂(𝑖) and 𝑦(𝑖) ∈ {0, 1}, for 𝑖 = 1, …, 𝑁
• Φ ∈ ℝ^(100×d)
• 𝒚 ∈ ℝ^(100×1)
• 𝜽 ∈ ℝ^(d×1)

% Read data from file
data = load('studentsdata.csv');
Phi = data(:, [1, 2]); y = data(:, 3);
% Set up the data matrix, adding a column of ones for the intercept term
[N, m] = size(Phi); d = m + 1;
Phi = [ones(N, 1) Phi];
% Initialize fitting parameters
theta = zeros(d, 1);
% Cost and gradient for the current theta
pi_s = sigmoid(Phi*theta);
J = -y'*log(pi_s) - (1-y)'*log(1-pi_s);
grad = Phi'*(pi_s - y);

Embed the cost and gradient in a function and pass it to an optimization algorithm that iteratively computes the gradient
The Framingham Heart Study

In the late 1940s, the U.S. Government set out to better understand cardiovascular disease

Plan: track a large cohort of initially healthy patients over time

The city of Framingham (MA) was selected as the site for the study in 1948:
• Appropriate size
• Stable population
• Cooperative doctors and residents

A total of 5209 patients aged 30-59 were enrolled. They had to take a survey and an exam every 2 years:
• Physical characteristics and behavioral characteristics
• Test results
The Framingham Heart Study
We will build models using the Framingham data to estimate and prevent heart disease
We will estimate the 10-year risk of Coronary Heart Disease
• CHD is a disease of the blood vessels supplying the heart
Heart disease has been the leading cause of death
worldwide since 1921:
• 7.3 million people died from CHD in 2008
• Since 1950, age-adjusted death rates have declined 60%
The Framingham Heart Study

Demographic risk factors
• male: sex of patient
• age: age in years at first examination
• education: Some high school (1), high school (2), some college (3), college (4)

Behavioral risk factors
• currentSmoker: currently smokes (0/1)
• cigsPerDay: cigarettes per day

Medical history risk factors
• BPmeds: On blood pressure medication at time of first examination
• prevalentStroke: Previously had a stroke
• prevalentHyp: Currently hypertensive
• diabetes: Currently has diabetes
The Framingham Heart Study

Risk factors from first examination
• totChol: Total cholesterol (mg/dL)
• sysBP: Systolic blood pressure
• diaBP: Diastolic blood pressure
• BMI: Body Mass Index (kg/m²)
• heartRate: Heart rate (beats/minute)
• glucose: Blood glucose level (mg/dL)

Use logistic regression to estimate whether or not a patient experienced CHD within 10 years of first examination
The Framingham Heart Study

[Figure: most critical identified risk factors]