PA Lec 8 2024
PA Lec 8 2024
PA Lec 8 2024
Session 08
2
Learning Outcome
3
◼ Let random variable X1 represents the no. of PGP students arriving at Nescafe
between 2 PM to 4PM
◼ Let random variable X2 represents the no. of IPM students arriving at Nescafe
between 2 PM to 4PM
Y= No. of students from IPM and PGP jointly arriving at IIM Rohtak Nescafe Store 4
◼ Y = X1 +X2
5
Discrete Probability Distributions
6
Distribution of Random Variable
7
Bernoulli Random Variable
• If an experiment consists of a single trial and the outcome of the trial can only be
either a success* or a failure, then the trial is called a Bernoulli trial.
• 𝑓 𝑥 = 𝑝 𝑥 (1 − 𝑝)(1−𝑥)
• E(X) = p and Var(X) = p(1 – p).
* The terms success and failure are simply statistical terms, and do not have positive or
negative implications. In a production setting, finding a defective product may be
termed a “success,” although it is not a positive result.
𝒇 𝒙 = 𝒑𝒙 (𝟏 − 𝒑)(𝟏−𝒙)
9
Binomial Probability Distribution
◼ Four Properties of a Binomial Experiment
1. The experiment consists of a sequence of n identical bernoulli trials.
2. Two outcomes, success and failure, are possible on each trial.
3. The probability of a success, denoted by p, does not change from trial
to trial. (This is referred to as the stationarity assumption.)
4. The trials are independent.
10
Binomial Probability Distribution
11
Binomial Probability Distribution
• Binomial Probability Function
𝑛!
𝑓 𝑥 = 𝑝 𝑥 (1 − 𝑝)(𝑛−𝑥)
𝑥! 𝑛 − 𝑥 !
where:
x = the number of successes
p = the probability of a success on one trial
n = the number of trials
f(x) = the probability of x successes in n trials
n! = n(n – 1)(n – 2) ….. (2)(1)
12
Mean, Variance, and Standard Deviation of the Binomial Distribution
• Expected Value
E(x) = = np
• Variance
Var(x) = 2 = np(1 – p)
• Standard Deviation
𝜎= 𝑛𝑝(1 − 𝑝)
13
Geometric Distribution
or
Pascal Distribution
𝑓 𝑥 = 𝑝 (1 − 𝑝)(𝑥−1)
14
Mean, Variance, and Standard Deviation of the Geometric Distribution
• Expected Value
E(x) = = 1/p
• Variance
Var(x) = 2 = (1 – p)/ p2
• Standard Deviation
𝜎= (1 – p)/ p2
15
The Poisson Distribution
16
The Poisson Distribution
◼ The number of occurrences in each interval can vary from zero to infinity.
◼ The probability of an occurrence is the same for any two intervals of equal
length.
17
Poisson Distribution - Applications
19
◼ A property of the Poisson distribution is that the mean and variance
are equal.
=2
20
Poisson Distribution - Formula
−
e x
P( X = x) =
x!
where:
x = number of events in an area of opportunity
= expected number of events/average arrival rate
e = base of the natural logarithm system
(2.71828...)
Poisson Distribution - Characteristics
◼ If X ~ Poisson(λ), Then
◼ Mean
= E( X ) =
◼ Variance
Standard Deviation σ = λ
2
◼
σ= λ
Caselet 1.5
24
Hypergeometric Probability Function
◼ Sampling without replacement: i.e., after an item is drawn, it is not put back for
subsequent draws.
Trials not independent
The probability of success changes from trial to trial
26
◼ The probability of x successes in a random selection of n items is
𝑟 𝑁−𝑟
𝑥 𝑛−𝑥
𝑓 𝑥 =𝑃 𝑋=𝑥 =
𝑁
𝑛
N is the population size, n is the sample size or no of trials.
r is the number of elements in the population labeled success,
𝑥 = the number of successes
𝑟 𝑁−𝑟
𝑥 𝑛 − 𝑥 for 0 < x < r
𝑓 𝑥 =
𝑁
𝑛
number of ways
n – x failures can be selected
from a total of N – r failures
number of ways in the population
x successes can be selected
from a total of r successes
in the population
number of ways
n elements can be selected
from a population of size N
28
𝑟
◼ 𝐸 𝑋 =𝜇=𝑛 ,
𝑁
2 𝑟 𝑟 𝑁−𝑛
◼ 𝑉𝑎𝑟 𝑥 = 𝜎 = 𝑛 1−
𝑁 𝑁 𝑁−1
29
Caselet 1.7
◼ Wooden boxes are commonly used for the packaging and transportation of
mangoes. A convenience store in Morganville, New Jersey, regularly buys
mangoes from a wholesale dealer. For every shipment, the manager randomly
inspects five mangoes from a box containing 20 mangoes for damages due to
transportation. Suppose the chosen box contains exactly two damaged mangoes.
1. What is the probability that one out of five mangoes used in the inspection is
damaged?
2. If the manager decides to reject the shipment if one or more of the mangoes are
damaged, what is the probability that the shipment will be rejected?
3. Calculate the expected value, the variance, and the standard deviation of the
number of damaged mangoes used in the inspection.
30
◼ Inspect five mangoes from a box containing 20 mangoes with exactly two damaged mangoes.
◼ What is the probability that one out of the five mangoes is damaged?
𝑟 𝑁−𝑟
2 20 − 2
𝑥 𝑛−𝑥 𝑃 𝑋 = 1 = 1 5 − 1 = 0.3947
𝑓 𝑥 =𝑃 𝑋=𝑥 = 20
𝑁
𝑛 5
◼ If the manager decides to reject the shipment if one or more of the mangoes are damaged, what is
the probability that the shipment will be rejected?
2 20−2
0 5−0
𝑃 𝑋=0 = 20 = 0.5526
5
𝑃 𝑋 ≥ 1 = 1 − 𝑃 𝑋 = 0 = 1 − 0.5526 = 0.4474
31
◼ Calculate the expected value, the variance, and the standard deviation.
32
Inspect five mangoes from a box containing 20 mangoes
with exactly two damaged mangoes.
33
Quick Revision
Practice: Bayes Theorem
◼ A consulting firm submitted a bid for a large research project. The firm’s
management initially felt they had a 50 –50 chance of getting the project.
However, the agency to which the bid was submitted subsequently requested
additional information on the bid. Past experience indicates that for 75% of the
successful bids and 40% of the unsuccessful bids the agency requested additional
information.
◼ What is the prior probability of the bid being successful (that is, prior to the
request for additional information)?
◼ What is the conditional probability of a request for additional information given
that the bid will ultimately be successful?
◼ Compute the posterior probability that the bid will be successful given a request
for additional information.
35
S1 = successful, S2 = not successful, and B = request received
for additional information.
a. P(S1) = .50
(.50)(.75) .375
c. P(S1 B) = (.50)(.75) + (.50)(.40) = .575 = .65
36
Measures of Association Between Two Variables
37
Covariance
σ(𝑥𝑖 −𝑥)(𝑦
ҧ ത
𝑖 −𝑦)
For samples: 𝑠𝑥𝑦 =
𝑛−1
39
Correlation Coefficient
40
Correlation Coefficient
• The correlation coefficient is computed as follows:
𝑠𝑥𝑦
For samples: 𝑟𝑥𝑦 =
𝑠𝑥 𝑠𝑦
𝜎𝑥𝑦
For populations: 𝜌𝑥𝑦 =
𝜎𝑥 𝜎𝑦
Correlation Coefficient
42
43
Chebyshev’s Theorem
• At least (1 - 1/z2) of the items in any data set will be within z standard
deviations of the mean, where z is any value greater than 1.
• Chebyshev’s theorem requires z > 1, but z need not be an integer.
44
Chebyshev’s Theorem
• At least 75% of the data values must be within z = 2 standard
deviations of the mean.
• At least 89% of the data values must be within z = 3 standard
deviations of the mean.
• At least 94% of the data values must be within z = 4 standard
deviations of the mean.
45
Continuous random variable
Continuous Random Variables
49
Cumulative Probability Distribution
f(x)
Area = F(a)
x
a
f(x) P(a ≤ X ≤b) = Area under f(x)
Between a and b = F(b) – F(a)
x
a b
Continuous Probability Distributions
◼ Uniform distribution
◼ Normal distribution
◼ The t distribution
◼ The chi-square distribution
◼ Exponential distribution
◼ F distribution
◼ ...
f (x) Exponential
x
x
53
Continuous Probability Distributions
x x x
x1 x2 x1 x2 x1 x2
54