03 - Probability Distributions and Estimation
03 - Probability Distributions and Estimation
03 - Probability Distributions and Estimation
and
Random Variables
Introduction
Probability is the measure of uncertainty and have always been an
important aspect in the reliability assessment of industrial products
and/or equipments.
Good product design is of course essential for products with high
reliability. However, no matter how good the product design is,
products deteriorate over time since they are operating under certain
stress or load in the real environment, often involving randomness.
Maintenance has, thus, been introduced as an efficient way to assure a
satisfactory level of reliability during the useful life of a physical asset.
Lecture Overview
Definitions
Mathematical Derivations
Example /Tutorial
Probability definition?
Random Variable
A random variable is defined as a function that assigns a real value to every
possible outcome or event of an experiment or observation.
In many applications ranging from manufacturing engineering and elsewhere, the
outcomes x1, x2,,xn of events that constitute a sample over an interval of time
(or space) take real numerical values. It is therefore convenient, and sometimes
necessary, to express all events using numerical values on a real line. The functions
that establish such transformations to a real line are called random variables.
Random variables can be classified into two types: discrete and continuous
random variables.
Probability distribution
Example
h
.
n
nh
h
1 1 p 1 P ( E ).
n
n
Example
If E1 and E2 are two events, the probability that event E2 occurs given
that E1 has occurred is denoted by P(E2 | E1) (conditional probability
of E2 given has E1 occurred). If the two events are mutually exclusive
then P(E1 E2) = 0. Then E1 E2 denotes the event that either E1 or
E2 or both occur, then
P(E1 E2) = P(E1) + P(E2) P(E1 E2).
Discrete distribution
all x
10
Continuous distribution
Then a probability distribution of X is a function f (x) such that for any two
numbers a and b,
P a X b
f ( x )dx
f ( x) 0
2)
f ( x) 1
11
The area of the region between the graph of f and the x axis
is equal to 1.
Area = 1
12
F ( x) P X x
f ( y )dy
For each x, F(x) is the area under the density curve to the
left of x.
13
P X a 1 F (a )
and for any numbers a and b with a < b,
P a X b F (b) F (a )
14
F ( x) f ( x).
15
X E X
x f ( x)dx
16
E h( x ) h ( X )
h( x) f ( x)dx
17
V ( x)
(x )
f ( x)dx
E[ X ]
The standard deviation is
X V ( x).
18
E ( X )
V (X ) E X
19
Important Distributions
Binomial distribution
The Binomial Distribution with parameters n 0 and 0 < p < 1 its probability distribution
is expressed as:
n x
p (1 p ) n x
x
p ( x)
x 0,1, ..., n
Variance
2 np (1 p)
S 2 np (1 p )
In the process of manufacturing a product, inspection may test whether the product is good
or defective. The probability outcomes for analysing several products would follow a
binomial distribution.
21
Binomial Example
Example 1.
coin is
2
6 1 1
p ( x)
2 2 2
Example 1.
62
6! 1 1
2!4! 2 2
15
0.23
64
4
64
5
6 1 1
6 1 1
p ( x)
4 2 2
5 2 2
15
6
1
11
0.34
64 64 64 32
6 5
6
6 1 1
6 2 2
6 6
22
Binomial Exercise
To guard against spurious failure causing a plant
outage, automatic protective system are often
designed with three protective channels. Any 2 out
of 3 channels need to be in a failed state to initiate
a system shutdown. Assuming that the reliability
is 0.99, determine the probability of such an
automatic system being in a failed state when an
inspection is carried out on the system
23
Solution
p ( X 2)
2.97 10 4
3 3
3!
p 1 p 3 3
0.01 3 (1) 1 10 6
3!1!
3
p ( X 3)
24
25
Poison distribution
e x
f ( x)
x!
x 0,1,....
where the parameter (lamda) is greater than zero, and e is a constant equal to
approximately 2.71828.
Note that the mean and variance of the Poison distribution are equal
26
Poisson Example
Suppose the average number of lions seen on a 1 day safari is 5. What is the
probability that tourists will see fewer than four lions on the next 1-day safari?
Solution: This is a Poisson experiment in which we know the following:
= 5; since 5 lions are seen per safari, on average.
x = 0, 1, 2, or 3; since we want to find the likelihood that tourists will see fewer
than 4 lions; that is, we want the probability that they will see 0, 1, 2, or 3 lions.
since e = 2.71828.
27
Poisson Solutions
To solve this problem, we need to find the probability that tourists will see 0, 1, 2, or 3
lions.
We need to calculate the sum of four probabilities: P(0; 5) + P(1; 5) + P(2; 5) + P(3; 5).
To compute this sum, we use the Poisson formula:
P(x < 3, 5) = P(0; 5) + P(1; 5) + P(2; 5) + P(3; 5)
P(x < 3, 5) = [ (e-5)(50) / 0! ] + [ (e-5)(51) / 1! ] + [ (e-5)(52) / 2! ] + [ (e-5)(53) / 3! ]
P(x < 3, 5) = [ (0.006738)(1) / 1 ] + [ (0.006738)(5) / 1 ] + [ (0.006738)(25) / 2 ] +
[ (0.006738)(125) / 6 ]
P(x < 3, 5) = [ 0.0067 ] + [ 0.03369 ] + [ 0.084224 ] + [ 0.140375 ]
P(x < 3, 5) = 0.2650
The probability of seeing no more than 3 lions is 0.2650.
28
Poison distribution
Characteristics of a Poisson experiment are:
29
Normal Distribution
1
f ( x)
e
2
1 x
2
, where - x .
f(x)
30
Normal Distribution
Normal distributions are defined using 2 parameters: mean and standard deviation:
N(,).
Bell Shaped
f(x)
Symmetrical
Mean, Median and Mode are Equal
Location is determined by the mean,
Spread of S.D
32
1
z2 / 2
f ( z; 0,1)
e
2
z
The cumulative distribution function is z
( z ) P( Z z )
f ( y; 0,1) dy
33
Shaded area = (z )
Standard
normal
curve
0
34
Standard Normal
1.0
2.0
0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.500 0.504 0.508 0.512 0.516 0.520 0.524 0.528 0.532 0.536
0.540 0.544 0.548 0.552 0.556 0.560 0.564 0.567 0.571 0.575
0.841 0.844 0.846 0.848 0.851 0.853 0.855 0.858 0.860 0.862
0.977 0.978 0.978 0.979 0.979 0.980 0.980 0.981 0.981 0.982
35
Example
200 100
2.0
50
36
Empirical Rule
2 covers about 95% of Xs
3 covers about 99.7% of Xs
95.44%
99.72%
37
P (Z < 2.0)=0.9772
0
2.00
38
The row
shows the
value of Z to
the first
decimal point
0.00
0.01
0.02
0.0
0.1
.
.
.
2.0
.9772
Log-Normal distribution
f ( x)
x 2
1 ln x
for ( x 0)
Mean e
for ( x 0)
SD
e 2 2 e 2
2
40
Exponential distribution
e x
f ( x)
0
x0
otherwise
Mean
1
,
Variance
1
2
41
Gamma distribution
A continuous random variable X has a gamma distribution if the probability
density function is
f ( x)
1
1 x /
x
e
( )
0
otherwise
x0
where the parameter satisfy > 0, > 0. The standard gamma distribution has
= 1.
For > 0 the () is defined as
( )
1 x
x
e dx
Weibull Distribution
The Weibull is a very flexible life distribution model with two parameters. It
has probability density function given as
x 1e ( x / )
f ( x)
x0
x0
1
1
2
1
1 1
2
43
Cumulative distribution
function (cdf) F(t)
F (t ) P(T t )
f (s) ds
d F (t )
.
dt
44
Conditional probability
f (t1 | t 2 )
f (t1 , t 2 )
f (t 2 )
where f (t1, t2) is equal to the joint density function of T1 and T2,and f (t1) is equal the
marginal density function of T2 given by
f ( t2 )
f (t1 , t 2 ) dt1 .
f ( t 2 | t1 )
f (t1 , t 2 )
f (t1 )
45
F (t )
.
t
That is
n F (t1 , t 2 ,......., t n )
f (t1 , t 2 ,......., t n )
.
t1t 2 .....t n
Maximum Likelihood
Estimator (MLE)
L( ; t1 , t 2 ,......, t n ) f (ti | )
i 1
Log-likelihood estimates
l ( ) ln f (ti | ).
i 1
Alternatively, the maximum likelihood estimates may be found from the loglikelihood by setting the derivative to zero. The solution sometimes involves
numerical methods such as Newton-Raphson or Quasi-Newton algorithms.
Such methods are available in subroutine libraries such as NAG, Crowder et
al. (1993). However, from a theoretical point of view, the maximumlikelihood estimation method ensures that most statistical problems of
parameter estimation likely to arise in reliability contexts are easily dealt with.
48
Exponential MLE
For both complete and censored data the MLE estimator for lambda is given as
r
T
Where r is the number of failure and T is the cumulative test time (censored)
f (ti ) i e t
i 1,2,......r
L(t1 ,....t 2 )
exp(t )
nr
nr
i 1
exp ti (n r )t r
i 1
r
d ln L( ) r
ln L( ) t ln ti (n r )t r . Therefore,
t i ( n r )t r 0
d
i 1
i 1
Solving for lambda
r
r
r
t
i 1
( n r )t r
r
T
49
Exponential MLE
L ( )
i 1
exp(ti ) exp(ti )
i 1
i 1
i 1
ln L( ) r ln ti ti
n
n
d ln L( ) r
ti ti
d
i 1
i 1
r
n
i 1
50
51
Lower
Confidence
Limit
Point Estimate
Upper
Confidence
Limit
Width of
confidence interval
52
Concepts of Confidence
Intervals
54
Point estimators
Characteristics
Unbiased: E ( x) x
Efficiency: as n increases s get closer to
s
Standard error
n
Sampling error:
55
x Z
Assumptions
57
Common Z levels of
confidence
Z value
1.28
1.645
1.96
2.33
2.58
3.08
3.27
58
Assumptions
X t n-1
S
n
Example
If we want to estimate the average ages of kids
that ride a particular roller coaster ride in
Blackpool, and we take a random sample of 8 kids
exiting the ride, and find that their ages are: 2, 3,
4, 5, 6, 6, 7, 7.
a. Calculate the sample mean.
b. Calculate the sample standard deviation.
c. Calculate the standard error of the mean.
d. Calculate the 99% confidence interval.
61
Answer (a, b)
a. To calculate the sample mean.
8
X8
X
i 1
2 3 4 5 6 6 7 7 40
5.0
8
8
s X2
i 1
( X i 5) 2
8 1
s X 3.4 1.9
32 2 2 12 0 2(12 ) 2(2 2 ) 24
3.4
7
7
62
Answer (c)
c. Calculate the standard error of the mean.
s x 1 .9
sx
.67
n
8
63
Confidence Interval
Revision: Possible
Questions/Problems
QUESTIONS?
66