survival lec 6-1
survival lec 6-1
Analysis
Parametric Methods
Table of contents
Multivariate Univariate
Taylor Expansion:
Expand g(Z) around μ using the first-order Taylor approximation:
g(Z) ≈ g(μ)+(Z−μ)g′(μ)
E[g(Z)]≈E[g(μ)+(Z−μ)g′(μ)]
Expand Expectation:
E[g(Z)] ≈ E[g(μ)]+E[(Z−μ)g′(μ)]
≈ g(μ)+ E[(Z−μ)]g′(μ)
Var(g(Z)) ≈ Var(g(μ)+(Z−μ)g′(μ))
≈ Var((Z−μ) g′(μ))
Since g′(μ) is constant:
≈ (g′(μ) )2 Var(Z−μ)
Var(Z−μ)=Var(Z)
the approximate variance is:
Var(g(Z)) ≈ (g′(μ) )2Var(Z).
Formula for the Delta Method
Final Approximate :
Mean of g(Z) :
mean( g(Z) ) ≈ g(μ)
Variance of g(Z):
var( g(Z) ) ≈ ( g′(μ) )2 σ2
Final Approximate Distribution:
If Z ∼ Normal(μ,σ2),
then:
g(Z)∼Normal( g(μ) , (g′( μ ) )2 σ2)
Applications
The Delta Method is widely used in :
g(θ^)±zα/2⋅√( g′(θ) )2 σ2
After running the experiment and analyzing the data, they estimate:
● θ=−2.5
● θ0=0
● I(θ0)=0.4
This means the data does not provide sufficient evidence to conclude
that the medication significantly affects blood pressure.
code
Output
Rao Statistic (Score Test): Evaluates the likelihood function slope at the null hypothesis,
providing an alternative way to test parameter significance.
Note that Rao’s method does not use the MLE. Hence, no iterative calculation is necessary.
Example
A researcher is studying the effectiveness of two drugs, A and B, in reducing cholesterol levels. They fit a
regression model with two coefficients:
The observed log-likelihood function is analyzed, and the following data are
provided:
There is strong evidence to suggest that at least one of the drugs (A or B) has a
significant effect on cholesterol levels.
code
output
Table of contents
- Null Hypothesis ( H₀): There is no difference between the reduced model and the full model.
- Alternative Hypothesis ( H₁): The full model is better than the reduced model.
2. Calculate Log-Hazard:-
LRT= -2 . ( l(reduced)-l(full) )
5. Decision:-
A study has been conducted to investigate the factors influencing the time until failure (in hours) of a
specific type of device. Data has been collected on two variables for each device:
Hours of Use (): The total number of hours the device has been used.
Device Age (): The age of the device, measured in years.
The response variable is Time Until Failure (), which represents the duration (in hours) until the device
fails.
Given the data provided below, your task is to assess whether including Device Age () as a predictor
variable significantly improves the model's ability to predict Time Until Failure (), compared to using
Hours of Use () alone. Perform a Likelihood Ratio Test at a significance level of alpha=0.05, λ1=0.06, to
determine whether the inclusion of Device Age adds predictive value to the model.
Solution 1. State the Hypotheses:
● Null Hypothesis (H₀):
Adding( Device Age) does not improve the model. This implies that the
rate parameter for the full model is not significantly different from the
rate parameter for the reduced model.
● Let: -
- λ₁ be the rate parameter for the reduced model (which only
uses Hours of Use).
- λ₂ be the rate parameter for the full model (which uses both
Hours of Use and Device Age).
2. Reduced Model:-
In this model, only (Hours of Use ( X₁ )) is used as a predictor for Time Until Failure ( Y ):-
Y = X1
2.1 (λ1) for the Reduced Model and (λ1) are given :-
λ1= 0.06
n:no.of devices
∑Yi : sum for Hours of use values
2.2 Calculate the Log-Hazard for the Reduced Model:-
The log-Hazard for the reduced model is calculated using the formula: -
Y=X1+X2
1)Sum of Y-values for the Full Model:-
The predicted Yi values for the full model are calculated as the sum of x1 and x2 so
Yi=[12,18,24,30,36,42,48]
LRT= -2 . ( l(reduced)-l(full) )
Types
of
Censoring
Randomly Censored Data
Definition: Occurs when the censoring time is independent of the
survival time, This means that the reason for censoring is not
related to the event of interest.
represents the
is the probability that the event
Parameters of the For uncensored data, Ti is the
has not occurred by time Ti,
statistical model event time
given the parameter θ.
P(T > t)
2. left Censoring:
3.Interval Censoring:
• General Form:
upper bound
Lower bound
Note
Covariates: such as age and gender, can influence survival times and are modeled using the Cox
proportional hazards approach, as detailed in Chapter 4.
Total Likelihood Function for Mixed Data
• For a dataset with a mixture of left, right, and interval censoring, the likelihood
function combines all these components.
• General Formula:
Steps to Apply Likelihood Functions for
Randomly Censored Data
1 5 1 (Uncensored)
2 8 0 (Right Censored)
3 3 1 (Uncensored)
5 2 0 (Left Censored)
Step2
Solution Set Up the Likelihood Function for
Each observation
Step3: Form and Calculate the Total Likelihood Function
Step4: Maximizing the Likelihood
Code: log likelihood in R
Output
Thanks! Team Work
Mariam Mansour
Norhan Mohamed
Mai Osama
Jilan Mohamed
Mariam Magdy
Aya Esmaeel
Pola Samuel