Lecture 7
Lecture 7
1
Statistical significance
2
Statistical significance
• For example, one may choose a significance level of, say, 5%, and
calculate a critical value of a statistic (such as the mean) so that the
probability of it exceeding that value, given the truth of the
null hypothesis, would be 5%. If the actual, calculated statistic value
exceeds the critical value, then it is significant "at the 5% level".
Symbolically speaking, the significance level is denoted by α.
3
Statistical significance
4
Statistical significance
5
Receiver-Operator-Characteristic
(ROC)
• It is Statistics
A ROC curve shows the relationship of probability of false alarm
(x-axis) to probability of detection (y-axis) for a certain test.
• Expressed in medical terms: the probability of a positive test,
given no disease to the probability of a positive test, given
disease.
• The ROC curve may be used to determine an optimal cutoff point
for the test.
6
Determining significance of
database matches
• When searching a DB, the challenge for
analysis methods is to determine if
matches are related (true-positive, TP) or
unrelated (true-negative, TN)
• At a given scoring threshold, it is likely that
unrelated sequences will be matched
erroneously (false-positives, FP) & some
correct matches will be missed (false-
negative, FN)
• The aim is to improve the resolution
between the curves - in the overlap, it is
difficult or impossible to establish if
matches are significant
• Different methods tackle this problem in
different ways
7
Practical Problem
• For example, doctors have measured the S100 protein in serum and
found that higher values tend to be associated with Creutzfeldt-
Jakob disease. The median value is 395 pg/ml for the 108 patients
with the disease and only 109 pg/ml for the 74 patients without the
disease.
• The doctors set a cut off of 213 pg/ml, even though they realized
that 22.2% of the diseased patients had values below the cut off and
18.9% of the disease-free patients had values above the cut off.
8
Practical Problem
• The two percentages listed above are the false negative and false
positive rates, respectively.
• If we lowered the cut off value, we would decrease the false
negative rate probability, but we would also increase the false
positive rate.
• Similarly, if we raised the cut off, we would decrease the false
positive rate, but we would increase the false negative rate.
9
Practical Problem: Graphical
Representation
N
True negative
True positive
10
ROC
11
Sensitivity (Sn)
# TP
Sn
# TP # FN
12
Specificity (Sp)
13
Precision
# TP
Precision
# TP # FP
14
Accuracy
# TP # TN
Accuracy
# TP # TN # FP # FN
15
How tell a good ROC curve from a
bad one?
• It is the diagnostic test which can be good or bad.
• A good diagnostic test is one that has small FP and FN rates across
a reasonable range of cut off values.
• A bad diagnostic test is one where the only cut offs that make the
FP rate low have a high FN rate and vice versa.
• Good test: ROC curve climbs rapidly towards upper left hand corner
of the graph. This means that (1- FN) rate is high and the FP rate is
low.
• Poor test: ROC curve follows a diagonal path from the lower left
hand corner to the upper right hand corner. This means that every
improvement in FP rate is matched by a corresponding decline in
the FN rate.
16
Area under ROC curve
• You can quantify how quickly the ROC curve rises to the upper left
hand corner by measuring the area under the curve.
• The larger the area, the better the diagnostic test.
– If the area is 1.0, you have an ideal test, because it achieves
both 100% sensitivity and 100% specificity.
– If the area is 0.5, then you have a test which has effectively 50%
sensitivity and 50% specificity. This is a test that is no better
than flipping a coin.
• Area under the curve does have one direct interpretation. If you take
a random healthy patient and get a score of X and a random
diseased patient and get a score of Y, then the area under the curve
is an estimate of P[Y>X] (assuming that large values of the test are
indicative of disease).
17
Area under ROC curve (AROC)
18
Example of an ROC curve
19
• convert this table to cumulative
percentages. Test
• A row (*) to represent the Value Diseased Healthy
cumulative percentage of 0%
which will end up corresponding to * 0 0
a diagnostic test where all the
results are considered positive
A 4 56
regardless of the test value.
B 12 84
• The last row represents the other
extreme, where all the results are C 32 94
considered negative regardless of
the test value. D 60 98
E 100 100
20
• The complementary Test Value TP FP
percentages shown above
represent the TP rate (or Sn)Always
and the FP rate (or 1-Sp). Postive 100 100
A is negative
B-E is positive 96 44
A-B is negative
C-E is positive 88 16
A-C is negative
D-E is positive 68 6
A-D is negative
E is positive 40 2
Never positive 0 0
21
22
• Here is information about Area Under the Curve. This area (0.91) is
quite good; it is close to the ideal value of 1.0 and much larger than
worst case value of 0.5.
23
24
What's a good value for the area
under the curve?
• Tricky and it depends a lot on the context of your individual problem.
25
Reference
26