-
-
Notifications
You must be signed in to change notification settings - Fork 26.2k
Closed
Labels
Description
scikit-learn/sklearn/metrics/classification.py
Line 1852 in a24c8b4
Therefore, the lower the Brier score is for a set of predictions, the |
Therefore, the lower the Brier score is for a set of predictions, the better the predictions are calibrated.
As far as I can tell, this claim is incorrect (also incorrect in the Wikipedia entry cited) and should be deleted or detailed (one component of the two-term decomposition of the Brier score does assess calibration).
Here's a counter-example on a dataset with 12 points and 2 estimators.
model A | model B | y |
---|---|---|
0.00 | 0.00 | 0 |
0.25 | 0.25 | 0 |
0.25 | 0.25 | 0 |
0.25 | 0.25 | 0 |
0.25 | 0.75 | 1 |
0.50 | 0.50 | 0 |
0.50 | 0.50 | 1 |
0.75 | 0.25 | 0 |
0.75 | 0.75 | 1 |
0.75 | 0.75 | 1 |
0.75 | 0.75 | 1 |
1.00 | 1.00 | 1 |
Brier scores:
model A | model B |
---|---|
0.166666666667 | 0.0833333333333 |
Calibration curves: