
DOC add example to show how to deal with cross-validation #18821


Open · wants to merge 26 commits into main

Conversation

glemaitre
Member

This new example shows which questions cross-validation can answer.
It is also used to show how to inspect model parameters and hyperparameters and, more importantly, their variance.
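For readers skimming this thread, here is a minimal sketch of the kind of workflow the example describes; the dataset, alpha grid and CV settings below are placeholders, not necessarily the ones used in the example.

```python
import pandas as pd
from sklearn.datasets import load_diabetes
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import RepeatedKFold, cross_validate
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_diabetes(return_X_y=True, as_frame=True)
model = make_pipeline(StandardScaler(), RidgeCV(alphas=[0.1, 1.0, 10.0, 100.0]))

# Repeated k-fold gives a distribution of scores and of fitted models,
# not a single point estimate.
cv = RepeatedKFold(n_splits=5, n_repeats=10, random_state=0)
cv_results = cross_validate(model, X, y, cv=cv, return_estimator=True)

scores = pd.Series(cv_results["test_score"])
alphas = pd.Series([est[-1].alpha_ for est in cv_results["estimator"]])
print(scores.describe())  # variability of the generalization score
print(alphas.describe())  # variability of the tuned hyperparameter
```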

@glemaitre
Member Author

I found that maybe our documentation was missing such an example regarding the way to inspect parameters and what you should do while inspecting them.

@glemaitre
Member Author

We have a similar analysis in the example on the interpretation of coefficients of linear models.
However, I think that the point here is to give a recipe for how to extract the information rather than to make an interpretation.
In this regard, I did not want to modify any other example so as not to change their take-home message.

glemaitre and others added 2 commits November 12, 2020 18:20
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
Member

@lucyleeow lucyleeow left a comment

This is really interesting and useful. A lot of nitpicks - feel free to ignore the more opinionated ones!

Also, for some reason the seaborn boxplot is giving some warnings (screenshot attached).

Co-authored-by: Lucy Liu <jliu176@gmail.com>
Comment on lines +91 to +93
# regressor is also optimized via another internal cross-validation. This
# process is called a nested cross-validation and should always be implemented
# whenever model's parameters need to be optimized.
Member

Nested cross-validation takes much longer to train and I do not know if it is always recommended.

In this example, the nested cross-validation only searches through one parameter and that parameter does not vary across folds. This leads to a model selection process that settles on alpha=40.

In general, one can search through many parameter combinations, which would lead to different models, i.e., there would be one model for each loop of the outer fold. I do not think there is a way to decide which hyperparameter combination one should use in this case (unless you are ensembling them).
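To make the "one model per outer fold" point concrete, here is a rough sketch of such a nested cross-validation; the dataset, grid and fold counts are illustrative only, not taken from the PR.

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV, KFold, cross_validate
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_diabetes(return_X_y=True)

# Inner loop: hyperparameter search; outer loop: evaluation of the whole
# search procedure. Each outer fold produces its own fitted search object.
inner_cv = KFold(n_splits=5, shuffle=True, random_state=0)
outer_cv = KFold(n_splits=5, shuffle=True, random_state=1)
search = GridSearchCV(
    make_pipeline(StandardScaler(), Ridge()),
    param_grid={"ridge__alpha": [0.1, 1.0, 10.0, 100.0]},
    cv=inner_cv,
)
results = cross_validate(search, X, y, cv=outer_cv, return_estimator=True)

# One best_params_ per outer fold; they do not have to agree.
print([est.best_params_ for est in results["estimator"]])
```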

Member Author

> In general, one can search through many parameter combinations, which would lead to different models, i.e., there would be one model for each loop of the outer fold. I do not think there is a way to decide which hyperparameter combination one should use in this case (unless you are ensembling them).

I see your point here.

I still think it is beneficial to do the nested cross-validation even with many hyperparameters. The variability of the hyperparameters would be informative, and I am not aware of any other way to make this analysis with a less costly strategy. However, it is true that it would not necessarily help you in choosing a specific configuration.

Do you have a proposal to mitigate the statement?

Member

Looking at this again, I think I am okay with the wording as is.

Member

For me, the wording is a bit absolute. One could state that it is recommended, but certainly not "should always be implemented".

Member

@thomasjpfan thomasjpfan left a comment

I think the techniques in this example are very specific to linear models and take advantage of RidgeCV's ability to do "cv for free".

The first part shows how stable the algorithm is by using repeated k-fold cross-validation and looking at the scores and coefficients. I think this makes sense for seeing how the linear model behaves with this dataset.

When it gets to "Putting the model in production", I think this example suggests that stable coefficients and similar alpha values imply that we can move forward with putting the model into production. I do not think this is always the case. Imagine the distribution of scores were bimodal: in that case, one would select the hyperparameter config that maximizes the score.

From my understanding, connecting the stability of the coefficients to "good for production" means that the application requires model inspection. I do not think this is true in general?
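For context, the coefficient-stability check being discussed could look roughly like the snippet below, reusing the `cv_results` and `X` objects from the first sketch above (so a `StandardScaler` + `RidgeCV` pipeline is assumed).

```python
import pandas as pd

# `cv_results` as returned by cross_validate(..., return_estimator=True)
# on a StandardScaler + RidgeCV pipeline, as sketched earlier.
coefs = pd.DataFrame(
    [est[-1].coef_ for est in cv_results["estimator"]],
    columns=X.columns,
)
# A small spread across folds suggests stable coefficients; a wide or
# multimodal spread would be a warning sign for interpretation.
print(coefs.describe().loc[["mean", "std"]])
```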

Comment on lines +91 to +93
# regressor is also optimized via another internal cross-validation. This
# process is called a nested cross-validation and should always be implemented
# whenever model's parameters need to be optimized.
Member

Looking at this again, I think I am okay with the wording as is.

glemaitre and others added 3 commits January 6, 2021 17:54
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Base automatically changed from master to main January 22, 2021 10:53
Member

@thomasjpfan thomasjpfan left a comment

Minor comment, otherwise LGTM.

Member

@ogrisel ogrisel left a comment

Some more comments, otherwise LGTM!

# To conclude, cross-validation allows us to answer to two questions: are the
# results reliable and, if it is, how good is the predictive model.
#
# Model inspection
Member

This whole section is very specific to linear models. It would be nice to use some model-agnostic methods, too.

Member Author

@glemaitre glemaitre Aug 30, 2021

My initial thought when building the example was to show how to access a fitted attribute of one of the models fitted during cross-validation. So this is not really specific to linear models in this regard.

In terms of "model-agnostic" approaches, what message are you thinking of conveying?

Member

How would you inspect gradient boosted models?
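One model-agnostic option, not something the PR currently does, would be permutation importance computed per cross-validation fold; a hypothetical sketch:

```python
import pandas as pd
from sklearn.datasets import load_diabetes
from sklearn.ensemble import HistGradientBoostingRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import KFold

X, y = load_diabetes(return_X_y=True, as_frame=True)
cv = KFold(n_splits=5, shuffle=True, random_state=0)

importances = []
for train_idx, test_idx in cv.split(X):
    model = HistGradientBoostingRegressor(random_state=0).fit(
        X.iloc[train_idx], y.iloc[train_idx]
    )
    result = permutation_importance(
        model, X.iloc[test_idx], y.iloc[test_idx], n_repeats=10, random_state=0
    )
    importances.append(result.importances_mean)

# The spread of importances across folds plays the same role as the spread
# of linear-model coefficients in the example.
print(pd.DataFrame(importances, columns=X.columns).describe())
```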

Comment on lines 158 to 159
# We see that the regularization parameter, `alpha`, values are centered and
# condensed around 40. This is a good sign and means that most of the models
Member

I find the plot not that convincing, as there are several values of alpha far beyond 40.

Comment on lines 216 to 217
# information about the variance of the model. It should never be used to
# evaluate the model itself.
Member

What exactly is meant by that?

Member Author

I think here I wanted again to point out that you get a single point and not a score distribution.

Member

I'm still confused. Let's say you evaluate the MSE on the test set. Then, one contribution to your MSE score comes from the variance of your model (the variance of the model's predictions, to be precise), the other from the bias term and the variance of the target.
If you mean the variance of the algorithm, that is something slightly different. Additionally, the variance of the coefficients could be inferred in-sample (not exactly unbiased with the cross-validated alpha, but still). But for some reason, this is not done in scikit-learn.

glemaitre and others added 2 commits August 30, 2021 21:05
Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>
@cmarmo cmarmo added the "Waiting for Second Reviewer" label and removed the "Waiting for Reviewer" label on Oct 20, 2022
Member

@ArturoAmorQ ArturoAmorQ left a comment

Hi @glemaitre, are you still interested in working on this PR? If so, here is a batch of comments.

Note that you will also have to merge main into this branch, as it is a bit outdated.

glemaitre and others added 2 commits November 15, 2022 20:16
Co-authored-by: Arturo Amor <86408019+ArturoAmorQ@users.noreply.github.com>
Co-authored-by: Christian Lorentzen <lorentzen.ch@gmail.com>
Member

@lorentzenchr lorentzenchr left a comment

@glemaitre Do you intend to finish this one?

Comment on lines +39 to +42
# on the coefficients. Thus, the penalty parameter `alpha` has to be tuned.
# More importantly, this parameter needs to be tuned for our specific problem:
# tuning on another dataset does not ensure an optimal parameter value for the
# current dataset.
Member

Could this be phrased more clearly and more elegantly?

#
# Here, we define a machine learning pipeline `model` made of a preprocessing
# stage to :ref:`standardize <preprocessing_scaler>` the data such that the
# regularization strength is applied homogeneously on each coefficient; followed
Member

Suggested change
# regularization strength is applied homogeneously on each coefficient; followed
# regularization strength `alpha` is applied homogeneously on each coefficient; followed

# production, it is a good practice and generally advisable to perform an
# unbiased evaluation of the performance of the model.
#
# Cross-validation should be used to make this analysis. First, it allows us to
Member

"Fist, it allows..." What is the second point?

We should better state why and when to use cross-validation. For instance, if I have a huge amount of very identically distributed data, then I don't need CV. Also, it is still best practice to keep one test set away from CV to evaluate the final model (possibly retrained on the whole train-validation set) - as is done in this example.

Also, CV assesses more a training algorithm and not a single model.
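A rough sketch of the suggested workflow, with a test set kept outside of the cross-validation loop (the dataset and estimator are placeholders):

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_diabetes(return_X_y=True)

# Keep a test set completely outside of the cross-validation loop.
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = make_pipeline(StandardScaler(), RidgeCV())

# Cross-validation on the train/validation part assesses the procedure...
cv_scores = cross_val_score(model, X_train, y_train, cv=5)
print(cv_scores.mean(), cv_scores.std())

# ...then the final model is refit on all of it and evaluated once on the
# held-out test set.
final_model = model.fit(X_train, y_train)
print(final_model.score(X_test, y_test))
```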

Comment on lines +74 to +75
# interpret findings built on internal model's parameters. One possible cause
# of large variations are small sample sizes.
Member

With perfectly i.i.d. data, I can trust the variance / CI of a score, which is usually just a mean over the data.
A more important reason, IMO, is a violation of the i.i.d. assumption.
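For instance, with grouped (hence non-i.i.d.) data one could swap in a group-aware splitter; the group labels below are synthetic, purely for illustration:

```python
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.linear_model import RidgeCV
from sklearn.model_selection import GroupKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_diabetes(return_X_y=True)

# Hypothetical group labels (e.g. one group per patient); with real data
# they would come from the dataset itself.
rng = np.random.RandomState(0)
groups = rng.randint(0, 20, size=len(y))

model = make_pipeline(StandardScaler(), RidgeCV())

# GroupKFold keeps all samples of a group in the same fold, so the score
# is not inflated by leakage between correlated samples.
scores = cross_val_score(model, X, y, groups=groups, cv=GroupKFold(n_splits=5))
print(scores)
```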

Comment on lines +91 to +93
# regressor is also optimized via another internal cross-validation. This
# process is called a nested cross-validation and should always be implemented
# whenever model's parameters need to be optimized.
Member

For me, the wording is a bit absolute. One could state that it is recommended, but certainly not "should always be implemented".

# regressor. We can therefore use this predictive model as a baseline against
# more advanced machine learning pipelines.
#
# To conclude, cross-validation allows us to answer to two questions: are the
Member

Suggested change
# To conclude, cross-validation allows us to answer to two questions: are the
# To conclude, cross-validation allows us to answer two questions: are the

# more advanced machine learning pipelines.
#
# To conclude, cross-validation allows us to answer to two questions: are the
# results reliable and, if it is, how good is the algorithm used to create
Member

Suggested change
# results reliable and, if it is, how good is the algorithm used to create
# results reliable and, if they are, how good is the algorithm used to create

cv_pipelines = cv_results["estimator"]

# %%
# While the cross-validation allows us to know if our models are reliable, we
Member

"models are reliable" sound more like reliability diagrams so I could rephrase a bit.

Comment on lines +221 to +224
# However, you should be aware that this latest step does not give any
# information about the variance of our final model. Thus, if we want to
# evaluate our final model, we should get a distribution of scores if we would
# like to get this information.
Member

What is "variance of our final model". Do you have the bias variance decomposition in mind?
How would we get the "distribution of scores"?
I'm inclined to remove this remark.

@Micky774 Micky774 removed the "Waiting for Second Reviewer" label on Jul 27, 2023
8 participants