BUG Fix covariance and stdev shape in GPR with normalize_y #22199

Tenavi · 2022-01-12T21:58:40Z

Reference Issues/PRs

Resolves #22174
Resolves #22175

What does this implement/fix? Explain your changes.

Makes _y_train_std the appropriate shape in case normalize_y=False. Then numpy.outer(y_var, self._y_train_std**2) and numpy.outer(y_cov, self._y_train_std**2) always make n_targets predictions. These still get squeezed as per convention. This fixes the shapes of arrays returned by predict when return_std=True or return_cov=True.
Applies a squeeze to y_mean inpredict if the shape is (n_samples, 1) to make this (n_samples,), following the documentation.
In sample_y, when n_targets > 1 then the loop in multivariate normal predictions also loops over the last axis (targets) of the predicted covariance array. This fixes the problem with incorrect shapes returned by sample_y.

Any other comments?

An old test had to be modified to reflect the correct shapes output by predict.

glemaitre · 2022-01-27T15:45:36Z

sklearn/gaussian_process/tests/test_gpr.py

@@ -682,3 +683,43 @@ def test_y_std_with_multitarget_normalized():
    assert y_pred.shape == (n_samples, n_targets)
    assert y_std.shape == (n_samples, n_targets)
    assert y_cov.shape == (n_samples, n_samples, n_targets)
+
+
+def test_y_std_cov_with_multitarget():


For this test, it would be better to use pytest.mark.parametrize on the previous test to avoid code redundancy.

In addition, I see that I already did a PR but forgot about it: #21996

It was covering one of the problems. Could you check that the solution is equivalent? I also see that we made an additional test to check the value of the std. dev. and covariance. This should be added in yours as well.

Hi! Yes your solution in #21996 is equivalent. In addition, I added line 399-400 which make sure that y_mean is also the right shape corresponding to the docstring for gpr.predict.

I merged your fixes in with this branch and updated the test use pytest.mark.parametrize as suggested. Is the test for the values of std. dev. and covariance that you mentioned the one included in #21996?

…to use pytest.mark.parametrize

…nto gpr_shapes

…to 1.1

glemaitre · 2022-01-28T10:41:50Z

sklearn/gaussian_process/tests/test_gpr.py

-    """Check the proper normalization of `y_std` and `y_cov` in multi-target scene.
+@pytest.mark.parametrize("normalize_y", [True, False])
+@pytest.mark.parametrize("n_targets", [0, 1, 10])
+def test_multitarget_shape(normalize_y, n_targets):


Since this test is only testing multi-target I think that we don't need to parameterize it with n_targets.

Oops, yes I think the name on the test should be changed. I included n_targets = 0 and 1 because it was having problems with that, too.

doc/whats_new/v1.1.rst

sklearn/gaussian_process/tests/test_gpr.py

glemaitre

Otherwise LGTM

doc/whats_new/v1.1.rst

sklearn/gaussian_process/tests/test_gpr.py

… n_targets=None in test parameterization

sklearn/gaussian_process/tests/test_gpr.py

glemaitre

I merged the comment. The PR LGTM.
@thomasjpfan since you reviewed my original PR, you might want to have a look such that we can merge this fix. I thought that we added it in 1.0.2 indeed.

thomasjpfan

Minor nits in the test, otherwise LGTM

sklearn/gaussian_process/_gpr.py

sklearn/gaussian_process/tests/test_gpr.py

thomasjpfan · 2022-02-14T21:18:57Z

Thanks for working on this PR @Tenavi !

…arn#22199) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Nakamura-Zimmerer, Tenavi (ARC-AF) <tenavi.nakamura-zimmerer@nasa.gov>

* Updating to scipy 1.12 * Scikitlearn 1.0 is incompatible with scipy 1.12, so updating to 1.1 * Sometimes only a 1d array is returned. * Removing testing kulsinski metric. * simps is deprecated so switching to simpson * Handle change from scikit-learn/scikit-learn#22199 * Improving test to classify better. * Regold due to scikit change scikit-learn/scikit-learn#22604 * Regold PolyExponential files (had rel err of 1e-03 or less) * Make timestep uniform for scipy update. * Regolding because of changes in scipy 1.12 * Increase limits to improve convergence. * Unpinning xarray and updating numpy * Updating various libraries. * Fix working with newer tensorflow. * Values need to be switched to tuples for hstack in numpy 1.26 * Updating to new ray version. * The deque size can be bigger in python 3.11 * Report difference in row lengths, instead of crashing OrderedCSVDiffer. Also report gold file name. * Remove Fourier__signal_f__period10.0__phase This was either +pi or -pi semirandomly, so nolonger testing it. * Regolding changes to ROM/TimeSeries/DMD/BOPDMD because of library changes. * Support xarray 2024.7 and newer. Pre 2024.7 automatically squeeze()ed groupby results, so now need to explicitly call squeeze(). * Fixing long line. * Increasing zero threshold because of change in libraries. * Remove version from setuptools since ray updated. * Optimizing persistence in BayesianMatyas. * Switch OVO to use estimator that is not constantly zero. * Use keepdims instead of try catch block. * Updating to default using python 3.11

glemaitre and others added 6 commits December 16, 2021 10:56

FIX make GPR works with multi-target and normalize_y=False

76f8680

update gpr multitarget

329dee6

TST add additional case with constant target

361d3d0

add original PR author

4d6a510

TST improve assert

75a28bd

shape fixes

ab057a0

github-actions bot added the module:gaussian_process label Jan 12, 2022

added pr number

c82ad11

This was referenced Jan 12, 2022

Multi-target GPR sample_y fails when normalize_y=True #22175

Closed

Multi-target GPR predicts only 1 std when normalize_y=False #22174

Closed

Merge branch 'main' into gpr_shapes

511043f

glemaitre reviewed Jan 27, 2022

View reviewed changes

glemaitre changed the title ~~[MRG] gaussian process shape fixes~~ BUG fix the shape of the covariance and std. dev. in GPR depending on normalize_y Jan 27, 2022

Tenavi mentioned this pull request Jan 27, 2022

Is/gpr multitarget Tenavi/scikit-learn#1

Merged

Nakamura-Zimmerer, Tenavi (ARC-AF) added 3 commits January 27, 2022 10:09

merged with mostly equivalent solution PR 21996; updated shape tests …

87a9819

…to use pytest.mark.parametrize

Merge branch 'gpr_shapes' of https://github.com/Tenavi/scikit-learn i…

128f39d

…nto gpr_shapes

consolidated gpr shape tests; moved whats new documentation from 1.0 …

e1c0df8

…to 1.1

glemaitre reviewed Jan 28, 2022

View reviewed changes

added new test for gpr.sample_y shape, changed name of other new test

a8908a8

glemaitre reviewed Jan 31, 2022

View reviewed changes

doc/whats_new/v1.1.rst Show resolved Hide resolved

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

sklearn/gaussian_process/tests/test_gpr.py Show resolved Hide resolved

added check of y_samples.shape before fitting; changed n_targets=0 to…

b1a68dc

… n_targets=None in test parameterization

glemaitre reviewed Feb 9, 2022

View reviewed changes

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

glemaitre mentioned this pull request Feb 10, 2022

FIX make GPR works with multi-target and normalize_y=False #21996

Closed

glemaitre reviewed Feb 10, 2022

View reviewed changes

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

Update sklearn/gaussian_process/tests/test_gpr.py

da42222

glemaitre approved these changes Feb 10, 2022

View reviewed changes

thomasjpfan approved these changes Feb 13, 2022

View reviewed changes

sklearn/gaussian_process/_gpr.py Outdated Show resolved Hide resolved

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

sklearn/gaussian_process/tests/test_gpr.py Outdated Show resolved Hide resolved

Nakamura-Zimmerer, Tenavi (ARC-AF) added 3 commits February 14, 2022 08:13

little code cleanup suggested by thomasjpfan

80027a7

merge with commented FIXME

9985964

typo fix

bdcb2e7

thomasjpfan changed the title ~~BUG fix the shape of the covariance and std. dev. in GPR depending on normalize_y~~ BUG Fix covariance and stdev shape in GPR with normalize_y Feb 14, 2022

thomasjpfan merged commit 3786daf into scikit-learn:main Feb 14, 2022

mmahsereci mentioned this pull request May 27, 2022

small fix in quadrature GPy wrapper EmuKit/emukit#418

Merged

eddiebergman mentioned this pull request Nov 15, 2022

Update scikit learn 1.2 automl/auto-sklearn#1611

Closed

54 tasks

This was referenced May 1, 2023

ANM incompatible with scikit-learn >= 1.1.3 py-why/causal-learn#112

Merged

ANM incompatible with scikit-learn >= 1.1.3 FenTechSolutions/CausalDiscoveryToolbox#155

Open

ErdunGAO mentioned this pull request May 19, 2023

Update ANM to be compatible with the latest version of sklearn py-why/causal-learn#113

Merged

joshua-cogliati-inl added a commit to joshua-cogliati-inl/raven that referenced this pull request Dec 6, 2024

Handle change from scikit-learn/scikit-learn#22199

2640624

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG Fix covariance and stdev shape in GPR with normalize_y #22199

BUG Fix covariance and stdev shape in GPR with normalize_y #22199

Uh oh!

Tenavi commented Jan 12, 2022 •

edited

Loading

Uh oh!

glemaitre Jan 27, 2022

Uh oh!

Tenavi Jan 27, 2022

Uh oh!

glemaitre Jan 28, 2022

Uh oh!

Tenavi Jan 28, 2022

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasjpfan commented Feb 14, 2022

Uh oh!

Uh oh!

Uh oh!

BUG Fix covariance and stdev shape in GPR with normalize_y #22199

BUG Fix covariance and stdev shape in GPR with normalize_y #22199

Uh oh!

Conversation

Tenavi commented Jan 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

glemaitre Jan 27, 2022

Choose a reason for hiding this comment

Uh oh!

Tenavi Jan 27, 2022

Choose a reason for hiding this comment

Uh oh!

glemaitre Jan 28, 2022

Choose a reason for hiding this comment

Uh oh!

Tenavi Jan 28, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasjpfan commented Feb 14, 2022

Uh oh!

Uh oh!

Tenavi commented Jan 12, 2022 •

edited

Loading