CLN Improve doc/error consistency for GaussianProcessRegressor #19687

chrisyeh96 · 2021-03-16T07:30:12Z

Improves the documentation for GaussianProcessRegressor.

correctly links to Glossary
fixes typo (BGFS should be BFGS)
consistently uses n_targets instead of a combination of n_targets and n_output_dims

Regarding the last note though, it seems like scikit-learn hasn't standardized on the proper word to describe a multi-target / multi-output regression. Some models use "target," while others use "output." I chose to use n_targets here because it is used by most of the linear models, and no other page used n_output_dims. (Several others use n_outputs.)

thomasjpfan

Thank you for the PR @chrisyeh96 !

Minor comment, otherwise LGTM

sklearn/gaussian_process/_gpr.py

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

chrisyeh96 · 2021-03-22T21:54:08Z

I made the suggested changes, but I noticed an additional problem. The function sample_y() itself has a n_samples parameter, and the argument X has shape (n_samples, n_features), but the two n_samples are not the same. This can lead to a lot of confusion.

def sample_y(self, X, n_samples=1, random_state=0):
    """Draw samples from Gaussian process and evaluate at X.
    Parameters
    ----------
    X : array-like of shape (n_samples, n_features) or list of object
        Query points where the GP is evaluated.
    n_samples : int, default=1
        The number of samples drawn from the Gaussian process
     ...

The existing documentation uses n_samples_X in the return value to distinguish between the two n_samples, but that still leaves the two n_samples in the parameters descriptions.

Thoughts on how to best resolve this duplicity? @thomasjpfan

thomasjpfan · 2021-03-23T11:45:34Z

What do you think of updating the docstring of X to the following?

         X : array-like of shape (n_samples_X, n_features) or list of object

chrisyeh96 · 2021-03-23T20:52:53Z

@thomasjpfan: agreed

I also made some other small fixes / clarifications as well.

sklearn/gaussian_process/_gpr.py

chrisyeh96 · 2021-03-23T22:22:16Z

@thomasjpfan I undid the np.dot -> @ code changes

chrisyeh96 · 2021-04-11T22:36:05Z

Bump. Will these changes be merged soon? @thomasjpfan

thomasjpfan

PRs with code changes need 2 reviews. I think if we revert the code changes, I would be okay with merging with only the documentation changes.

thomasjpfan · 2021-04-12T02:16:43Z

sklearn/gaussian_process/_gpr.py

+                raise ValueError("alpha must be a scalar or an array "
+                                 "with same number of entries as y. (%d != %d)"


This is a code change. Given the title of the PR, I would revert.

thomasjpfan · 2021-04-12T02:17:28Z

sklearn/gaussian_process/_gpr.py

@@ -315,8 +315,7 @@ def predict(self, X, return_std=False, return_cov=False):
        """
        if return_std and return_cov:
            raise RuntimeError(
-                "Not returning standard deviation of predictions when "
-                "returning full covariance.")
+                "At most one of return_std or return_cov can be requested.")


This is a code change. Given the title of the PR, I would revert.

chrisyeh96 · 2021-04-12T03:55:52Z

I'd rather not have to waste time creating a 2nd pull request for the code changes. Is there any way to get a 2nd reviewer on this? Thanks!

thomasjpfan

I'll merge because this PR changes to the error messages are small and are an improvement.

…t-learn#19687) Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Improve documentation consistency for GaussianProcessRegressor

8db148a

github-actions bot added the module:gaussian_process label Mar 16, 2021

thomasjpfan approved these changes Mar 20, 2021

View reviewed changes

sklearn/gaussian_process/_gpr.py Outdated Show resolved Hide resolved

sklearn/gaussian_process/_gpr.py Outdated Show resolved Hide resolved

thomasjpfan changed the title ~~Improve documentation consistency for GaussianProcessRegressor~~ DOC Improve documentation consistency for GaussianProcessRegressor Mar 20, 2021

github-actions bot added the Documentation label Mar 20, 2021

Apply suggestions from code review

44c834f

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

More code / documentation improvements for clarity in GPR

68b04ec

thomasjpfan reviewed Mar 23, 2021

View reviewed changes

sklearn/gaussian_process/_gpr.py Outdated Show resolved Hide resolved

Undo np.dot -> @

fe67c9e

thomasjpfan approved these changes Mar 24, 2021

View reviewed changes

thomasjpfan requested changes Apr 12, 2021

View reviewed changes

thomasjpfan changed the title ~~DOC Improve documentation consistency for GaussianProcessRegressor~~ CLN Improve documentation consistency for GaussianProcessRegressor Apr 12, 2021

thomasjpfan approved these changes Apr 12, 2021

View reviewed changes

thomasjpfan changed the title ~~CLN Improve documentation consistency for GaussianProcessRegressor~~ CLN Improve doc/error consistency for GaussianProcessRegressor Apr 12, 2021

thomasjpfan merged commit 7b343dd into scikit-learn:main Apr 12, 2021

chrisyeh96 deleted the patch-1 branch April 12, 2021 17:06

thomasjpfan added a commit to thomasjpfan/scikit-learn that referenced this pull request Apr 19, 2021

CLN Improve doc/error consistency for GaussianProcessRegressor (sciki…

f45a723

…t-learn#19687) Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

glemaitre mentioned this pull request Apr 22, 2021

Release 0.24.2 #19954

Merged

12 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CLN Improve doc/error consistency for GaussianProcessRegressor #19687

CLN Improve doc/error consistency for GaussianProcessRegressor #19687

Uh oh!

chrisyeh96 commented Mar 16, 2021

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

chrisyeh96 commented Mar 22, 2021 •

edited

Loading

Uh oh!

thomasjpfan commented Mar 23, 2021

Uh oh!

chrisyeh96 commented Mar 23, 2021 •

edited

Loading

Uh oh!

Uh oh!

chrisyeh96 commented Mar 23, 2021

Uh oh!

chrisyeh96 commented Apr 11, 2021

Uh oh!

thomasjpfan left a comment

Uh oh!

thomasjpfan Apr 12, 2021

Uh oh!

thomasjpfan Apr 12, 2021

Uh oh!

chrisyeh96 commented Apr 12, 2021

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

		raise ValueError("alpha must be a scalar or an array "
		"with same number of entries as y. (%d != %d)"

Uh oh!

CLN Improve doc/error consistency for GaussianProcessRegressor #19687

CLN Improve doc/error consistency for GaussianProcessRegressor #19687

Uh oh!

Conversation

chrisyeh96 commented Mar 16, 2021

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

chrisyeh96 commented Mar 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thomasjpfan commented Mar 23, 2021

Uh oh!

chrisyeh96 commented Mar 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

chrisyeh96 commented Mar 23, 2021

Uh oh!

chrisyeh96 commented Apr 11, 2021

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

thomasjpfan Apr 12, 2021

Choose a reason for hiding this comment

Uh oh!

chrisyeh96 commented Apr 12, 2021

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

chrisyeh96 commented Mar 22, 2021 •

edited

Loading

chrisyeh96 commented Mar 23, 2021 •

edited

Loading