FIX ConvergenceWarning in plot_gpr_on_structured_data (#31164) #31289


Open, wants to merge 7 commits into base: main

Conversation

EngineerDanny (Contributor)

Reference Issues/PRs

Closes #31164

What does this implement/fix? Explain your changes.

This PR fixes the ConvergenceWarning and subsequent L-BFGS abort in the structured-sequence Gaussian Process example by freezing baseline_similarity_bounds, exactly as core tests already do in test_gpr.py.
No API change.
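For context, the mechanism this fix relies on can be sketched with a standard kernel: in scikit-learn, passing `"fixed"` as a hyperparameter's bounds excludes it from the L-BFGS optimization of the log-marginal likelihood. This is a minimal sketch using an `RBF` kernel (not the example's custom `SequenceKernel`; the data here is made up for illustration):

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

X = np.linspace(0, 5, 20).reshape(-1, 1)
y = np.sin(X).ravel()

# Passing "fixed" as the bounds removes length_scale from the set of
# hyperparameters tuned during fit, so the optimizer never touches it.
kernel = RBF(length_scale=1.0, length_scale_bounds="fixed")
gpr = GaussianProcessRegressor(kernel=kernel).fit(X, y)

# The length scale is unchanged after fitting because it was frozen.
print(gpr.kernel_.length_scale)
```

Freezing `baseline_similarity_bounds` in the example works the same way: the problematic hyperparameter is simply never handed to the optimizer.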

Any other comments?

Added a one-word typo correction (“operate”) in the example narrative.


github-actions bot commented May 1, 2025

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: a377b3e. Link to the linter CI: here

@EngineerDanny (Contributor, Author) left a comment

Thanks @StefanieSenger and @ogrisel for your detailed investigation.

I’ve kept baseline_similarity_bounds="fixed" and added a note in the example explaining why this parameter is frozen.
Please let me know if more context is needed!

@StefanieSenger (Contributor) left a comment

Thanks for your work, @EngineerDanny!

I have added a suggestion on how to make this part of the example a bit simpler. The rationale behind this is not to burden users with details that distract them from the actual intent of the example (which is not to demonstrate how to tune the baseline similarity, but rather to focus on model accuracy). They don't need to worry about convergence here, because you have fixed the problem so they can see what is really important. It is enough to let them know what "fixed" does and that they might want to tune this hyperparameter if they apply the example to their own use case. This suggestion is not meant to diminish your work (I can't even judge it without spending a few days on it), but to keep the example a simple read.

Looking forward to seeing this merged.

#
# %%
# Freeze baseline_similarity to avoid ill-conditioned optimisation
kernel = SequenceKernel(baseline_similarity_bounds="fixed")
Contributor

Now baseline_similarity_bounds="fixed" overrides the current baseline_similarity_bounds=(1e-5, 1) set in SequenceKernel.__init__(), which is still present in the code at line 57; having both is unnecessarily complex.

Let's simplify by using either one of these.
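The duplication being discussed can be sketched like this (a simplified stand-in for the example's class, not its real definition; the attribute names follow the PR discussion):

```python
# Hypothetical sketch: the default bounds in __init__ are overridden
# again at instantiation, so the (1e-5, 1) default is dead code.
class SequenceKernel:
    def __init__(self, baseline_similarity=0.5,
                 baseline_similarity_bounds=(1e-5, 1)):
        self.baseline_similarity = baseline_similarity
        self.baseline_similarity_bounds = baseline_similarity_bounds

# Passing "fixed" here shadows the (1e-5, 1) default above; setting the
# value in only one place keeps the example simpler to read.
kernel = SequenceKernel(baseline_similarity_bounds="fixed")
```

Moving `"fixed"` into the `__init__` default (as done later in this thread) makes the instantiation-time override unnecessary.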

Contributor (Author)

I have updated line 57 to use baseline_similarity_bounds="fixed" instead.
What do you think?

Contributor

Thank you. I think we could then leave out setting the param again upon instantiation.

EngineerDanny and others added 4 commits May 8, 2025 08:42
Co-authored-by: Stefanie Senger <91849487+StefanieSenger@users.noreply.github.com>
Co-authored-by: Stefanie Senger <91849487+StefanieSenger@users.noreply.github.com>
Co-authored-by: Stefanie Senger <91849487+StefanieSenger@users.noreply.github.com>
@StefanieSenger (Contributor) left a comment

Thanks for your updates, @EngineerDanny!

There is a rendering issue related to a character directly following the backticks in the note, but apart from that, this PR looks very good to me.

What do you think, @ogrisel?

Comment on lines +108 to +110
# to show this example without ``ConvergenceWarning``s that would otherwise raise.
# In another use case, you probably want to optimise on
# ``baseline_similarity_bounds`` and should set bounds by passing a tuple of
Contributor

Suggested change
# to show this example without ``ConvergenceWarning``s that would otherwise raise.
# In another use case, you probably want to optimise on
# ``baseline_similarity_bounds`` and should set bounds by passing a tuple of
# to show this example without ``ConvergenceWarning`` that would otherwise raise.
# In another use case, you probably want to optimise on
# ``baseline_similarity_bounds`` and should set bounds by passing a tuple of

This should fix the rendering issues that show up in the CI build.

Comment on lines +106 to +110
# .. note::
# Here, we freeze ``baseline_similarity_bounds`` by setting it to `"fixed"` in order
# to show this example without ``ConvergenceWarning``s that would otherwise raise.
# In another use case, you probably want to optimise on
# ``baseline_similarity_bounds`` and should set bounds by passing a tuple of
@ogrisel (Member) commented May 9, 2025

Let's keep it simple:

Suggested change
# .. note::
# Here, we freeze ``baseline_similarity_bounds`` by setting it to `"fixed"` in order
# to show this example without ``ConvergenceWarning``s that would otherwise raise.
# In another use case, you probably want to optimise on
# ``baseline_similarity_bounds`` and should set bounds by passing a tuple of
# .. note::
# Here, we freeze the value of ``baseline_similarity`` by setting
# `baseline_similarity_bounds="fixed"` as LBFGS would otherwise fail
# to optimize the value of this kernel parameter for some unknown reason.

I think the observed lack of convergence when passing non-fixed bounds reveals a bug, but I am not familiar enough with the GP literature w.r.t. kernel parameter tuning guarantees to be 100% sure whether this is expected in this particular case. And if it is a bug, I don't know whether it lies in the kernel code, in the choice of parametrization, or in the way we use LBFGS in the tuning code.

Maybe @snath-xoc has an idea? See the discussion in the linked issue for more background.

@snath-xoc (Contributor)

Hi all. At first I thought it might be due to the implementation of the GPC (it uses a binary Laplace approximation of the posterior); however, the same issue occurs with the GPR. Monitoring the log-marginal likelihood values, there also seems to be no change (I get steady values of 3.88 if I increase the number of iterations). Moreover, I get an "ABNORMAL_TERMINATION_IN_LNSRCH" error from L-BFGS with more iterations. This typically occurs when the gradient does not correctly match the function, so the line search fails. I therefore think the problem may lie in the way we calculate _f and _g, i.e., in the kernel specification, and is not a bug in the GPR or GPC code (phew).

I am not entirely sure what the best fix is here. Fixing the baseline similarity bounds seems to give a good enough solution; otherwise, we would need to try a different way of calculating the gradient/jac (any ideas?)
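The function/gradient mismatch suspected above is exactly what `scipy.optimize.check_grad` can surface: it compares an analytic gradient against a finite-difference approximation. A toy sketch (the objective and the deliberately wrong gradient are illustrative stand-ins, not the actual kernel code):

```python
import numpy as np
from scipy.optimize import check_grad

# Toy objective standing in for the log-marginal likelihood as a function
# of a single kernel hyperparameter theta.
def f(theta):
    return np.cosh(theta[0]) + 0.1 * theta[0] ** 2

def grad_correct(theta):
    # d/dtheta of f: sinh(theta) + 0.2 * theta
    return np.array([np.sinh(theta[0]) + 0.2 * theta[0]])

def grad_buggy(theta):
    # Deliberately inconsistent with f, mimicking a kernel whose analytic
    # gradient does not match its value computation.
    return np.array([np.cosh(theta[0]) + 0.2 * theta[0]])

x0 = np.array([1.5])
# check_grad returns the norm of (analytic - finite-difference) gradient:
# near zero for a correct gradient, visibly large for the buggy one.
err_ok = check_grad(f, grad_correct, x0)
err_bad = check_grad(f, grad_buggy, x0)
print(err_ok, err_bad)
```

When the analytic gradient disagrees with the function like this, L-BFGS line searches can fail with the same "ABNORMAL_TERMINATION_IN_LNSRCH" message mentioned above, so running such a check on the kernel's gradient would be one way to localize the bug.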

@ogrisel (Member) commented May 13, 2025

Thanks for your analysis @snath-xoc. @EngineerDanny I am +1 for merging this PR with the more direct message suggested in #31289 (comment) for the time being, but if someone understands how to fix the gradient computation of the kernel to make the optimization work in this example, feel free to say so or open a follow-up PR.

@StefanieSenger StefanieSenger added the Waiting for Second Reviewer First reviewer is done, need a second one! label May 22, 2025
@adrinjalali (Member)

Seems like we have a solution proposal in #31366. Shall we wait for that issue to be closed then?

@StefanieSenger (Contributor)

We could also merge this PR for now and replace the "fixed" bounds with real bounds in the actual fix.

Labels
Waiting for Second Reviewer First reviewer is done, need a second one!
Development

Successfully merging this pull request may close these issues.

Fix ConvergenceWarning in plot_gpr_on_structured_data.py example
5 participants