Fix GaussianMixture UnboundLocalError #20030
Conversation
@jjerphan could you please take a look?
I am wondering whether you think it's fine to specify the initialization parameters directly? They are actually calculated using the part of the code in
Here are a few suggestions @tliu68.
Actually, your contribution fixes more than just the best parameters in the case of divergence for mixture.GaussianMixture: a similar problem could also occur for mixture.BayesianGaussianMixture, whose best parameters (weight_concentration_, mean_precision_, means_, degrees_of_freedom_, covariances_, precisions_cholesky_) would otherwise not be set.
Regarding your previous comment, in-line definition of the parameters is fine as long as you add a comment like you did. 👍
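For context, a quick way to see the attributes at stake, assuming a short BayesianGaussianMixture fit on toy data. This sketch does not reproduce the divergence itself; it only lists the attributes whose assignment depends on the best-run bookkeeping.

```python
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

# Toy data and a deliberately short fit; max_iter=1 only means the run does
# not converge. This does NOT trigger the divergence bug, it just shows which
# attributes are taken from the "best" initialization.
rng = np.random.RandomState(0)
X = rng.uniform(size=(30, 3))

bgmm = BayesianGaussianMixture(n_components=2, max_iter=1, random_state=0).fit(X)

for attr in (
    "weight_concentration_",
    "mean_precision_",
    "means_",
    "degrees_of_freedom_",
    "covariances_",
    "precisions_cholesky_",
):
    assert hasattr(bgmm, attr)
```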
def test_gaussian_mixture_setting_best_params():
    # test that best_params is set appropriately
    # even if convergence was not reached
    # also a regression test for issue #18216
Ideally, we would like to create another test for BayesianGaussianMixture, but this might be hard (we need to find a set of initial parameters which would cause it to diverge). You can try to see if the parameters you have in-lined cause this problem for BayesianGaussianMixture. If they don't, it's fine not to go down the rabbit hole to find ones that do.
Suggested change:

-def test_gaussian_mixture_setting_best_params():
-    # test that best_params is set appropriately
-    # even if convergence was not reached
-    # also a regression test for issue #18216
+def test_gaussian_mixture_params_setting():
+    """`GaussianMixture`'s best_parameters, `n_iter_` and `lower_bound_`
+    must be set appropriately in the case of divergence.
+
+    Non-regression test for:
+    https://github.com/scikit-learn/scikit-learn/issues/18216
+    """
LGTM!
@tliu68 Thanks for the fix.
Could you also replace the 2 occurrences of np.infty with np.inf in mixture/_base.py?
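For reference, np.infty is only an alias of np.inf, so the rename does not change behaviour. A minimal check, written so it also runs on NumPy versions that have since dropped the alias:

```python
import numpy as np

# On NumPy versions that still ship the alias, both names refer to the same
# value; getattr keeps the check runnable where the alias has been removed.
assert getattr(np, "infty", np.inf) == np.inf
```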
@tliu68 Could you also put an entry in whats_new?
LGTM
…sian Mixture Model PR#11101, focused on allowing sample-based and k-means++ based GMM initialization. Also made the enhancements compatible with PRs scikit-learn#17937 and scikit-learn#20030.
Reference Issues/PRs
Fixes issue #18216
What does this implement/fix? Explain your changes.
Updates the condition on lower_bound to ensure that best_params is set even if convergence was not reached.

Any other comments?
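For readers coming from the issue, here is a minimal, self-contained sketch of the best-run bookkeeping this PR touches, paraphrased from BaseMixture.fit_predict; the extra `max_lower_bound == -np.inf` clause reflects one reading of the fix, not a verbatim copy of the diff.

```python
import numpy as np


def pick_best_run(final_lower_bounds):
    """Toy stand-in for the loop over initializations in BaseMixture.fit_predict.

    `final_lower_bounds` plays the role of the lower bound reached by each
    initialization. With only `lower_bound > max_lower_bound`, a run where every
    initialization diverges (lower bound stuck at -inf or NaN) never assigns
    `best_params`, which is the UnboundLocalError reported in issue #18216.
    """
    max_lower_bound = -np.inf
    best_run = None  # stands in for best_params / best_n_iter
    for run, lower_bound in enumerate(final_lower_bounds):
        # The `or max_lower_bound == -np.inf` guard ensures at least one run is
        # recorded even when no run improves on the initial -inf.
        if lower_bound > max_lower_bound or max_lower_bound == -np.inf:
            max_lower_bound = lower_bound
            best_run = run
    return best_run, max_lower_bound


# Every initialization "diverges": the guard still records a run instead of
# leaving best_run unset.
print(pick_best_run([-np.inf, float("nan")]))
```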