[MRG + 1] replace mutable default argument in _RidgeGCV #4376

DonBeo · 2015-03-10T11:17:01Z

This is a very small change to make me familiar with the repository and the git system. The mutable default value is replaced with a numpy array. This code passes al the tests.

DonBeo · 2015-03-10T12:34:40Z

Sorry I did a mistake. The numpy array are mutable as well. Maybe we should stop this push. Anyway this should not influence the library at all.

amueller · 2015-03-10T13:21:39Z

Well, you could use a tuple as you said. It would be good to add a test, though.

amueller · 2015-03-10T15:10:17Z

Why did you close this?

DonBeo · 2015-03-10T16:52:54Z

I was going to open a new one with the tuple instead of array. Is this not the correct procedure?

DonBeo · 2015-03-10T16:59:22Z

Sorry I did not know that was possible to update the branch during the pull. I reopen the pull request. I hope that this does not lead to confusion. Regarding the test I need some help. I will ask on the mail list.

amueller · 2015-03-10T17:51:06Z

You can ask here ;)
The pull request was already updated when you closed. The pull request will keep tracking your branch.
Actually, I don't think this can easily be tested. I'll try to add a test to the common_tests, but that might be a bit too tricky for this PR.

amueller · 2015-03-10T17:51:45Z

I think this looks good to merge.
We'll need another review though. The travis failure is unrelated and I restarted the tests.

arjoly · 2015-03-10T18:25:20Z

sklearn/linear_model/ridge.py

                 fit_intercept=True, normalize=False, scoring=None,
                 score_func=None, loss_func=None, cv=None, gcv_mode=None,
                 store_cv_values=False):
-        self.alphas = alphas
+        self.alphas = np.asarray(alphas)


This line mutates the init and we shouldn't.

argh indeed. this needs to go into the fit method.

arjoly · 2015-03-10T18:25:34Z

Whenever my remarks is addressed, +1!

DonBeo · 2015-03-10T18:46:12Z

Sorry I need a clarification.
Should we never modify the init method?
Should I change it?

Let me know!

amueller · 2015-03-10T18:48:54Z

Yes, we should never modify anything in the __init__ method, as parameters might get set via set_params and therefore bypass __init__.
Please remove the conversion from __init__ and create a local variable in fit that is set using the conversion.

DonBeo · 2015-03-10T19:16:39Z

I have replaced the self.alphas value in fit. I think this make the code more clear. Otherwise I can define a local variable alphas_loc = np.asarray(self.alphas) and replace all the occurencies of self.alphas with alphas_loc in the fit method. I do not know which approach is preffered.

amueller · 2015-03-10T19:46:04Z

the second approach, though you can just call the variable alphas.
The reason is that otherwise you overwrite the user input.

coveralls · 2015-03-10T20:17:10Z

Coverage remained the same at 95.12% when pulling 70c3f7c on DonBeo:my-feature into 588b3f7 on scikit-learn:master.

DonBeo · 2015-03-11T12:33:59Z

in the linereader function it was quiet difficult to change the default value. I have been able to replace it but I am not happy of the quality of the solution. Maybe it can be improved

arjoly · 2015-03-11T13:40:44Z

sklearn/linear_model/ridge.py

@@ -847,8 +847,10 @@ def fit(self, X, y, sample_weight=None):
        -------
        self : Returns self.
        """
+        alphas = np.asarray(self.alphas)


Is it necessary?

the tests are passed also without that line. The problem is that the default argiment was a numpy array. If you want I can delete it and just use self.alphas in all the method.

The problem is that the default argiment was a numpy array.

Have I missed something? It seems to be a list.

check https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/linear_model/ridge.py line 832. The default argument for init was a numpy array. The code works with both self.alphas everywhere or with alphas=np.array(self.alphas). Let me know what version should I keep

I would go with the least number of lines to modify and go with the tuple.
Though, the doc says that it accepts (only) numpy array. http://scikit-learn.org/dev/modules/generated/sklearn.linear_model.RidgeClassifierCV.html#sklearn.linear_model.RidgeClassifierCV From a user perpsective, I would expect array-like for alphas.

In that case, I would add a regression support and add support for array-like.

ok then I think the last updated version should be ok. I am using tuples as default value and we expect the users to use np.array. I think that this should work well. Unless I am not missing something. In that case explain me :-)

In that case, I would add a regression support and add support for array-like.

I mean. I would add a test for this. By the way, we should update the documentation.

I am afraid that you have to help me with the test. I am not sure of what to test and how to do it.

For instance, you could test that passing a tuple of alpha, a numpy array of alpha or a list of alpha give similar results.

ok I have added a test. Let me know if you think that this is fine.

… casted to a numpy array anymore

…ha value in ridgeCV

DonBeo · 2015-03-12T14:24:57Z

I think that this conflict can be caused by the test. My understanding is that to solve it I have to write the original sklearn repository. I think that only a moderator can do that.

DonBeo · 2015-03-20T19:38:50Z

Hi,
I did rebase again. Hopefully now it should be fine.
My understanding is that with the command git rebase -i master I can delete some of the commits but at the moment all the commits are related to changes that I did.

Let me know if this is fine.

Sorry for the confusion I am starting to understand git and the next pull will hopefully be less painful

amueller · 2015-03-20T20:17:11Z

when you do rebase -i, you can mark all but the first commit as squash so they will be resolved to a single commit.

# The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 3 commits. # The first commit's message is: # This is a combination of 4 commits. # The first commit's message is: # This is a combination of 18 commits. # The first commit's message is: replace mutable default argument in _RidgeGCV # This is the 2nd commit message: use tuples instead of mutable default values # This is the 3rd commit message: use tuples instead of mutable default values # This is the 4th commit message: redefinition of alphas is removed from init and added to fit method # This is the 5th commit message: use tuples instead of mutable default values # This is the 6th commit message: joblib is restored to its original version. In ridge.py alphas is not casted to a numpy array anymore # This is the 7th commit message: collections library not required anymore and os it is removed # This is the 8th commit message: use tuples instead of mutable default values # This is the 9th commit message: conflict solved # This is the 10th commit message: update documentation # This is the 11th commit message: documentation updated # This is the 12th commit message: use tuples instead of mutable default values # This is the 13th commit message: solve conflict # This is the 14th commit message: up # This is the 15th commit message: conflict solved # This is the 16th commit message: conflict solved # This is the 17th commit message: remose white lines # This is the 18th commit message: delete white lines # This is the 2nd commit message: use tuples instead of mutable default values # This is the 3rd commit message: use tuples instead of mutable default values # This is the 4th commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: conflict solved # This is the 3rd commit message: use tuples instead of mutable default values # This is the 2nd commit message: documentation updated # This is the 2nd commit message: use tuples instead of mutable default values # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: conflict solved # This is the 2nd commit message: use tuples instead of mutable default values # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method

# The first commit's message is: conflict solved # This is the 2nd commit message: documentation updated # This is the 3rd commit message: remose white lines # This is the 4th commit message: delete white lines

DonBeo · 2015-03-20T21:02:43Z

I did rebase again. The number of commits now is lower. My understanding is that I can not have a single commit for certain addition.

I hope that this if fine now.

coveralls · 2015-03-20T21:16:28Z

Changes Unknown when pulling fb11bc4 on DonBeo:my-feature into * on scikit-learn:master*.

…e mode

[MRG + 1] Prevent nose from using docstring to name the tests in results

amueller · 2015-03-23T16:10:23Z

Can you please look at the commit count shown a the top? It shows 97 commits.
Also, you can do anything within a single commit (apart from a merge, which you should not have).

[MRG+2] FIX make StandardScaler & scale more numerically stable

# The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 3 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 3 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: # This is a combination of 3 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: documentation updated use tuples instead of mutable default values solve conflict up conflict solved conflict solved remose white lines delete white lines use tuples instead of mutable default values use tuples instead of mutable default values redefinition of alphas is removed from init and added to fit method conflict solved use tuples instead of mutable default values documentation updated use tuples instead of mutable default values redefinition of alphas is removed from init and added to fit method conflict solved use tuples instead of mutable default values redefinition of alphas is removed from init and added to fit method conflict solved documentation updated remose white lines delete white lines update documentation replace mutable default argument in _RidgeGCV # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: use tuples instead of mutable default values # This is the 3rd commit message: conflict solved # This is the 2nd commit message: documentation updated # This is the 2nd commit message: remose white lines # This is the 3rd commit message: delete white lines # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: conflict solved # This is the 3rd commit message: use tuples instead of mutable default values # This is the 2nd commit message: documentation updated # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: conflict solved # This is the 2nd commit message: redefinition of alphas is removed from init and added to fit method # This is the 2nd commit message: documentation updated # This is the 3rd commit message: remose white lines # This is the 4th commit message: delete white lines

[MRG+1] FIX LDA(solver="lsqr"): make sure the right error is raised on transform

amueller · 2015-03-23T23:02:42Z

sorry, I'd be happy to help you, but I just missed you on IRC. you can ping me as amueller, then something will bounce on my desktop ;)

coveralls · 2015-03-24T13:12:17Z

Changes Unknown when pulling 6d2c599 on DonBeo:my-feature into * on scikit-learn:master*.

TomDLT · 2015-05-13T11:52:40Z

#4713 merged
This PR can be closed

replace mutable default argument in _RidgeGCV

70c3f7c

use tuples instead of mutable default values

fcab7f6

DonBeo closed this Mar 10, 2015

DonBeo reopened this Mar 10, 2015

amueller changed the title ~~replace mutable default argument in _RidgeGCV~~ [MRG + 1] replace mutable default argument in _RidgeGCV Mar 10, 2015

amueller mentioned this pull request Mar 10, 2015

[MRG + 1] TST test that all default arguments are not mutable #4379

Merged

arjoly reviewed Mar 10, 2015
View reviewed changes

use tuples instead of mutable default values

90780f6

DonBeo added 2 commits March 10, 2015 21:18

redefinition of alphas is removed from init and added to fit method

49b0e04

use tuples instead of mutable default values

a135f7b

arjoly reviewed Mar 11, 2015
View reviewed changes

DonBeo added 3 commits March 11, 2015 15:01

joblib is restored to its original version. In ridge.py alphas is not…

ed19d8e

… casted to a numpy array anymore

collections library not required anymore and os it is removed

e198fc8

test added to verify that both tuples and np.array can be used as alp…

adf45c7

…ha value in ridgeCV

DonBeo added 2 commits March 20, 2015 19:10

rebase

135abd1

removed default mutable argument

2ab83c7

DonBeo added 4 commits March 20, 2015 20:50

# This is a combination of 4 commits.

49cbdd9

# The first commit's message is: conflict solved # This is the 2nd commit message: documentation updated # This is the 3rd commit message: remose white lines # This is the 4th commit message: delete white lines

update documentation

0224ff0

conflict solved

fb11bc4

raghavrv and others added 4 commits March 21, 2015 11:16

MAINT docstring --> comments to prevent nose from using doc in verbos…

cd2ee7e

…e mode

MAINT use yield for cleaner output in verbose mode

f763a34

Merge pull request scikit-learn#4432 from ragv/travis_ignore_docstring

8f67a81

[MRG + 1] Prevent nose from using docstring to name the tests in results

COSMIT missing newlines in metrics test

12b2f16

amueller and others added 8 commits March 23, 2015 11:12

Merge pull request scikit-learn#4436 from ogrisel/rebased-pr-3747

574ebfd

[MRG+2] FIX make StandardScaler & scale more numerically stable

remove default mutable arguments

cfa86e3

Merge pull request scikit-learn#4427 from amueller/lda_lsqr_predict

689388a

[MRG+1] FIX LDA(solver="lsqr"): make sure the right error is raised on transform

remove default mutable arguments

4c2106b

remove default mutable values

5b60532

conflict solved

a50f719

conflict solved

8182408

solved conflict

6d2c599

amueller mentioned this pull request May 12, 2015

[MRG] don't use mutable defaults in RidgeCV. #4713

Merged

arjoly closed this May 13, 2015

DonBeo deleted the my-feature branch May 25, 2015 08:52

[MRG + 1] replace mutable default argument in _RidgeGCV #4376

[MRG + 1] replace mutable default argument in _RidgeGCV #4376

Conversation

DonBeo commented Mar 10, 2015

DonBeo commented Mar 10, 2015

amueller commented Mar 10, 2015

amueller commented Mar 10, 2015

DonBeo commented Mar 10, 2015

DonBeo commented Mar 10, 2015

amueller commented Mar 10, 2015

amueller commented Mar 10, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

arjoly commented Mar 10, 2015

DonBeo commented Mar 10, 2015

amueller commented Mar 10, 2015

DonBeo commented Mar 10, 2015

amueller commented Mar 10, 2015

coveralls commented Mar 10, 2015

DonBeo commented Mar 11, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DonBeo commented Mar 12, 2015

DonBeo commented Mar 20, 2015

amueller commented Mar 20, 2015

DonBeo commented Mar 20, 2015

coveralls commented Mar 20, 2015

amueller commented Mar 23, 2015

amueller commented Mar 23, 2015

coveralls commented Mar 24, 2015

TomDLT commented May 13, 2015