-
-
Notifications
You must be signed in to change notification settings - Fork 25.8k
Better documentation for random_state #15222
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Just had a brief look at this. It seems that approporiate documentation is already in place, for e.g. these two: |
+1 for this, thanks for the report @mschaffenroth! It was already done for sklearn/svm/classes.py - 90, 544 and 750. See issue #9497 and PR #9703. For the svm module, it thus only remains to do |
svm/_base.py 852 and svm/_classes.py 310 as detailed in scikit-learn#15222.
Hi, I am new to contributing and would like to help out with this. I have done the two instances mentioned by @albertcthomas in the svm module above and can continue working through the above list. |
I think it would be helpful, @NicolasHug, to give some examples of what this should look like. Thanks for continuing this work. |
closing in favour of #10548 |
Sort of like #14228, but for random_state.
For any public object that accepts a
random_state
parameter, we should document what parts of the algorithm are randomized. It's not always obvious what is and what isn't randomized. We should also always link to the glossary, where the different possible values ofrandom_state
are clearly explained.For example for the random forest estimators, it would be helpful to indicate that
random_state
determines in particular the subsampling of the samples and the subsampling of the features. Something like:The text was updated successfully, but these errors were encountered: