[MRG] documentation for random_state in forest.py #15516

MDouriez · 2019-11-02T22:58:50Z

Reference Issues/PRs

part of this issue: #15222
(related to #15264)

What does this implement/fix? Explain your changes.

This PR improves the documentation for RandomForestClassifier, RandomForestRegressor, ExtraTreesClassifier and ExtraTreesRegressor regarding the source of randomness. The only changes were made in the docstrings of sklearn/ensemble/_forest.py.

Any other comments?

eickenberg · 2019-11-02T23:01:06Z

Looks good to me pending CI

MDouriez · 2019-11-16T21:25:34Z

@NicolasHug Let me know what you think. Thanks!

cmarmo · 2019-11-17T15:14:15Z

Also @TomDLT or maybe @adrinjalali (seems to me that you volunteered to be pinged... ;) )... this PR looks pretty ready for merging?

adrinjalali · 2019-11-18T09:36:06Z

Thanks for the ping @cmarmo

I like the description, but I'm not sure if the part which is related to the randomness in trees should be included here or not. What do you think of adding a link to the tree docstring intead for that part @MDouriez ?

NicolasHug

This is a clear improvement, thanks @MDouriez . Made some comments but LGTM when addressed

NicolasHug · 2019-11-18T13:20:21Z

sklearn/ensemble/_forest.py

+        Also note that the features are always randomly permuted at each split.
+        Therefore, the best found split may vary, even with the same training
+        data, ``max_features=n_features`` and ``bootstrap=False``, if the
+        improvement of the criterion is identical for several splits enumerated
+        during the search of the best split. To obtain a deterministic
+        behaviour during fitting, ``random_state`` has to be fixed.


I think we can omit this last part (from "Also note that..." to "has to be fixed").

The rest above is perfect

This note was already there. I just moved it. Should I still remove it?

I see, maybe just leave it where it was in the notes then

sklearn/ensemble/_forest.py

MDouriez · 2019-11-19T04:27:01Z

Should be ready for final review @NicolasHug @adrinjalali.

Left the note where it was
Added line for rendering
Added description for RandomTreesEmbedding as well

MDouriez · 2019-11-19T05:50:40Z

Also interestingly, RandomTreesEmbedding has a max_samples parameter but no bootstrap. In the __init__, bootstrap is set to False. Looks like max_samples is never used?

NicolasHug

Thanks @MDouriez !

NicolasHug · 2019-11-19T13:52:58Z

Also interestingly, RandomTreesEmbedding has a max_samples parameter but no bootstrap. In the init, bootstrap is set to False. Looks like max_samples is never used?

Good catch, could you please open an issue for this?

adrinjalali

Thanks @MDouriez

MDouriez · 2019-11-20T06:38:00Z

filed an issue #15670

Also interestingly, RandomTreesEmbedding has a max_samples parameter but no bootstrap. In the init, bootstrap is set to False. Looks like max_samples is never used?

Good catch, could you please open an issue for this?

* documentation for random_state in forests * move note to parameter * same for RandomForestRegressor * add doc for ExtraTreesRegressor and ExtraTreesClassifier * skip line * lint * move note back to where it was * add Glossary in RandomForestRegressor * adding description for RandomTreesEmbedding * small fix * correct description for RandomTreesEmbedding

documentation for random_state in forests

ca334af

TomDLT added the Documentation label Nov 2, 2019

MDouriez added 3 commits November 2, 2019 16:01

move note to parameter

70d63d1

same for RandomForestRegressor

8ff9ed8

add doc for ExtraTreesRegressor and ExtraTreesClassifier

48ed17a

MDouriez changed the title ~~[WIP] documentation for random_state in forests~~ [MRG] documentation for random_state in forests Nov 2, 2019

MDouriez changed the title ~~[MRG] documentation for random_state in forests~~ [MRG] documentation for random_state in forest.py Nov 2, 2019

NicolasHug reviewed Nov 18, 2019

View reviewed changes

Marie Douriez and others added 7 commits November 18, 2019 09:18

skip line

07204b3

lint

07ae312

move note back to where it was

3d810f9

add Glossary in RandomForestRegressor

27e5b62

adding description for RandomTreesEmbedding

c8fd235

Merge branch 'master' into 15222_forest

04b6ec3

small fix

bcdd5de

correct description for RandomTreesEmbedding

5c8c098

NicolasHug approved these changes Nov 19, 2019

View reviewed changes

adrinjalali approved these changes Nov 19, 2019

View reviewed changes

adrinjalali merged commit 663d052 into scikit-learn:master Nov 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG] documentation for random_state in forest.py #15516

[MRG] documentation for random_state in forest.py #15516

Uh oh!

MDouriez commented Nov 2, 2019 •

edited

Loading

Uh oh!

eickenberg commented Nov 2, 2019

Uh oh!

MDouriez commented Nov 16, 2019

Uh oh!

cmarmo commented Nov 17, 2019

Uh oh!

adrinjalali commented Nov 18, 2019

Uh oh!

NicolasHug left a comment

Uh oh!

NicolasHug Nov 18, 2019

Uh oh!

MDouriez Nov 18, 2019

Uh oh!

NicolasHug Nov 18, 2019

Uh oh!

Uh oh!

MDouriez commented Nov 19, 2019

Uh oh!

MDouriez commented Nov 19, 2019

Uh oh!

NicolasHug left a comment

Uh oh!

NicolasHug commented Nov 19, 2019

Uh oh!

adrinjalali left a comment

Uh oh!

MDouriez commented Nov 20, 2019

Uh oh!

Uh oh!

Uh oh!

[MRG] documentation for random_state in forest.py #15516

[MRG] documentation for random_state in forest.py #15516

Uh oh!

Conversation

MDouriez commented Nov 2, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

eickenberg commented Nov 2, 2019

Uh oh!

MDouriez commented Nov 16, 2019

Uh oh!

cmarmo commented Nov 17, 2019

Uh oh!

adrinjalali commented Nov 18, 2019

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 18, 2019

Choose a reason for hiding this comment

Uh oh!

MDouriez Nov 18, 2019

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 18, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MDouriez commented Nov 19, 2019

Uh oh!

MDouriez commented Nov 19, 2019

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

NicolasHug commented Nov 19, 2019

Uh oh!

adrinjalali left a comment

Choose a reason for hiding this comment

Uh oh!

MDouriez commented Nov 20, 2019

Uh oh!

Uh oh!

MDouriez commented Nov 2, 2019 •

edited

Loading