@@ -1932,8 +1932,9 @@ class ShuffleSplit(_UnsupportedGroupCVMixin, BaseShuffleSplit):
Yields indices to split data into training and test sets.
Note: contrary to other cross-validation strategies, random splits
- do not guarantee that all folds will be different, although this is
- still very likely for sizeable datasets.
+ do not guarantee that test sets across all folds will be mutually exclusive,
+ and might include overlapping samples. However, this is still very likely for
+ sizeable datasets.
Read more in the :ref:`User Guide <ShuffleSplit>`.
@@ -2049,6 +2050,11 @@ class GroupShuffleSplit(GroupsConsumerMixin, BaseShuffleSplit):
``LeavePGroupsOut(p=10)`` would be
``GroupShuffleSplit(test_size=10, n_splits=100)``.
+ Contrary to other cross-validation strategies, the random splits
+ do not guarantee that test sets across all folds will be mutually exclusive,
+ and might include overlapping samples. However, this is still very likely for
+ sizeable datasets.
+
Note: The parameters ``test_size`` and ``train_size`` refer to groups, and
not to samples as in :class:`ShuffleSplit`.
@@ -2176,13 +2182,14 @@ class StratifiedShuffleSplit(BaseShuffleSplit):
Provides train/test indices to split data in train/test sets.
- This cross-validation object is a merge of StratifiedKFold and
- ShuffleSplit, which returns stratified randomized folds. The folds
+ This cross-validation object is a merge of :class:`StratifiedKFold` and
+ :class:`ShuffleSplit`, which returns stratified randomized folds. The folds
are made by preserving the percentage of samples for each class.
- Note: like the ShuffleSplit strategy, stratified random splits
- do not guarantee that all folds will be different, although this is
- still very likely for sizeable datasets.
+ Note: like the :class:`ShuffleSplit` strategy, stratified random splits
+ do not guarantee that test sets across all folds will be mutually exclusive,
+ and might include overlapping samples. However, this is still very likely for
+ sizeable datasets.
Read more in the :ref:`User Guide <stratified_shuffle_split>`.
2188
2195
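
The overlap described in the updated notes can be checked with a small script. This is only an illustrative sketch, assuming NumPy and scikit-learn are installed; the helper `count_shared_test_samples` and the 20-sample toy array are hypothetical and not part of this change. It counts how many samples land in more than one test set for `KFold` (which partitions the data) and for `ShuffleSplit` (which draws each test set independently).

```python
import numpy as np
from sklearn.model_selection import KFold, ShuffleSplit

# Toy data: 20 samples with a single feature (illustrative only).
X = np.arange(20).reshape(-1, 1)


def count_shared_test_samples(cv, X):
    """Count samples that appear in the test set of more than one split."""
    counts = np.zeros(len(X), dtype=int)
    for _, test_idx in cv.split(X):
        # Indices are unique within a split, so this adds one per split.
        counts[test_idx] += 1
    return int(np.sum(counts > 1))


# KFold partitions the samples, so no sample is tested twice.
kfold_overlap = count_shared_test_samples(KFold(n_splits=5), X)

# ShuffleSplit draws every test set independently, so repeats are possible.
shuffle_overlap = count_shared_test_samples(
    ShuffleSplit(n_splits=5, test_size=0.2, random_state=0), X
)

print("KFold samples tested more than once:", kfold_overlap)
print("ShuffleSplit samples tested more than once:", shuffle_overlap)
```

`KFold` always reports zero here because its test sets form a partition. The `ShuffleSplit` count depends on the seed and on how `n_splits * test_size` compares with the dataset size, so it is worth trying a few values of `random_state`.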