Skip to content

FIX Fixes HistGradientBoosting bug fail when early stopping + no validation + warm starting #16662

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 448 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
448 commits
Select commit Hold shift + click to select a range
5a02de8
ENH Add verbose option to VotingClassifier and VotingRegre… (#16069)
cmarmo Jan 9, 2020
a4ec379
MNT remove unused variables in elkan algorithm (#16082)
kno10 Jan 9, 2020
1c422ca
DOC Update docs to indicate shuffle=True still maintains sample order…
madhuracj Jan 10, 2020
63f2cca
DOC Incorrectly-rendered entries in what's new (#16091)
david-cortes Jan 10, 2020
5609eb4
DOC Rearrange status of the Consortium members (#16089)
cmarmo Jan 10, 2020
e3ee768
DOC Docstring improvements to confusion_matrix and splitters (#16072)
keyianpai Jan 11, 2020
073de7c
FIX Fix string formatting in the error message in CategoricalNB (#16090)
madhuracj Jan 12, 2020
c643a40
DOC Corrects random_state default value in FactorAnalysis (#16098)
miladsade96 Jan 12, 2020
e754903
MNT Uploads coverage in pylatest_pip_openblas_pandas CI (#16092)
thomasjpfan Jan 12, 2020
133d1ce
DOC Add link for detail explanation in glossary/estimato-tags (#14616)
sameshl Jan 12, 2020
dd93e01
DOC Add missing backquote in LogisticRegression docstring (#16106)
johannfaouzi Jan 13, 2020
594fc85
DOC fix docstring in semi_supervised module following doc guideline (…
shubchat Jan 13, 2020
9355b3c
DOC Add Gitter details and remove IRC details (#16107)
madhuracj Jan 13, 2020
4953ec3
MNT remove more deprecations for 0.23 (#15860)
NicolasHug Jan 13, 2020
19479d7
DOC Updates kernel function to be consistent (#16099)
thomasjpfan Jan 13, 2020
d2febf2
ENH Reduce memory footprint in MLP when shuffle activated (#14075)
meyer89 Jan 13, 2020
8e61534
MNT create issue templates with automatic tagging (#16053)
glemaitre Jan 13, 2020
0d99615
Delete ISSUE_TEMPLATE.md
jnothman Jan 13, 2020
fa5817b
MNT create config.yml for new issue templates
jnothman Jan 13, 2020
562c08b
MNT remoe deprecated _refresh_cache usage (joblib related) (#16108)
adrinjalali Jan 14, 2020
54344b1
DOC Correct docstring definition for log_loss function (#16037)
sidgupta234 Jan 14, 2020
b84428e
TST Updates test for deprecation in pandas.SparseArray (#16040)
thomasjpfan Jan 14, 2020
88eadf0
DOC Better docstring and User Guide for PDPs (#16114)
NicolasHug Jan 14, 2020
c4ea377
FIX ensure object array are properly casted when dtype=object (#16076)
alexshacked Jan 15, 2020
999f288
DOC improve docstring of covariance module following doc guideline (#…
mghah Jan 15, 2020
d7c3758
[MRG] BUG Clips log_loss calculation in neural_network (#16117)
thomasjpfan Jan 15, 2020
da6065b
[MRG] Fix documentation of the tol parameter of kmeans (#16125)
madhuracj Jan 17, 2020
856d273
FIX Expose SelectorMixin through sklearn/feature_selection/ (#16132)
trimeta Jan 17, 2020
882a675
MNT remove tag help wanted in doc template (#16122)
glemaitre Jan 17, 2020
00fe3d6
Validate sample weight with check_sample weight in kernel_ridge (#16154)
lithomas1 Jan 20, 2020
2f99272
DOC Documentation Improvement in _stochastic_optimizers.py (#16166)
greatsharma Jan 21, 2020
a89d4a5
DOC add clear default values to estimator tags (#16168)
adrinjalali Jan 21, 2020
4f32360
DOC better doc for calibration_curve (#16172)
NicolasHug Jan 22, 2020
76314fb
MNT Small refactoing of CalibratedClassifier (#16174)
NicolasHug Jan 22, 2020
1e8ea9f
DOC adding CoC to the repo (#16173)
adrinjalali Jan 22, 2020
f6a19a7
DOC Format docstrings in validation module (#16134)
indecisiveuser Jan 22, 2020
ac9c3e1
ENH adding return_centers param in make_blobs (#15709)
venkyyuvy Jan 22, 2020
3f89a41
DOC Mention in FAQ that we will not accept new deep learning features…
qinhanmin2014 Jan 22, 2020
fd12d56
BUG take n_components strictly greater than fraction of explai… (#15669)
krishnachaitanya7 Jan 22, 2020
5c36df6
MNT remove check for deprecated behavior in test.py (#16109)
adrinjalali Jan 24, 2020
1ad8a36
DOC Updates random_state descriptions for sklearn/preprocessing/_data…
Olks Jan 25, 2020
3de415a
DOC Fix links in KernelRidge user guide (#16235)
NicolasHug Jan 26, 2020
c4592ad
MNT Skips failing SpectralCoclustering doctest (#16232)
thomasjpfan Jan 26, 2020
ede7905
[MRG] DOC improve random state docstring in manifold module (#16204)
daphn3k Jan 26, 2020
c23ed60
DOC improve random state docstring in HistGBDT (#16205)
Malesche Jan 26, 2020
1640d8c
DOC improve random state description in gaussian_proccess module (#16…
Jan 26, 2020
4f3878e
DOC improve random state in neighbors module (#16217)
Jan 26, 2020
617206d
DOC improve random state docsting in covariance module (#16218)
daphn3k Jan 26, 2020
ea5e4d0
DOC added feature_importances_ to ExtraTreeRegressor (#16225)
marielledado Jan 26, 2020
4f28ba5
DOC improve docsting following the user guide in mean-shift module (#…
ETay203 Jan 26, 2020
5c99577
DOC improve random state docstring in the _validation module (#16231)
rachelcjordan Jan 26, 2020
3ae671e
ENH Improve error message in clone when passed class instances (#16189)
olicairns Jan 27, 2020
c45721d
ENH check for invalid batch_size in gen_batches (#16181)
bfgray3 Jan 27, 2020
b7c4f4f
MNT Use np.asarray to get numpy data type descriptors for C structs i…
alexhenrie Jan 27, 2020
c2ede74
ENH add warning for pandas sparse Dataframe in check_array (#16021)
rushabh-v Jan 27, 2020
872468f
DOC improve random state docsting for extmath module (#16239)
fabi-cast Jan 27, 2020
3cff0f9
DOC improve random state docsting in permutation_importance (#16215)
mojc Jan 27, 2020
d67647f
DOC improve docstring following doc guideline in model_selection modu…
marielledado Jan 27, 2020
a429bd5
DOC fix default values in isotonic module (#16198)
adrinjalali Jan 27, 2020
3424f72
DOC spelling fixes (#16247)
Jan 28, 2020
a5b6894
MAINT/EXA replaced deprecated NavigationToolbar2TkAgg (#16187)
waelbenamara Jan 28, 2020
002f891
DOC Removing word optional from parameter docstring in pipeline modul…
lopusz Jan 28, 2020
1382831
DOC clarifications on the release process (#15759)
adrinjalali Jan 28, 2020
cc2fbed
BUG max_depth=1 should be decision stump in HistGradientBoosti… (#16182)
SanthoshBala18 Jan 28, 2020
8ea05cb
[MRG] API Deprecates probA_ and probB_ in OneClassSVM and svm.SVR (#1…
thomasjpfan Jan 28, 2020
b92393a
CI Action to auto assign issues (#16197)
lithomas1 Jan 28, 2020
c6fe262
ENH Improve error message for not fitted trees in plot_tree (#16253)
Rick-Mackenbach Jan 29, 2020
1fa689c
[MRG] more informative random_state doc in multilayer_perceptron (#16…
GregoireMialon Jan 29, 2020
c33ce76
DOC fix docstring in dbscan referencing glossary (#16267)
mani2106 Jan 29, 2020
dfe4449
DOC fix conda command to check if compilers and llvm are installed (#…
alonsosilvaallende Jan 29, 2020
20a431f
DOC add a coc reference to the main page (#16262)
adrinjalali Jan 29, 2020
84628b0
ENH use utility _check_sample_weight in _BaseDiscreteNB (#16263)
Batalex Jan 29, 2020
a3ca523
random_state for iforest update (#16259)
SergioDSR Jan 29, 2020
a58c6c8
DOC remove caching in wikipedia_principal_eigenvector example (#16271)
nicolasservel Jan 29, 2020
10e7b2b
DOC Improve random_state descriptions for BaggingClassifier (#16264)
DatenBiene Jan 29, 2020
6a27d4d
DOC Better UG for calibration (#16175)
NicolasHug Jan 29, 2020
df64d1e
DOC ensure all attributes are documented for PassiveAggressiveClassif…
judithabk6 Jan 29, 2020
46bd39d
MNT avoid running doctests locally on Windows (#16269)
Batalex Jan 29, 2020
fe7edc3
Fix typos in doc/developers/develop.rst (#16162)
gholdman1 Jan 29, 2020
605c6fe
[DOC] Make random_state descriptions for AdaBoost (#16278)
DatenBiene Jan 29, 2020
e61fc6c
DOC Fix alpha param of ridge estimators (#16288)
NicolasHug Jan 29, 2020
3109add
[MRG] Fix FutureWarning in plot_partial_dependence_visualization_api.…
ksslng Jan 29, 2020
7d10be4
ENH Changed implementation of Birch.predict to use pairwise_distances…
alexshacked Jan 29, 2020
00841fa
DOC Minor doc update to KernelRidge (#16295)
NicolasHug Jan 29, 2020
6d2b7bc
Fix User guide description for the Linnerud dataset #16294 (#16297)
vandana-iyer Jan 30, 2020
98f0eb8
DOC improve random state docstring in ClassifierChain and RegressorCh…
DatenBiene Jan 30, 2020
e336d80
Specify compilers version to work-around issues on MacOS (#16282)
maskani-moh Jan 30, 2020
323b4a1
DOC Cleaning parameter docstrings in discriminant_analysis (#15761) (…
lopusz Jan 30, 2020
034c021
Fix Typo in random_state for multioutput.py (#16309)
DatenBiene Jan 30, 2020
d2b8542
EXA change max_iter and enable early_stopping to get rid of convergen…
TiphaineV Jan 30, 2020
9c2d889
Update random_state gradient_boosting doc (#16315)
DatenBiene Jan 30, 2020
5ea6a26
ENH use utility _check_sample_weight in IsotonicRegression (#16322)
Batalex Jan 30, 2020
06b1a19
[DOC] Make random_state descriptions for Mixture Models (#16307)
DatenBiene Jan 30, 2020
43fd8c9
DOC exchanged boston for diabetes dataset in plot_cv_predict (#16312)
maikia Jan 30, 2020
932c606
MNT update imputer example to remove FutureWarning (#16302)
glemaitre Jan 30, 2020
e05b9e1
MNT Comment out instructions in bug report template (#16325)
thomasjpfan Jan 30, 2020
70829b3
MNT Small check sample weight refactor (#16324)
thomasjpfan Jan 30, 2020
91badfa
DOC Format docstrings in _coordinate_descent.py (#16285)
hs-nazuna Jan 30, 2020
3743a55
Fix: Possible unnecessary iterations in Newton-CG algorithm (#16266)
rushabh-v Jan 31, 2020
e1c7649
DOC Cleaning parameter spec in docstrings of base module (#16284)
lopusz Jan 31, 2020
528b044
FIX ColumnTransformer.get_feature_names with for empty list of (#15963)
rth Jan 31, 2020
ccc81cc
DOC Improve random_state descriptions for GradientBoosting (#16314)
DatenBiene Jan 31, 2020
05476a1
TST Add test for documentation of the classes_ attribute (#16277)
judithabk6 Jan 31, 2020
83985e9
DOC Sort estimator tags in alphabetical order (#16334)
ksslng Jan 31, 2020
c664df4
DOC ensure all attributes are documented for RBFSampler (#16276)
judithabk6 Jan 31, 2020
2eff916
DOC improve random state descriptions for SVMs (#16316)
DatenBiene Jan 31, 2020
86f47da
Fix warning in approximate_nearest_neighbors.py (#16339)
hs-nazuna Jan 31, 2020
6b646da
fix space missing around parameter in documentation (#16351)
judithabk6 Jan 31, 2020
0396c88
FIX mean_squared_error ignores the `squared` argument if multioutput=…
rushabh-v Feb 1, 2020
8666f57
[MRG] Add requires_positiv_X tag to AdditiveChi2Sampler (#16332)
ksslng Feb 1, 2020
4a18796
DOC Clarify that feature_importance_ are impurity based (#16237)
ysunmi0427 Feb 1, 2020
7c8af44
DOC improved descrition of the linnerud dataset (#16308)
maikia Feb 1, 2020
5234efa
TST Add estimator check for pandas objects (#12218)
mc4229 Feb 1, 2020
afd75e2
DOC Fix minor quirks in multiclass docs (#16356)
JarnoRFB Feb 1, 2020
0db80b9
FIX support of float32 read only input in trees (#16331)
Batalex Feb 1, 2020
406184e
[MRG] DOC update UG and docstrings for isotonic regression (#16234)
NicolasHug Feb 1, 2020
84bc8d3
FIX Arbitrary SVC kernels (#11296)
georgipeev Feb 1, 2020
c91f0c9
TST Add common check for classiffiers / regressors with float32 (#16359)
rth Feb 2, 2020
962ee78
DOC Mark tag multioutput as being used (#16363)
Ankit810 Feb 2, 2020
418e0d5
[MRG] CLN Remove the use of assert_raises in utils/ (#16337)
Batalex Feb 2, 2020
54a09dc
Fix typo: datset -> dataset (#16366)
ckastner Feb 2, 2020
8c8ea9e
MNT Common tests between KDTree and BallTree (#16338)
VarIr Feb 2, 2020
ce91b6a
ENH add as_frame functionality for toy datasets (#15980)
wconnell Feb 2, 2020
01c9ef3
BUG Clips mutual_info_score below (#16362)
thomasjpfan Feb 2, 2020
f2146dd
DOC Cleaning parameter spec in docstrings for ensemble m… (#16330)
lopusz Feb 2, 2020
ad2af90
DOC more informative description of random state in _rbm.py (#16318)
GregoireMialon Feb 3, 2020
daec09e
DOC added the link for the reference in permutation_test_score (#16202)
stareh Feb 3, 2020
636e54a
DOC Added example for plot_confusion_matrix (#16361)
talgatomarov Feb 4, 2020
951ad8d
DOC Update links to CircleCI (#16377)
NicolasHug Feb 4, 2020
63cfc5f
ENH add multioutput support for RFE (#16103)
divyaprabha123 Feb 4, 2020
5bb6913
DOC/FIX indentation in deprecated estimators (#16379)
johannfaouzi Feb 4, 2020
1070355
DOC Adds example for empirical_covariance (#16229)
fraboeni Feb 4, 2020
32cce97
DOC Update random_state entry for dummy / random_projection (#16347)
DatenBiene Feb 5, 2020
6324e40
EXA diabetes instead of Boston dataset for feature selection (#16305)
maikia Feb 5, 2020
54c3a1f
DOC More explicit warnings about impurity based feat. imp. (#16382)
ogrisel Feb 5, 2020
76de883
DOC plot_partial_dependence_visualisation_api exchanged boston for di…
maikia Feb 5, 2020
87a5930
TST Fixes test for California housing (#16389)
thomasjpfan Feb 5, 2020
b8768a0
MNT Adds filters to jinja template (#16133)
thomasjpfan Feb 5, 2020
91261c2
FIX Elkan k-means does not stop if tol=0 (#16075)
kno10 Feb 5, 2020
ae9eaf8
ENH Perform KNN imputation without O(n^2) memory cost
jnothman Feb 6, 2020
0c4252c
Revert "ENH Perform KNN imputation without O(n^2) memory cost"
jnothman Feb 6, 2020
7e7e115
ENH Add 'if_binary' option to drop argument of OneHotEncoder (#16245)
rushabh-v Feb 7, 2020
4a3b436
[DOC] Update random_state descriptions for mutual_info, unsupervised,…
DatenBiene Feb 7, 2020
98b3c7c
FIX max_leaf_node and max_depth interaction in GBDT (#16183)
NicolasHug Feb 7, 2020
3f0b6c0
ENH Improve error message for sparse multilabel-indicator y in Random…
rushabh-v Feb 10, 2020
42706eb
MNT CI dropping python-3.5 (#15106)
adrinjalali Feb 10, 2020
09bd9ee
DOC: Mark the sentence end in `classification_report` (#16411)
jhlegarreta Feb 10, 2020
14e597c
DOC Fix syntax in `model_evaluation` UG (#16410)
jhlegarreta Feb 10, 2020
c79a5b4
DOC promote shallow copy in the docs (#16423)
adrinjalali Feb 10, 2020
97d49f2
BLD Specify build time dependencies via pyproject.toml (#16244)
jeremiedbb Feb 11, 2020
933b4cf
MNT/CI install scikit-image if we test doc on azure (#15065)
adrinjalali Feb 11, 2020
2821abc
DOC add Bunch to public docs and API (#16404)
adrinjalali Feb 11, 2020
32d3335
DOC Docstring example of classifier should import classifier (#16430)
oleksandr-pavlyk Feb 11, 2020
0904058
DOC improve random state docstring in model_selection/split (#15575)
edwardcqian Feb 11, 2020
db85b12
DOC improve the documentation of OneHotEncoder for if_binary (#16428)
glemaitre Feb 11, 2020
62ce1ba
DOC follow doc formatting guideline for module gaussian_process (#16415)
lopusz Feb 12, 2020
4913037
MNT Deprecate public attributes in SGD and PassiveAggresive classes (…
chbrandt Feb 12, 2020
ee6b369
FEA Turn on early stopping in histogram GBDT by default (#14516)
johannfaouzi Feb 12, 2020
a3fad52
DOC Fix Tutorial example code lacking context #13566 (#14675)
ngonthier Feb 12, 2020
1801c4b
ENH Make NMF preserve floating dtype (#16280)
jeremiedbb Feb 12, 2020
d6bb321
MNT Update nightly build URL and release staging config (#16435)
ogrisel Feb 12, 2020
fa9cf22
DOC add examples for Gaussian Processes and improve User Guide (#15221)
corriebar Feb 13, 2020
4d92d0a
DOC Improves readability of kernel functions in SVM docs (#16440)
thomasjpfan Feb 13, 2020
d591d8c
DOC Improve documentation for permutation importance (#16402)
ogrisel Feb 13, 2020
c2e742c
API Deprecates the fit_grid_point function (#16401)
ariepratama Feb 14, 2020
9b39c4c
TST Fix unreachable code in tests (#16110)
VarIr Feb 16, 2020
21686b7
ENH Sample weights for ElasticNet (#15436)
Feb 16, 2020
06c71ec
BUG raise value error when distance matrix is not square and affinity…
simonamaggio Feb 17, 2020
d4a3ac4
MAINT split partial dependence and visualization (#16453)
glemaitre Feb 18, 2020
3e92edb
DOC plot voting regression now uses diabetes dataset (#16387)
maikia Feb 18, 2020
ae0dcdd
BUG Fixes benchmark script for early stopping (#16476)
thomasjpfan Feb 18, 2020
feb80ea
ENH adding fit_params option for RegressorChain (#16111)
venkyyuvy Feb 19, 2020
bbfa2d4
DOC Fix new sphinx warnings (#16483)
cmarmo Feb 19, 2020
11934e1
BUG accept all integer types as n_clusters in Birch (#16484)
jeremiedbb Feb 19, 2020
0cb2ddb
ENH IterativeImputer support array-like max and min (#16403)
narendramukherjee Feb 20, 2020
2467e1b
API make a few estimators' inits kw-only (#16474)
adrinjalali Feb 20, 2020
baa4f07
FIX _get_estimators_indices() in BaseBagging (#16437)
chofchof Feb 20, 2020
7697942
[MRG] new K-means implementation for improved performances (#11950)
jeremiedbb Feb 20, 2020
93776b9
DOC improve docstring following doc guide in module compose (#16416)
lopusz Feb 20, 2020
1c1f273
ENH XFAIL in common tests with estimator tags (#16502)
rth Feb 20, 2020
6464e15
[MRG] Document bounds='fixed' in GP kernels (#16209)
Feb 21, 2020
7f9bf5d
MAINT Clean root of sklearn repo (#16485)
jeremiedbb Feb 21, 2020
78dc597
TST check sample_weight shape added to common tests (#11598)
sergulaydore Feb 21, 2020
c1696ed
TST Sets random state in test_csr_row_norms (#16509)
thomasjpfan Feb 21, 2020
f703b85
ENH Improves efficiency of ColumnTransformer for string keys (#16431)
thomasjpfan Feb 21, 2020
366d103
DOC Fixes link in column transformer (#16516)
thomasjpfan Feb 21, 2020
17e6c6b
DOC remove backticks in plain text in developer guide (#16521)
Feb 22, 2020
7366a5a
BUG fix behaviour in confusion_matrix with with empty array-like as i…
parsons-kyle-89 Feb 23, 2020
0ad7481
BUG ensure that estimator_name is properly stored in the ROC display …
glemaitre Feb 23, 2020
bbb1e3b
BUG ensure that name is properly stored in the precision/recall displ…
glemaitre Feb 23, 2020
68197ff
DOC use make_column_selector inside the mixed type example (#16472)
glemaitre Feb 23, 2020
244d118
ENH Perform KNN imputation without O(n^2) memory cost (#16397)
jnothman Feb 24, 2020
9412f50
ENH Fast PDP for DecisionTreeRegressor and RandomForestRegressor (#15…
NicolasHug Feb 24, 2020
1f87aa4
MNT Removed out of date note about KMeans parallelism on macOS (#16527)
NicolasHug Feb 24, 2020
e24998f
ENH Support sample weights in HGBT (#14696)
adrinjalali Feb 24, 2020
136ef79
TST Checks can now skip test based on estimator tag _xfail_test (#16510)
thomasjpfan Feb 24, 2020
4c29be4
FIX Predicted standard deviation values of Gaussian Processes are onl…
plgreenLIRU Feb 25, 2020
b8c4027
BUG avoid nan variance with sparse input in StandardScaler (#16466)
glemaitre Feb 26, 2020
54cbf42
DOC Note on commit co-authorship when merging PRs (#16550)
rth Feb 26, 2020
ca78d75
DOC Fixed documents that refer to Bunch object #16438 (#16447)
CastaChick Feb 27, 2020
9b1928d
MNT Adds autolabler for modules (#16520)
thomasjpfan Feb 27, 2020
8caa938
API make cluster's __init__ parameters kwonly (#16501)
adrinjalali Feb 27, 2020
496e710
MNT Fix autolabeler by pinning the version (#16566)
thomasjpfan Feb 27, 2020
6da44dd
MAINT Use "Bug: triage" tag for user reports (#16560)
rth Feb 27, 2020
1b00c8e
MNT Removes autolabeler (#16577)
thomasjpfan Feb 28, 2020
e8928e1
ENH Include verbose message when sample_weight is provided. (#16564)
cmarmo Feb 28, 2020
b3e0138
DOC Remove 3.5 from Python badge in README (#16583)
arunavkonwar Feb 28, 2020
b6bbf58
DOC Updated gradient boosting UG (#16178)
NicolasHug Feb 28, 2020
b14d420
move 0.22.2 what's new entries (#16586)
jeremiedbb Feb 28, 2020
5009d11
DOC remind to update the version in maintainers guide (#16589)
jeremiedbb Feb 28, 2020
d205638
MNT Introduction of n_features_in_ attr with _validate_data mtd (#16112)
NicolasHug Feb 29, 2020
eb540f3
DOC Rename clf to regr in SVR examples (#16598)
jumon Feb 29, 2020
1c74490
DOC Adds example for RandomTreesEmbedding (#15202)
timnon Feb 29, 2020
68a639e
MNT rename _parallel_fit_estimator to _fit_single_estimator to reflec…
Feb 29, 2020
d86f8fd
DOC Adds examples to GradientBoostingClassifier and GradientBoostingR…
pspachtholz Feb 29, 2020
94f877b
Fix format of values in confusion matrix plot. (#16159)
Rick-Mackenbach Mar 1, 2020
8868ec7
DOC Add formula for binary balanced accuracy in UG (#16604)
Shiki-H Mar 1, 2020
cd622df
MNT Download and test datasets in cron job (#16348)
VarIr Mar 2, 2020
76ef4e1
DOC Added MLPRegressor and MLPClassifier examples (#15228)
FollonSaxBass Mar 2, 2020
0e4f85f
MNT Periodic adds labels based on module (#16596)
thomasjpfan Mar 2, 2020
6464724
MNT Enables error printing correctly in autolabeler (#16614)
thomasjpfan Mar 2, 2020
e275218
API make __init__ params in covariance kw-only (#16544)
adrinjalali Mar 2, 2020
989579f
MNT Updates autolabeler to use per_page=100 to help with rate… (#16616)
thomasjpfan Mar 2, 2020
e5d7f1d
MNT Reduces github API calls greatly in autolabeler (#16617)
thomasjpfan Mar 2, 2020
72b3041
DOC Capitalize Kaggle as proper noun (#16618)
erictleung Mar 3, 2020
48738f7
in plot_stack_predictors exchanged boston dataset for Ames housing da…
maikia Mar 3, 2020
e6e5811
[MRG] Update What's new for 0.22.2.post1 (#16610)
jeremiedbb Mar 3, 2020
8910b14
TST More stable test_uniform_grid (#16621)
ogrisel Mar 3, 2020
9766acd
DOC Fix "more..." link in "Model selection" section of 'index.… (#16620)
echasnovski Mar 3, 2020
bbcfad8
MNT Uses azure pipelines for scipy-dev (#16603)
thomasjpfan Mar 3, 2020
0f0f29a
MNT refactoring of sgd utilities (#16528)
NicolasHug Mar 3, 2020
c573d80
BLD Removes post from release on website (#16624)
thomasjpfan Mar 3, 2020
7765904
FIX overlapping titles in plot example
TomDLT Mar 4, 2020
91d5ac8
TST Fixes test so that whole test isn't skipped if pandas not… (#16627)
lrjball Mar 4, 2020
dbd7937
DOC improve diabetes dataset description (#16534)
maikia Mar 4, 2020
60b8fb2
TST add test of fit attributes (#16286)
agramfort Mar 4, 2020
69ea066
ENH Minimal Generalized linear models implementation (L2 + lbfgs) (#1…
rth Mar 4, 2020
ea31818
FIX Adress decomposition.PCA mle option problem (#16224)
lschwetlick Mar 4, 2020
1cdf608
DOC add 0.22.2 in website news (#16631)
jeremiedbb Mar 4, 2020
c3969f4
TST Enable california_housing pandas test in cron job (#16547)
rth Mar 4, 2020
df338cd
EXA align lorenz curves between the two examples with GLMs (#16640)
rth Mar 5, 2020
83ec08b
DOC update n_jobs description in DBSCAN (#16615)
adrinjalali Mar 5, 2020
f1acf83
FIX Pass sample_weight when predicting on stacked folds (#16539)
Mar 6, 2020
eb3f5df
BLD Turns off memory_profiler in examples to fix CircleCI (#16629)
thomasjpfan Mar 9, 2020
018c6dc
BLD Updates osx vm image in azure pipelines (#16647)
thomasjpfan Mar 9, 2020
b189bf6
FIX: normalizer l_inf should take maximum of absolute values (#16633)
maurapintor Mar 10, 2020
f763c61
ENH Add check for non binary variables in OneHotEncoder. (#16585)
cmarmo Mar 10, 2020
ae159ec
DOC Update LICENSE Year (#16660)
merrcury Mar 10, 2020
535ef55
BUG Fix issue with KernelPCA.inverse_transform (#16655)
lrjball Mar 10, 2020
9135e0b
BUG Fixes histgradientboosting bug
thomasjpfan Mar 10, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
2 changes: 1 addition & 1 deletion .binder/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
--find-links https://sklearn-nightly.scdn8.secure.raxcdn.com scikit-learn
--extra-index https://pypi.anaconda.org/scipy-wheels-nightly/simple scikit-learn
--pre
matplotlib
scikit-image
Expand Down
12 changes: 6 additions & 6 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,17 +9,17 @@ jobs:
- MKL_NUM_THREADS: 2
- MINICONDA_PATH: ~/miniconda
- CONDA_ENV_NAME: testenv
- PYTHON_VERSION: 3.5
- NUMPY_VERSION: 1.11.0
- SCIPY_VERSION: 0.17.0
- MATPLOTLIB_VERSION: 1.5.1
- PYTHON_VERSION: 3.6
- NUMPY_VERSION: 1.13.3
- SCIPY_VERSION: 0.19.1
- MATPLOTLIB_VERSION: 2.1.1
# on conda, this is the latest for python 3.5
# The following places need to be in sync with regard to Cython version:
# - .circleci config file
# - sklearn/_build_utils/__init__.py
# - advanced installation guide
- CYTHON_VERSION: 0.28.5
- SCIKIT_IMAGE_VERSION: 0.12.3
- SCIKIT_IMAGE_VERSION: 0.13
steps:
- checkout
- run: ./build_tools/circle/checkout_merge_commit.sh
Expand Down Expand Up @@ -101,7 +101,7 @@ jobs:

pypy3:
docker:
- image: pypy:3.6-7.1.1
- image: pypy:3.6-7.2.0
steps:
- restore_cache:
keys:
Expand Down
10 changes: 10 additions & 0 deletions .github/ISSUE_TEMPLATE/blank_template.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
name: Other
about: For all other issues to reach the community...
title: ''
labels: ''
assignees: ''

---


36 changes: 25 additions & 11 deletions ISSUE_TEMPLATE.md → .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -1,27 +1,36 @@
---
name: Bug report
about: Create a report to help us reproduce and correct the bug
title: ''
labels: 'Bug: triage'
assignees: ''

---

<!--
If your issue is a usage question, submit it here instead:
- StackOverflow with the scikit-learn tag: https://stackoverflow.com/questions/tagged/scikit-learn
- Mailing List: https://mail.python.org/mailman/listinfo/scikit-learn
For more information, see User Questions: http://scikit-learn.org/stable/support.html#user-questions
Before submitting a bug, please make sure the issue hasn't been already
addressed by searching through the past issues.
-->

<!-- Instructions For Filing a Bug: https://github.com/scikit-learn/scikit-learn/blob/master/CONTRIBUTING.md#filing-bugs -->

#### Description
<!-- Example: Joblib Error thrown when calling fit on LatentDirichletAllocation with evaluate_every > 0-->
#### Describe the bug
<!--
A clear and concise description of what the bug is.
-->

#### Steps/Code to Reproduce
<!--
Please add a minimal example that we can reproduce the error by running the
code. Be as succinct as possible, do not depend on external data. In short, we
are going to copy-paste your code and we expect to get the same
result as you.

Example:
```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = ["Help I have a bug" for i in range(1000)]

vectorizer = CountVectorizer(input=docs, analyzer='word')
lda_features = vectorizer.fit_transform(docs)

lda_model = LatentDirichletAllocation(
n_topics=10,
learning_method='online',
Expand All @@ -34,6 +43,10 @@ If the code is too long, feel free to put it in a public gist and link
it in the issue: https://gist.github.com
-->

```
Sample code to reproduce the problem
```

#### Expected Results
<!-- Example: No error is thrown. Please paste or describe the expected results.-->

Expand All @@ -51,6 +64,7 @@ import sys; print("Python", sys.version)
import numpy; print("NumPy", numpy.__version__)
import scipy; print("SciPy", scipy.__version__)
import sklearn; print("Scikit-Learn", sklearn.__version__)
import imblearn; print("Imbalanced-Learn", imblearn.__version__)
-->


Expand Down
1 change: 1 addition & 0 deletions .github/ISSUE_TEMPLATE/config.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
blank_issues_enabled: false
20 changes: 20 additions & 0 deletions .github/ISSUE_TEMPLATE/doc_improvement.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
name: Documentation improvement
about: Create a report to help us improve the documentation. Alternatively you can just open a pull request with the suggested change.
title: ''
labels: Documentation
assignees: ''

---

#### Describe the issue linked to the documentation

<!--
Tell us about the confusion introduced in the documentation.
-->

#### Suggest a potential alternative/fix

<!--
Tell us how we could improve the documentation in this regard.
-->
22 changes: 22 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
name: Feature request
about: Suggest a new algorithm, enhancement to an existing algorithm, etc.
title: ''
labels: New Feature
assignees: ''

---

<!--
If you want to propose a new algorithm, please refer first to the scikit-learn
inclusion criterion:
https://scikit-learn.org/stable/faq.html#what-are-the-inclusion-criteria-for-new-algorithms
-->

#### Describe the workflow you want to enable

#### Describe your proposed solution

#### Describe alternatives you've considered, if relevant

#### Additional context
20 changes: 20 additions & 0 deletions .github/ISSUE_TEMPLATE/usage_question.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
name: Usage question
about: If you have a usage question
title: ''
labels: Question
assignees: ''

---

<!--
**If your issue is a usage question, please submit it in one of these other channels instead:**
- **StackOverflow with the scikit-learn tag: https://stackoverflow.com/questions/tagged/scikit-learn**
- **Mailing List: https://mail.python.org/mailman/listinfo/scikit-learn**
- **Gitter: https://gitter.im/scikit-learn/scikit-learn**
- **For more information, see User Questions: http://scikit-learn.org/stable/support.html#user-question**

The issue tracker is used only to report issues and feature requests. For
questions, please use either of the above platforms. Most question issues are
closed without an answer on this issue tracker. Thanks for your understanding.
-->
80 changes: 80 additions & 0 deletions .github/labeler.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,80 @@
module:cluster:
- sklearn/cluster/**/*

module:common:
- sklearn/common/**/*

module:compose:
- sklearn/compose/**/*

module:covariance:
- sklearn/covariance/**/*

module:cross_decomposition:
- sklearn/cross_decomposition/**/*

module:datasets:
- sklearn/datasets/**/*

module:decomposition:
- sklearn/decomposition/**/*

module:ensemble:
- sklearn/ensemble/**/*

module:feature_extraction:
- sklearn/feature_extraction/**/*

module:feature_selection:
- sklearn/feature_selection/**/*

module:gaussian_process:
- sklearn/gaussian_process/**/*

module:impute:
- sklearn/impute/**/*

module:inspection:
- sklearn/inspection/**/*

module:linear_model:
- sklearn/linear_model/**/*

module:manifold:
- sklearn/manifold/**/*

module:metrics:
- sklearn/metrics/**/*

module:mixture:
- sklearn/mixture/**/*

module:model_selection:
- sklearn/model_selection/**/*

module:naive_bayes:
- sklearn/naive_bayes.py

module:neighbors:
- sklearn/neighbors/**/*

module:neural_network:
- sklearn/neural_network/**/*

module:pipeline:
- sklearn/pipeline.py

module:preprocessing:
- sklearn/preprocessing/**/*

module:semi_supervised:
- sklearn/semi_supervised/**/*

module:svm:
- sklearn/svm/**/*

module:tree:
- sklearn/tree/**/*

module:utils:
- sklearn/utils/**/*
16 changes: 16 additions & 0 deletions .github/workflows/assign.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@

name: Assign
on:
issue_comment:
types: created

jobs:
one:
runs-on: ubuntu-latest
steps:
- if: github.event.comment.body == 'take'
name:
run: |
echo "Assigning issue ${{ github.event.issue.number }} to ${{ github.event.comment.user.login }}"
curl -H "Authorization: token ${{ secrets.GITHUB_TOKEN }}" -d '{"assignees": ["${{ github.event.comment.user.login }}"]}' https://api.github.com/repos/${{ github.repository }}/issues/${{ github.event.issue.number }}/assignees
curl -H "Authorization: token ${{ secrets.GITHUB_TOKEN }}" -X "DELETE" https://api.github.com/repos/${{ github.repository }}/issues/${{ github.event.issue.number }}/labels/help%20wanted
14 changes: 14 additions & 0 deletions .github/workflows/labeler.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
name: "Pull Request Labeler"
on:
schedule:
- cron: "*/10 * * * *"

jobs:
triage:
runs-on: ubuntu-latest
steps:
- uses: thomasjpfan/labeler@v2.4.6
if: github.repository == 'scikit-learn/scikit-learn'
with:
repo-token: "${{ secrets.GITHUB_TOKEN }}"
max-labels: "3"
26 changes: 26 additions & 0 deletions .github/workflows/twitter.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
# Tweet the URL of a commit on @sklearn_commits whenever a push event
# happens on the master branch
name: Twitter Push Notification


on:
push:
branches:
- master


jobs:
tweet:
name: Twitter Notification
runs-on: ubuntu-latest
steps:
- name: Tweet URL of last commit as @sklearn_commits
if: github.repository == 'scikit-learn/scikit-learn'
uses: xorilog/twitter-action@0.1
with:
args: "-message \"https://github.com/scikit-learn/scikit-learn/commit/${{ github.sha }}\""
env:
TWITTER_CONSUMER_KEY: ${{ secrets.TWITTER_CONSUMER_KEY }}
TWITTER_CONSUMER_SECRET: ${{ secrets.TWITTER_CONSUMER_SECRET }}
TWITTER_ACCESS_TOKEN: ${{ secrets.TWITTER_ACCESS_TOKEN }}
TWITTER_ACCESS_SECRET: ${{ secrets.TWITTER_ACCESS_SECRET }}
14 changes: 14 additions & 0 deletions .github/workflows/unassign.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
name: Unassign
#Runs when a contributor has unassigned themselves from the issue and adds 'help wanted' and 'stalled' tags
on:
issues:
types: unassigned

jobs:
one:
runs-on: ubuntu-latest
steps:
- name:
run: |
echo "Marking issue ${{ github.event.issue.number }} as stalled"
curl -H "Authorization: token ${{ secrets.GITHUB_TOKEN }}" -d '{"labels": ["help wanted","Stalled"]}' https://api.github.com/repos/${{ github.repository }}/issues/${{ github.event.issue.number }}/labels
5 changes: 0 additions & 5 deletions .landscape.yml

This file was deleted.

16 changes: 16 additions & 0 deletions CODE_OF_CONDUCT.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# Code of Conduct

We are a community based on openness and friendly, didactic, discussions.

We aspire to treat everybody equally, and value their contributions.

Decisions are made based on technical merit and consensus.

Code is not the only way to help the project. Reviewing pull requests,
answering questions to help others on mailing lists or issues, organizing and
teaching tutorials, working on the website, improving the documentation, are
all priceless contributions.

We abide by the principles of openness, respect, and consideration of others of
the Python Software Foundation: https://www.python.org/psf/codeofconduct/

2 changes: 1 addition & 1 deletion COPYING
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
New BSD License

Copyright (c) 2007–2019 The scikit-learn developers.
Copyright (c) 2007–2020 The scikit-learn developers.
All rights reserved.


Expand Down
1 change: 1 addition & 0 deletions MANIFEST.in
Original file line number Diff line number Diff line change
Expand Up @@ -5,3 +5,4 @@ recursive-include sklearn *.c *.h *.pyx *.pxd *.pxi *.tp
recursive-include sklearn/datasets *.csv *.csv.gz *.rst *.jpg *.txt *.arff.gz *.json.gz
include COPYING
include README.rst
include pyproject.toml
Loading