[MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section #11328

jeremiedbb · 2018-06-20T13:10:15Z

This PR is just the continued work started in #11180 for the reorganization of the datasets loading utilities
section in the doc discussed in #11083.

There were TODOs left out in #11180 that this PR aims to fill. It's essentially adding/reworking introductions of the new datasets loading utilities sections.

jnothman · 2018-06-20T13:12:34Z

And again, I'm sorry for messing up the merge...

jeremiedbb · 2018-06-20T13:15:21Z

It's really not a big deal, and thank YOU for reviewing all my PRs so fast !

ogrisel

Here are some comments. Also please prefix the titles of your PR with [WIP] (Work In Progress) or [MRG] (Ready to merge) to let the reviewers know whether you are still working on improvements to an early PR or not.

ogrisel · 2018-06-21T12:13:21Z

doc/datasets/index.rst


-*desc*
+They can be loaded using the following functions :


No whitespace before ":" in English.

ogrisel · 2018-06-21T12:15:01Z

doc/datasets/index.rst

@@ -262,6 +277,8 @@ Generators for decomposition
 Loading other datasets
 ======================

+This section gathers different tools to load other kinds of datasets.


This sentence brings very little information I would rather remove it to avoid diluting the actual information with boilerplate English connection sentences.

ogrisel · 2018-06-21T12:16:07Z

doc/datasets/index.rst

-require to download any file from some external website. 
+scikit-learn comes with a few small standard datasets that do not require to 
+download any file from some external website. A description of each dataset is
+available below.


"A description of each dataset is available below." could probably be trimmed to keep the documentation as informative as possible.

ogrisel · 2018-06-21T12:17:11Z

doc/datasets/index.rst

+some contain ``feature_names`` and ``target_names``. See the dataset 
+descriptions below for details.  
+
+**The dataset generation functions.** They can be used to generate controled 


typo: controlled

ogrisel · 2018-06-21T12:18:06Z

doc/datasets/index.rst

@@ -9,49 +9,61 @@ Dataset loading utilities
 The ``sklearn.datasets`` package embeds some small toy datasets
 as introduced in the :ref:`Getting Started <loading_example_dataset>` section.

+This package also features helpers to fetch larger datasets commonly
+used by the machine learning community to benchmark algorithm on data


... to benchmark algorithms ...

qinhanmin2014

LGTM, thanks @jeremiedbb.

qinhanmin2014 · 2018-06-30T08:12:31Z

doc/datasets/index.rst

-interface, returning a tuple ``(X, y)`` consisting of a ``n_samples`` *
+Both loaders and fetchers functions return a dictionary-like object holding 
+at least two items: an array of shape ``n_samples`` * ``n_features`` with 
+key ``data``(except for 20newsgroups) and a numpy array of 


not sure why but there's formatting issue here. See https://25798-843222-gh.circle-artifacts.com/0/doc/datasets/index.html. I guess we should have a blank before the left bracket

I didn't notice, thanks !

qinhanmin2014 · 2018-07-02T11:32:20Z

Thanks @jeremiedbb for the great work :)

fill TODOs in index.rst

e069af9

jeremiedbb mentioned this pull request Jun 20, 2018

Reorganize dataset section in user guide #11083

Closed

empty, travis http error

4232c87

TomDLT added the Documentation label Jun 21, 2018

ogrisel reviewed Jun 21, 2018

View reviewed changes

typos and pruning

fd1a94c

jeremiedbb changed the title ~~DOC: complete PR #11180 for the reorganization of the dataset loading utilities section~~ [MRG] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section Jun 21, 2018

qinhanmin2014 approved these changes Jun 30, 2018

View reviewed changes

qinhanmin2014 changed the title ~~[MRG] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section~~ [MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section Jun 30, 2018

qinhanmin2014 added this to the 0.20 milestone Jun 30, 2018

fix formatting issue

b82ad24

jnothman approved these changes Jul 2, 2018

View reviewed changes

jnothman merged commit b56fa39 into scikit-learn:master Jul 2, 2018

jeremiedbb deleted the doc-dataset-utilites branch July 12, 2018 11:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section #11328

[MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section #11328

Uh oh!

jeremiedbb commented Jun 20, 2018

Uh oh!

jnothman commented Jun 20, 2018

Uh oh!

jeremiedbb commented Jun 20, 2018

Uh oh!

ogrisel left a comment

Uh oh!

ogrisel Jun 21, 2018

Uh oh!

ogrisel Jun 21, 2018

Uh oh!

ogrisel Jun 21, 2018

Uh oh!

ogrisel Jun 21, 2018

Uh oh!

ogrisel Jun 21, 2018

Uh oh!

qinhanmin2014 left a comment

Uh oh!

qinhanmin2014 Jun 30, 2018

Uh oh!

jeremiedbb Jul 2, 2018

Uh oh!

qinhanmin2014 commented Jul 2, 2018

Uh oh!

Uh oh!

Uh oh!

[MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section #11328

[MRG+1] Doc: complete PR #11180 for the reorganization of the dataset loading utilities section #11328

Uh oh!

Conversation

jeremiedbb commented Jun 20, 2018

Uh oh!

jnothman commented Jun 20, 2018

Uh oh!

jeremiedbb commented Jun 20, 2018

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

ogrisel Jun 21, 2018

Choose a reason for hiding this comment

Uh oh!

ogrisel Jun 21, 2018

Choose a reason for hiding this comment

Uh oh!

ogrisel Jun 21, 2018

Choose a reason for hiding this comment

Uh oh!

ogrisel Jun 21, 2018

Choose a reason for hiding this comment

Uh oh!

ogrisel Jun 21, 2018

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 left a comment

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 Jun 30, 2018

Choose a reason for hiding this comment

Uh oh!

jeremiedbb Jul 2, 2018

Choose a reason for hiding this comment

Uh oh!

qinhanmin2014 commented Jul 2, 2018

Uh oh!

Uh oh!