ENH Add a retry mechanism in fetch_openml #21901

Rileran · 2021-12-06T18:08:27Z

Reference Issues/PRs

Fixes #21397
Retry mechanism for other fetch_* functions : #21691

What does this implement/fix? Explain your changes.

Added a retry mechanism for the function fetch_openml in case of a network error.

This is done by adding n_retries and delay arguments to the function. (similar to #21691).

If a call to urlopen would result in a network error (URLError), we will call the function again, up to n_retries times and with a delay between each call.

Any other comments?

As this is specific to OpenML, the retry mechanism will be bypassed if the network error has a status code 412, as it is the generic error returned by the OpenML API.

ogrisel · 2021-12-10T08:51:01Z

I have not checked this PR but I recently discovered that there is already a "retry once" mechanism named _retry_with_clean_cache.

I am not sure it plays well with the new concurrency aware download and cache mechanism of #21833 though.

thomasjpfan · 2021-12-10T14:40:18Z

The concurrency aware download works as expected with _retry_with_clean_cache. If _open_openml_url fails for a non-HTTPError reason, _retry_with_clean_cache will try again. The purpose of _retry_with_clean_cache was to catch errors from data corruption.

As for this PR, _retry_on_network_error will work with _retry_with_clean_cache, as long as we update:

scikit-learn/sklearn/datasets/_openml.py

Lines 62 to 63 in 6c9f165

    
           except HTTPError: 
        
               raise

to catch URLError (which is the parent class of HTTPError). This way _retry_on_network_error can retry and if it fails _retry_with_clean_cache will not retry and reraise the error.

Rileran · 2021-12-12T14:00:27Z

Thank you @thomasjpfan for clarifying the purpose of those two decorators. I have applied the correct changes and now _retry_with_clean_cache should re raise the error if the function fails because of a network error.

I have looked into the failed pipeline and I am not sure about the trouble. The error message caught by pytest look like one from another test.

+ unclosed file <_io.FileIO name='/private/var/folders/24/8k48jl6d249_n_qfxwsl6xvm0000gn/T/pytest-of-runner/pytest-0/popen-gw2/test_fetch_openml_verify_check0/test_invalid_checksum.arff' mode='rb' closefd=True>

Is it possible that another test is failing to close a file and that the error message is caught by our test ? I am not very familiar with pytest and therefore am having trouble isolating the problem.

thomasjpfan

Thank you for the update @Rileran!

I opened #22005 to address the FileIO issue.

thomasjpfan · 2021-12-16T22:03:47Z

sklearn/datasets/tests/test_openml.py

+        for r in record:
+            assert (
+                r.message.args[0]
+                == "A network error occured while downloading a file. Retrying..."
+            )
+        assert len(record) == 3


Since match= it set in the context manager, the message is already matched. Here only need to check the number of retries:

assert len(record) == 3

Corrected in last commit. Thanks a lot for reviewing my work !

ogrisel · 2021-12-17T09:07:09Z

I merged #22005 in main, so it should be possible to merge an updated main into this branch to get a clean CI.

ogrisel

LGTM once the comments below and the one remaining by @thomasjpfan (https://github.com/scikit-learn/scikit-learn/pull/21901/files#r770951394) are addressed.

sklearn/datasets/_openml.py

ogrisel

Thanks!

sklearn/datasets/tests/test_openml.py

thomasjpfan

Thank you for the update @Rileran!

A minor comment, otherwise LGTM!

sklearn/datasets/_openml.py

FIX Changed default parameter type to float Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

github-actions bot added the module:datasets label Dec 6, 2021

thomasjpfan mentioned this pull request Dec 17, 2021

MNT Closes corrupted file correctly in test_fetch_openml_verify_checksum #22005

Merged

thomasjpfan reviewed Dec 17, 2021

View reviewed changes

ogrisel approved these changes Dec 17, 2021

View reviewed changes

sklearn/datasets/_openml.py Outdated Show resolved Hide resolved

sklearn/datasets/_openml.py Show resolved Hide resolved

Rileran force-pushed the feat/retry-in-fetch-openml branch from 5e14d8e to a48d74c Compare December 17, 2021 09:09

ogrisel approved these changes Dec 17, 2021

View reviewed changes

ogrisel reviewed Dec 17, 2021

View reviewed changes

sklearn/datasets/tests/test_openml.py Outdated Show resolved Hide resolved

Rileran force-pushed the feat/retry-in-fetch-openml branch from 9460520 to 46f6190 Compare December 17, 2021 10:59

thomasjpfan approved these changes Dec 17, 2021

View reviewed changes

sklearn/datasets/_openml.py Outdated Show resolved Hide resolved

thomasjpfan changed the title ~~[MRG] Add a retry mechanism in fetch_openml~~ ENH Add a retry mechanism in fetch_openml Dec 17, 2021

Rileran added 6 commits December 22, 2021 11:37

ENH fetch_openml function retries on network error

231782b

DOC added fetch_open retry mechanism entry to v1.1

40273c4

fix: _retry_with_clean_cache now throws exceptions on URLError.

9c66b8c

ENH add url to network error on fetch_openml

bd5bf37

FIX avoid interpreting url as regex ops

7515462

FIX documentation merge conflict

a8f3a21

Rileran force-pushed the feat/retry-in-fetch-openml branch 2 times, most recently from a6f9302 to 6691632 Compare December 22, 2021 10:45

Update sklearn/datasets/_openml.py

d6ea453

FIX Changed default parameter type to float Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Rileran force-pushed the feat/retry-in-fetch-openml branch from 6691632 to d6ea453 Compare December 22, 2021 10:51

thomasjpfan merged commit 0882bd3 into scikit-learn:main Dec 22, 2021

venkyyuvy pushed a commit to venkyyuvy/scikit-learn that referenced this pull request Jan 1, 2022

ENH Add a retry mechanism in fetch_openml (scikit-learn#21901)

04c1584

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

lesteve mentioned this pull request May 13, 2022

Fix OpenML timeout #23358

Merged

lesteve mentioned this pull request Feb 22, 2024

ENH Add retry mechanism to fetch_xx functions. #28160

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

ENH Add a retry mechanism in fetch_openml #21901

ENH Add a retry mechanism in fetch_openml #21901

Uh oh!

Rileran commented Dec 6, 2021

Uh oh!

ogrisel commented Dec 10, 2021

Uh oh!

thomasjpfan commented Dec 10, 2021

Uh oh!

Rileran commented Dec 12, 2021 •

edited

Loading

Uh oh!

thomasjpfan left a comment

Uh oh!

thomasjpfan Dec 16, 2021

Uh oh!

Rileran Dec 17, 2021

Uh oh!

ogrisel commented Dec 17, 2021

Uh oh!

ogrisel left a comment

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Uh oh!

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ENH Add a retry mechanism in fetch_openml #21901

ENH Add a retry mechanism in fetch_openml #21901

Uh oh!

Conversation

Rileran commented Dec 6, 2021

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

ogrisel commented Dec 10, 2021

Uh oh!

thomasjpfan commented Dec 10, 2021

Uh oh!

Rileran commented Dec 12, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

thomasjpfan Dec 16, 2021

Choose a reason for hiding this comment

Uh oh!

Rileran Dec 17, 2021

Choose a reason for hiding this comment

Uh oh!

ogrisel commented Dec 17, 2021

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Rileran commented Dec 12, 2021 •

edited

Loading