MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast #24895

OmarManzoor · 2022-11-11T13:37:28Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Removed -Wsign-compare warnings when compiling sklearn.linear_model._cd_fast

Any other comments?

…learn.linear_model._cd_fast

OmarManzoor · 2022-11-11T13:41:26Z

@jjerphan For the warning https://gist.github.com/jjerphan/8cfbb5349f5680856460ff80152ab69d#file-full-trace-log-L429
it is basically related to this line of code https://github.com/scikit-learn/scikit-learn/blob/main/sklearn/utils/_random.pxd#L43. It is mentioned here that we should be careful when casting this. Could you kindly guide on this one? Thank you.

jjerphan

Thanks @OmarManzoor for this PR and the observation you made.

I think it is preferable to restrain the scope of this PR to only remove -Wsign-compare warnings for two reasons:

this PR addresses two goals at the moment (removal of two kind of warning). For scikit-learn, we prefer having independent and atomic goals (and changes) is more readable and understanble in logs, and is more easily reviewable.
all the -Woverflow warnings when compiling the whole code-base are coming from the current inline sklearn.utils._random.our_rand_r function, and their removal might better be addressed in a dedicated PR.

Can you please restrain the scope of this PR to the removal of -Wsign-compare warnings solely? Thank you.

Here is a comment to address it better.

jjerphan · 2022-11-14T13:14:42Z

sklearn/linear_model/_cd_fast.pyx

@@ -171,7 +171,7 @@ def enet_coordinate_descent(
        # tol *= np.dot(y, y)
        tol *= _dot(n_samples, &y[0], 1, &y[0], 1)

-        for n_iter in range(max_iter):
+        for n_iter in range(<Py_ssize_t>max_iter):


I think it is preferable to change the type of n_iter to the one of max_iter (i.e. int) instead of casting max_iter to Py_ssize_t here.

As for SGD, max_iter will be validated by the Python class to be strictly positive.
In terms of semantic, it is more correct to use unsigned and the casting will be safe.
It can be done once for all in the signature of the function.

This removes the -Woverflow warnings observed when building scikit-learn. RAND_R_MAX is the max value for uint8, incrementing it causes an overflow (hence the warning). I think this commit fixes the implementation, yet I comes with a backwards incompatible results and tests for implementation relying on `our_rand_r` fails because results are now different. I see several alternatives to remove the warning while having tests pass - prefered solution: adapt the test suite using the new results so that all tests pass and ackowledge the change of behavior for impacted user-facing APIs in the changelog - accept the quirk of this implementation but hardcode and rename the effective constant - silent the -Woverflow warning by another mean Relates to: scikit-learn#13422 scikit-learn#24895

This removes the -Woverflow warnings observed when building scikit-learn. RAND_R_MAX is the max value for uint8, incrementing it causes an overflow (hence the warning). Elements were originaly mentionned in scikit-learn#13422 (comment) but left unreviewed, it seems. I think this commit fixes the implementation, yet I comes with a backwards incompatible results and tests for implementation relying on `our_rand_r` fails because results are now different. I see several alternatives to remove the warning while having tests pass - prefered solution: adapt the test suite using the new results so that all tests pass and ackowledge the change of behavior for impacted user-facing APIs in the changelog - accept the quirk of this implementation but hardcode and rename the effective constant - silent the -Woverflow warning by another mean Relates to: scikit-learn#13422 scikit-learn#24895

This removes the -Woverflow warnings observed when building scikit-learn. RAND_R_MAX is the max value for uint8, incrementing it causes an overflow (hence the warning). Elements were originally mentioned but seem to have been left unreviewed, see: scikit-learn#13422 (comment) I think this commit fixes the implementation, yet I comes with a backwards incompatible results and tests for implementation relying on `our_rand_r` fails because results are now different. I see several alternatives to remove the warning while having tests pass - prefered solution: adapt the test suite using the new results so that all tests pass and ackowledge the change of behavior for impacted user-facing APIs in the changelog - accept the quirk of this implementation but hardcode and rename the effective constant - silent the -Woverflow warning by another mean Relates to: scikit-learn#13422 scikit-learn#24895

This removes the -Woverflow warnings observed when building scikit-learn. RAND_R_MAX is the max value for uint8, incrementing it causes an overflow (hence the warning). Elements were originally mentioned but seem to have been left unreviewed, see: scikit-learn#13422 (comment) I think this commit fixes the implementation, yet I comes with a backwards incompatible results and tests for implementation relying on `our_rand_r` fails because results are now different. I see several alternatives to remove the warning while having tests pass - preferred solution: adapt the test suite using the new results so that all tests pass and acknowledge the change of behavior for impacted user-facing APIs in the change-log - accept the quirk of this implementation but hardcode and rename the effective constant - silent the -Woverflow warning by another mean Relates to: scikit-learn#13422 scikit-learn#24895

jjerphan · 2022-11-14T15:47:54Z

#24919 has been created to treat -Woverflow separately.

jjerphan

LGTM, thank you!

OmarManzoor · 2022-11-16T06:54:13Z

@jjerphan, @glemaitre
Similar to the sgd PR I also changed the max_iter parameter to an unsigned int over here. I also noted that we are returning np.asarray(w) in the functions. I was thinking would it make sense to replace this with w.base?

jjerphan · 2022-11-16T08:18:57Z

Thanks for mentioning that. I think we better remove np.asarray calls (and cnp.ndarray) in the PR removing the warning due to the use of the deprecated NumPy API (via Cython) (-Wcpp).

jjerphan · 2022-11-16T12:37:15Z

@OmarManzoor: I meant that it's better to integrate changes made in 5e66af3 in another PR (one removing -Wcpp warning).

OmarManzoor · 2022-11-16T14:35:03Z

@OmarManzoor: I meant that it's better to integrate changes made in 5e66af3 in another PR (one removing -Wcpp warning).

Oh I understand. However I don't think we will be able to remove the -Wcpp warning from this module completely because the const fused types complication. I think most of the other applicable instances already use memory views?

jjerphan · 2022-11-16T16:47:31Z

We can try to use fused-typed memoryview which aren't const-qualified.

glemaitre

LGTM

MAINT Remove -Wsign-compare and -Woverflow warnings when compiling sk…

33ed77f

…learn.linear_model._cd_fast

github-actions bot added cython module:linear_model labels Nov 11, 2022

glemaitre self-requested a review November 12, 2022 10:05

jjerphan reviewed Nov 14, 2022

View reviewed changes

OmarManzoor changed the title ~~MAINT Remove -Wsign-compare and -Woverflow warnings when compiling sklearn.linear_model._cd_fast~~ MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast Nov 14, 2022

Declare the loop variables as ints

92a5c77

jjerphan mentioned this pull request Nov 14, 2022

MAINT Remove all -Woverflow warnings #24919

Closed

OmarManzoor requested review from jjerphan and removed request for glemaitre November 15, 2022 09:01

jjerphan approved these changes Nov 15, 2022

View reviewed changes

OmarManzoor added 2 commits November 15, 2022 16:11

Merge remote-tracking branch 'upstream/main' into cython_cd_fast

16ea9df

Make the max_iter variable unsigned

935ac9d

Merge remote-tracking branch 'upstream/main' into cython_cd_fast

871c477

jjerphan approved these changes Nov 16, 2022

View reviewed changes

OmarManzoor added 2 commits November 16, 2022 15:38

Replace np.asarray with memory view's base attribute

5e66af3

Merge remote-tracking branch 'upstream/main' into cython_cd_fast

d781509

OmarManzoor added 2 commits November 17, 2022 11:39

Reverted the replacement of .base

0fd135c

Merge remote-tracking branch 'upstream/main' into cython_cd_fast

cc3660e

Merge branch 'main' into cython_cd_fast

e36c2bc

glemaitre self-requested a review November 17, 2022 09:52

glemaitre approved these changes Nov 17, 2022

View reviewed changes

glemaitre merged commit b9137b4 into scikit-learn:main Nov 17, 2022

OmarManzoor deleted the cython_cd_fast branch November 17, 2022 10:04

jjerphan mentioned this pull request Feb 28, 2023

MAINT Remove all Cython, C and C++ compilations warnings #24875

Closed

22 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast #24895

MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast #24895

Uh oh!

OmarManzoor commented Nov 11, 2022 •

edited

Loading

Uh oh!

OmarManzoor commented Nov 11, 2022

Uh oh!

jjerphan left a comment •

edited

Loading

Uh oh!

jjerphan Nov 14, 2022

Uh oh!

glemaitre Nov 15, 2022

Uh oh!

jjerphan commented Nov 14, 2022

Uh oh!

jjerphan left a comment

Uh oh!

OmarManzoor commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

OmarManzoor commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

glemaitre left a comment

Uh oh!

Uh oh!

Uh oh!

MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast #24895

MAINT Remove -Wsign-compare when compiling sklearn.linear_model._cd_fast #24895

Uh oh!

Conversation

OmarManzoor commented Nov 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

OmarManzoor commented Nov 11, 2022

Uh oh!

jjerphan left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jjerphan Nov 14, 2022

Choose a reason for hiding this comment

Uh oh!

glemaitre Nov 15, 2022

Choose a reason for hiding this comment

Uh oh!

jjerphan commented Nov 14, 2022

Uh oh!

jjerphan left a comment

Choose a reason for hiding this comment

Uh oh!

OmarManzoor commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

OmarManzoor commented Nov 16, 2022

Uh oh!

jjerphan commented Nov 16, 2022

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

OmarManzoor commented Nov 11, 2022 •

edited

Loading

jjerphan left a comment •

edited

Loading