Stricter coversion of primitive Python types to `System.Object` #1958

lostmsu · 2022-09-30T18:21:43Z

When trying to convert to System.Object, only convert float, bool, and str to corresponding .NET primitives types. Do NOT convert instances of types derived from them.

What does this implement/fix? Explain your changes.

This changes Converter to perform exact type checks when target type is System.Object

Does this close any currently open issues?

fixes #1957

Checklist

Check all those that are applicable and complete.

Make sure to include one or more tests for your change

…, and `str` to corresponding .NET primitive types fixes pythonnet#1957

lostmsu · 2022-09-30T18:34:18Z

This should go into 3.0.0
Did you remove the release branch?

filmor · 2022-09-30T20:44:33Z

3.0.0 ia released already and I disagree with the proposed behavior. I don't get why you are insisting that this is the "correct" way. If a type is derived from another that clearly indicates an "is a" relation, and evidently people got really confused with the behavior we had in the RCs. I totally agree that we should allow overriding this behavior, I disagree with this approach.

lostmsu · 2022-10-01T03:44:18Z

@filmor I don't think you got the scenario. It is not about converting np.float64 to System.Double when the requested type is System.Double which was explicitly addressed in your PR. It is about what to convert to when the requested type is System.Object.

Without this change codecs can't be applied because the conversion to primitives takes priority.

Converting np.float64 to System.Double without Double being the desired type explicitly is also a loss of information, because at the receiving side you can no longer know that the original value was more specific.

lostmsu · 2022-10-05T15:38:20Z

@filmor any comments? Due to the above type information loss either this change needs to be accepted to be able to avoid the issue, or the checks need to move below codec application, which can lead to poorly predictable behavior for codecs that claim to be able to decode float itself.

filmor · 2022-10-05T17:11:27Z

I'm on vacation right now, will check once I'm back. I find it unlikely that I will accept the PR as it stands, but I'll need to be a bit around with it.

lostmsu · 2022-10-07T18:02:27Z

@filmor I think I came up with a good demonstration why this PR should be accepted as is. Without it the following code will work for any numpy value except numpy.float64 because numpy.float64 derives from float but other NumPy types do not, so float64 will be converted to System.Double, which does not have any NumPy members on it.

[ForbidPythonThreads]
dynamic Centroid(dynamic value)  => value - value.mean();

v = numpy.float32(42)
print(Centroid(v)) # OK
v = numpy.float64(42)
# RuntimeBinderException: System.Double does not have .mean()
print(Centroid(v))

Codecs are not involved here, so the conversion should not happen regardless of the state of the codecs.

filmor · 2022-10-07T19:39:44Z

With that argument, we should forbid /any/ conversion of primitive Python types to Object. Maybe this case should just be handled as an attribute?

lostmsu · 2022-10-08T02:09:16Z

@filmor I don't think it applies to primitive types, as

they don't really have that many members specific to Python
they are present in all Python installations and have natural mapping to .NET types
we actually already did that to int for much the same reason, and integer objects in this situation are converted to PyInt precisely because int does not map directly to any of System.Int*

filmor · 2022-10-15T18:54:55Z

I don't think it applies to primitive types, as

they don't really have that many members specific to Python

I'm not sure what you mean by that. How many would be "too many"?

we actually already did that to int for much the same reason, and integer objects in this situation are converted to PyInt precisely because int does not map directly to any of System.Int*

Exactly. And I think instead of breaking working code further, we should just make this an option. There are (to me) two kinds of functions for which this is relevant:

Functions that pass an element through (e.g. an imaginary Dictionary<object, object>.GetOrDefault)
Functions that use Python members and thus want to use a PyObject

For the latter, I don't see any reason why these shouldn't just take a PyObject parameter. I get that dynamic makes things more convenient, but a dynamic Centroid(PyObject obj) { dynamic o = obj; return obj - obj.mean(); } is not a lot more work. For these, an additional attribute would also do nicely, e.g. [DisablePythonnetConversions] or [PythonnetFunction(conversion = None, gil = true)].

For the former, things might be a bit more tricky, but I'd suggest (in general) to just expose more of the conversion machinery publicly and solve this case using a pythonnet.conversion.disable(method_wrapper) call (or a corresponding Python.Runtime call in C#).

lostmsu · 2022-10-18T20:38:30Z

I'm not sure what you mean by that. How many would be "too many"?

float is a primitive type and only has primitive operations implemented. Addition, subtraction, etc. There are maybe a few 10 ops in total, and the set hasn't changed for 10+ years. NumPy's float64 though has over 40 members on top of that. And they add new ones now and then.

For the latter, I don't see any reason why these shouldn't just take a PyObject parameter.

Then you proceed to name a reason. I'd also say that the only good reason to have a System.Object parameter in library code is that you either only ever plan to call .ToString and/or .GetHashCode, or your parameter is actually a dynamic. And since the first scenario is not broken after we added GIL to these members, the only remaining scenario for a System.Object parameter is made worse by this autoconversion.

filmor · 2022-10-19T06:08:36Z

I'm not sure what you mean by that. How many would be "too many"?

float is a primitive type and only has primitive operations implemented. Addition, subtraction, etc. There are maybe a few 10 ops in total, and the set hasn't changed for 10+ years. NumPy's float64 though has over 40 members on top of that. And they add new ones now and then.

That is not the point. float has is_integer, conjugate, etc. If you want to access those via dynamic, it would not work either. Same for all of the str members.

For the latter, I don't see any reason why these shouldn't just take a PyObject parameter.

Then you proceed to name a reason. I'd also say that the only good reason to have a System.Object parameter in library code is that you either only ever plan to call .ToString and/or .GetHashCode, or your parameter is actually a dynamic. And since the first scenario is not broken after we added GIL to these members, the only remaining scenario for a System.Object parameter is made worse by this autoconversion.

And I also give a way to make this work by being more explicit. That is the gist of a lot of the changes for Python.NET 3 (that you also introduced yourself!). There is no way to distinguish dynamic from object parameters, so the proper way to modify this behaviour without breaking existing code is to introduce attributes or runtime configuration.

lostmsu · 2022-10-19T16:57:00Z

And I also give a way to make this work by being more explicit.

Point is why even do it this way in the first place? What is the scenario where float should be passed to System.Object parameter and converted to System.Double?

filmor · 2022-10-19T18:18:29Z

And I also give a way to make this work by being more explicit.

Point is why even do it this way in the first place? What is the scenario where float should be passed to System.Object parameter and converted to System.Double?

We agree here. I just want to not break the behaviour right now, and if we break it, remove all special-casing in one go instead of keeping it for (some) primitives.

If we want to keep things simple, we can make this a runtime configuration parameter.

lostmsu · 2022-10-19T22:42:05Z

@filmor as far as I am concerned, this issue is a consequence of introducing last minute changes to 3.0 before release. And considering its use case rarity, it should be considered a bug in 3.0.0, so should be fixed in 3.0.1. If you are unhappy about interface breakage, 3.0.0 should be yanked.

filmor · 2022-10-20T04:09:45Z

It is not. As said, with the change as it stands, I'm not okay and I think we have discussed it enough now. The correct thing to do would be to not convert at all for object parameters, which is a much bigger breakage (compared to 2.5) and can't be introduced in a patch version. I'll draft a PR to enforce this behaviour with runtime configuration.

lostmsu · 2022-10-20T06:31:56Z

@filmor I don't think we need a global toggle for this. There was another option:

the checks need to move below codec application

That way this could be fixed with a codec.

…s when target type is System.Object useful to be able to change what numpy.float64 is converted to related to pythonnet#1957 this is an alternative to pythonnet#1958

filmor · 2022-10-28T12:51:02Z

Closing this in favour of #1986.

…s when target type is System.Object useful to be able to change what numpy.float64 is converted to related to pythonnet#1957 this is an alternative to pythonnet#1958

when converting to System.Object only convert exact float, bool…

72971b7

…, and `str` to corresponding .NET primitive types fixes pythonnet#1957

lostmsu requested a review from filmor September 30, 2022 18:28

filmor closed this Oct 28, 2022

lostmsu deleted the bugs/FloatMustBeExact branch November 4, 2022 19:55

Stricter coversion of primitive Python types to System.Object #1958

Stricter coversion of primitive Python types to System.Object #1958

Uh oh!

Conversation

lostmsu commented Sep 30, 2022

What does this implement/fix? Explain your changes.

Does this close any currently open issues?

Checklist

Uh oh!

lostmsu commented Sep 30, 2022

Uh oh!

filmor commented Sep 30, 2022

Uh oh!

lostmsu commented Oct 1, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lostmsu commented Oct 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

filmor commented Oct 5, 2022

Uh oh!

lostmsu commented Oct 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

filmor commented Oct 7, 2022

Uh oh!

lostmsu commented Oct 8, 2022

Uh oh!

filmor commented Oct 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lostmsu commented Oct 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

filmor commented Oct 19, 2022

Uh oh!

lostmsu commented Oct 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

filmor commented Oct 19, 2022

Uh oh!

lostmsu commented Oct 19, 2022

Uh oh!

filmor commented Oct 20, 2022

Uh oh!

lostmsu commented Oct 20, 2022

Uh oh!

filmor commented Oct 28, 2022

Uh oh!

Uh oh!

Stricter coversion of primitive Python types to `System.Object` #1958

Stricter coversion of primitive Python types to `System.Object` #1958

lostmsu commented Oct 1, 2022 •

edited

Loading

lostmsu commented Oct 5, 2022 •

edited

Loading

lostmsu commented Oct 7, 2022 •

edited

Loading

filmor commented Oct 15, 2022 •

edited

Loading

lostmsu commented Oct 18, 2022 •

edited

Loading

lostmsu commented Oct 19, 2022 •

edited

Loading