Fix conversion of values in libtorch agnostic tests #155115

AlekseiNikiforovIBM · 2025-06-04T11:50:11Z

Due to different byteorder,
when copying data, it has to be put into last bytes to ensure that int32_t converted to int64_t keeps same value. Same has to be done when it's converted back.

This change fixes test
TestLibtorchAgnosticCPU::test_my_ones_like_cpu
from
cpp_extensions/libtorch_agnostic_extension/test/test_libtorch_agnostic.py on s390x.

pytorch-bot · 2025-06-04T11:50:16Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155115

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 5e7fe14 with merge base ab65581 ():

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu, unstable) (gh)
MISSING REGRESSION TEST

This comment was automatically generated by Dr. CI and updates every 15 minutes.

let someone else look at it

AlekseiNikiforovIBM · 2025-06-23T12:37:08Z

@pytorchbot rebase

pytorchmergebot · 2025-06-23T12:38:34Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-06-23T12:38:37Z

Successfully rebased s390x_libtorch_agnostic onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout s390x_libtorch_agnostic && git pull --rebase)

AlekseiNikiforovIBM · 2025-06-24T08:30:51Z

Could you please take a look at this PR again? I'm looking at docker build failure separately, it is not relevant to this change.

AlekseiNikiforovIBM · 2025-07-01T13:46:06Z

@pytorchbot rebase

pytorchmergebot · 2025-07-01T13:47:38Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-07-01T13:47:41Z

Successfully rebased s390x_libtorch_agnostic onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout s390x_libtorch_agnostic && git pull --rebase)

AlekseiNikiforovIBM · 2025-07-08T09:11:23Z

@pytorchbot rebase

pytorchmergebot · 2025-07-08T09:12:53Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-07-08T09:12:56Z

Successfully rebased s390x_libtorch_agnostic onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout s390x_libtorch_agnostic && git pull --rebase)

Due to different byteorder, when copying data, it has to be put into last bytes to ensure that int32_t converted to int64_t keeps same value. Same has to be done when it's converted back. This change fixes test TestLibtorchAgnosticCPU::test_my_ones_like_cpu from cpp_extensions/libtorch_agnostic_extension/test/test_libtorch_agnostic.py on s390x.

…agnostic test on s390x

AlekseiNikiforovIBM · 2025-07-09T08:32:41Z

Could you please take a look at this change again?

AlekseiNikiforovIBM · 2025-08-04T09:19:51Z

@pytorchbot merge

pytorchmergebot · 2025-08-04T09:21:45Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

albanD

@huydhn can you give details on why this is needed and why this is ok to do?

huydhn · 2025-08-05T03:14:23Z

@albanD This is one of the PR to support s390x that @AlekseiNikiforovIBM brought up on Slack, and then to our OH few weeks ago as there wasn't any progress on the PR. I stamped it after searching the code base and saw similar usage https://github.com/search?q=repo%3Apytorch%2Fpytorch%20__BYTE_ORDER__&type=code. I thought that it was safe because the codepath for __ORDER_BIG_ENDIAN__ is added while the rest stayed the same and the change looks simple enough. What do you think of this change?

FYI, there is another more complex one that is still open #151447

albanD · 2025-08-05T15:20:30Z

I am confused why this PR is needed to begin with since both sides have the same endianess?

AlekseiNikiforovIBM · 2025-08-06T09:16:30Z

I am confused why this PR is needed to begin with since both sides have the same endianess?

There was a test failure in TestLibtorchAgnosticCPU::test_my_ones_like_cpu. The issue is that, for example, int (4 bytes) value is put into long (8 bytes), and used as long.

Let's say number was 305419896 (0x12345678).

Originally value was put into first 4 bytes of long, so we get 0x78 0x56 0x34 0x12 0x00 0x00 0x00 0x00 in memory, which is also 0x12345678 even as long. But on big endian we got 0x12 0x34 0x56 0x78 0x00 0x00 0x00 0x00 in memory, which is actually 0x1234567800000000 as long, which is a different value, value of 1311768464867721216.

So, with this change we actually put it as 0x00 0x00 0x00 0x00 0x12 0x34 0x56 0x78 into memory on big endian system, which is also 0x12345678 as long on big endian system.

AlekseiNikiforovIBM requested review from huydhn, seemethere and malfet June 4, 2025 11:50

AlekseiNikiforovIBM added topic: not user facing topic category ciflow/s390 s390x-related CI jobs labels Jun 4, 2025

pytorchbot added the open source label Jun 4, 2025

Skylion007 previously approved these changes Jun 4, 2025

View reviewed changes

jcaip added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jun 5, 2025

AlekseiNikiforovIBM force-pushed the s390x_libtorch_agnostic branch from 485e25c to 6283f59 Compare June 18, 2025 12:56

AlekseiNikiforovIBM requested a review from a team as a code owner June 18, 2025 12:56

AlekseiNikiforovIBM requested a review from jeffdaily as a code owner June 18, 2025 15:40

pytorchmergebot force-pushed the s390x_libtorch_agnostic branch from 60964d5 to c78ce6c Compare June 23, 2025 12:38

pytorchmergebot force-pushed the s390x_libtorch_agnostic branch from c78ce6c to 7a41f76 Compare July 1, 2025 13:47

AlekseiNikiforovIBM mentioned this pull request Jul 3, 2025

[CI] s390x-periodic tests broken with "No matching distribution found for cuda-bindings<13.0,>=12.0" #157409

Closed

pytorchmergebot force-pushed the s390x_libtorch_agnostic branch from 7a41f76 to c2e5e2c Compare July 8, 2025 09:12

AlekseiNikiforovIBM added 2 commits July 8, 2025 13:46

Enable cpp_extensions/libtorch_agnostic_extension/test/test_libtorch_…

5e7fe14

…agnostic test on s390x

AlekseiNikiforovIBM force-pushed the s390x_libtorch_agnostic branch from c2e5e2c to 5e7fe14 Compare July 8, 2025 11:46

huydhn requested a review from Skylion007 July 18, 2025 17:09

huydhn approved these changes Jul 18, 2025

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 4, 2025

pytorchmergebot added the merging label Aug 4, 2025

pytorchmergebot added the Merged label Aug 4, 2025

pytorchmergebot closed this in e5a81aa Aug 4, 2025

pytorchmergebot removed the merging label Aug 4, 2025

albanD reviewed Aug 4, 2025

View reviewed changes

Fix conversion of values in libtorch agnostic tests #155115

Fix conversion of values in libtorch agnostic tests #155115

Uh oh!

Conversation

AlekseiNikiforovIBM commented Jun 4, 2025

Uh oh!

pytorch-bot bot commented Jun 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155115

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

AlekseiNikiforovIBM commented Jun 23, 2025

Uh oh!

pytorchmergebot commented Jun 23, 2025

Uh oh!

pytorchmergebot commented Jun 23, 2025

Uh oh!

AlekseiNikiforovIBM commented Jun 24, 2025

Uh oh!

AlekseiNikiforovIBM commented Jul 1, 2025

Uh oh!

pytorchmergebot commented Jul 1, 2025

Uh oh!

pytorchmergebot commented Jul 1, 2025

Uh oh!

AlekseiNikiforovIBM commented Jul 8, 2025

Uh oh!

pytorchmergebot commented Jul 8, 2025

Uh oh!

pytorchmergebot commented Jul 8, 2025

Uh oh!

AlekseiNikiforovIBM commented Jul 9, 2025

Uh oh!

AlekseiNikiforovIBM commented Aug 4, 2025

Uh oh!

pytorchmergebot commented Aug 4, 2025

Merge started

Uh oh!

albanD left a comment

Choose a reason for hiding this comment

Uh oh!

huydhn commented Aug 5, 2025

Uh oh!

albanD commented Aug 5, 2025

Uh oh!

AlekseiNikiforovIBM commented Aug 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Jun 4, 2025 •

edited

Loading

AlekseiNikiforovIBM commented Aug 6, 2025 •

edited

Loading