[test][do not merge] Upgrade oneDNN to v3.9 #157994

yanbing-j · 2025-07-10T02:25:52Z

Add this commit inside this PR
Rebase to the latest main to fix XPU failures.

cc @gujinghui @PenghuiCheng @XiaobingSuper @jianyuh @jgong5 @mingfeima @sanchitintel @ashokei @jingxu10 @min-jean-cho @Guobing-Chen @Xia-Weiwen @snadampal @malfet @milpuz01 @aditew01 @nikhil-arm @fadara01

pytorch-bot · 2025-07-10T02:25:56Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157994

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 53 Pending

As of commit 1d51259 with merge base bfc873d ():

NEW FAILURES - The following jobs have failed:

linux-aarch64 / linux-jammy-aarch64-py3.10 / test (default, 1, 3, linux.arm64.m7g.4xlarge) (gh)
'test/test_ops.py::TestCommonCPU::test_dtypes_nn_functional_conv2d_cpu'
linux-aarch64 / linux-jammy-aarch64-py3.10 / test (default, 2, 3, linux.arm64.m7g.4xlarge) (gh)
'test/test_ops.py::TestFakeTensorCPU::test_fake_autocast_nn_functional_conv2d_cpu_float32'
linux-aarch64 / linux-jammy-aarch64-py3.10 / test (default, 3, 3, linux.arm64.m7g.4xlarge) (gh)
'test/test_ops.py::TestCommonCPU::test_dtypes_nn_functional_conv1d_cpu'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

yanbing-j · 2025-07-10T03:21:38Z

Hi @sanchitintel , I'm trying to upgrade oneDNN to v3.9 for testing. since v3.9 is not officially released, I tried main branch commit first. And it wil fail in test_jit_llga_fuser.py::TestOpCPU::test_add_cpu_float32. Could you please take a look first? Thanks!

FAILED [0.0335s] test_jit_llga_fuser.py::TestOpCPU::test_add_cpu_float32 - AssertionError: Tensor-likes are not close!
2025-07-10T02:59:20.4033167Z 
2025-07-10T02:59:20.4033380Z Mismatched elements: 13696 / 25088 (54.6%)
2025-07-10T02:59:20.4034391Z Greatest absolute difference: 1.701464974517854e+38 at index (0, 0, 5, 12) (up to 1e-05 allowed)
2025-07-10T02:59:20.4035723Z Greatest relative difference: inf at index (0, 4, 25, 12) (up to 1.3e-06 allowed)
2025-07-10T02:59:20.4036446Z 
2025-07-10T02:59:20.4036941Z To execute this test, run the following from the base repo dir:
2025-07-10T02:59:20.4037928Z     python test/test_jit_llga_fuser.py TestOpCPU.test_add_cpu_float32

Skylion007 · 2025-07-10T13:26:44Z

Does this fix uxlfoundation/oneDNN#3383 btw? ;-;

vpirogov · 2025-07-10T23:20:53Z

Does this fix uxlfoundation/oneDNN#3383 btw? ;-;

Unfortunately no. We recommend disabling IPO for oneDNN builds on Windows.

yanbing-j · 2025-07-17T02:36:19Z

Hi @sanchitintel , I'm trying to upgrade oneDNN to v3.9 for testing. since v3.9 is not officially released, I tried main branch commit first. And it wil fail in test_jit_llga_fuser.py::TestOpCPU::test_add_cpu_float32. Could you please take a look first? Thanks!

FAILED [0.0335s] test_jit_llga_fuser.py::TestOpCPU::test_add_cpu_float32 - AssertionError: Tensor-likes are not close!
2025-07-10T02:59:20.4033167Z 
2025-07-10T02:59:20.4033380Z Mismatched elements: 13696 / 25088 (54.6%)
2025-07-10T02:59:20.4034391Z Greatest absolute difference: 1.701464974517854e+38 at index (0, 0, 5, 12) (up to 1e-05 allowed)
2025-07-10T02:59:20.4035723Z Greatest relative difference: inf at index (0, 4, 25, 12) (up to 1.3e-06 allowed)
2025-07-10T02:59:20.4036446Z 
2025-07-10T02:59:20.4036941Z To execute this test, run the following from the base repo dir:
2025-07-10T02:59:20.4037928Z     python test/test_jit_llga_fuser.py TestOpCPU.test_add_cpu_float32

UT python test/test_jit_llga_fuser.py TestOpCPU.test_add_cpu_float32 has been fixed.

yanbing-j · 2025-07-17T05:04:37Z

Hi @vpirogov ,

For the bazel build failure of https://github.com/pytorch/pytorch/actions/runs/16334831332/job/46154536338, does #include <altivec.h> need to be inside #ifdef __MMA__ in src/cpu/ppc64/ppc64_gemm_reorder.cpp:20? And I'm not sure if we need to build files under mkl-dnn/src/cpu/ppc64/, can we just simply disable the build of ppc64?

yanbing-j · 2025-07-24T06:05:44Z

@pytorchbot rebase

pytorchmergebot · 2025-07-24T06:07:11Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-07-24T06:07:14Z

Successfully rebased yanbing/update_onednn_v3.9 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout yanbing/update_onednn_v3.9 && git pull --rebase)

yanbing-j · 2025-07-31T08:05:50Z

Hi @milpuz01 , could you please help verify this CI failure https://github.com/pytorch/pytorch/actions/runs/16643249983/job/47097723925?pr=157994? Thanks!

aditew01 · 2025-07-31T09:26:30Z

Hi @milpuz01 , could you please help verify this CI failure https://github.com/pytorch/pytorch/actions/runs/16643249983/job/47097723925?pr=157994? Thanks!

Thanks for testing this @yanbing-j. Seems like ACL version should be bumped up to the latest one.

2025-07-31T07:59:30.9603124Z -- Found ACL: /ComputeLibrary
2025-07-31T07:59:31.0004741Z �[31mCMake Error at third_party/ideep/mkl-dnn/cmake/ACL.cmake:67 (message):
2025-07-31T07:59:31.0005536Z Detected ACL version 25.02, but minimum compatible is 52.0

I can test the build with a forked version of this changes after bumping ACL properly to verify.

aditew01 · 2025-07-31T14:20:08Z

@yanbing-j with this commit, the build is working fine. I'll open a draft PR to upgrade ACL with this version. The build is working fine now: https://github.com/pytorch/pytorch/actions/runs/16647200279/job/47110635776

yanbing-j · 2025-08-01T01:24:14Z

@yanbing-j with this commit, the build is working fine. I'll open a draft PR to upgrade ACL with this version. The build is working fine now: https://github.com/pytorch/pytorch/actions/runs/16647200279/job/47110635776

Thanks @aditew01 ! Will the PR of upgrading ACL to the version be fast? Maybe I can involve the code change inside the current PR.

aditew01 · 2025-08-01T10:00:05Z

Maybe I can involve the code change inside the current PR.

@yanbing-j sounds good to me. Thanks!

yanbing-j · 2025-08-04T01:07:53Z

Maybe I can involve the code change inside the current PR.

@yanbing-j sounds good to me. Thanks!

Okay. I have put this into the task list. Thanks!

Fix CI failures Fix bazel build failure

pytorch-bot bot added ciflow/linux-aarch64 linux aarch64 CI workflow module: mkldnn Related to Intel IDEEP or oneDNN (a.k.a. mkldnn) integration topic: not user facing topic category labels Jul 10, 2025

yanbing-j added ciflow/trunk Trigger trunk jobs on your pull request ci-no-td Do not run TD on this PR labels Jul 10, 2025

pytorchbot added the open source label Jul 10, 2025

yanbing-j force-pushed the yanbing/update_onednn_v3.9 branch from 776d19c to b40994b Compare July 17, 2025 02:35

yanbing-j added ciflow/inductor ciflow/xpu Run XPU CI tasks labels Jul 18, 2025

yanbing-j force-pushed the yanbing/update_onednn_v3.9 branch from b40994b to 210dee3 Compare July 23, 2025 01:26

pytorchmergebot force-pushed the yanbing/update_onednn_v3.9 branch from 210dee3 to d226856 Compare July 24, 2025 06:07

yanbing-j force-pushed the yanbing/update_onednn_v3.9 branch 3 times, most recently from 8211889 to 96d0c2e Compare July 31, 2025 07:53

yanbing-j added the module: arm Related to ARM architectures builds of PyTorch. Includes Apple M1 label Jul 31, 2025

yanbing-j added the ciflow/op-benchmark Trigger microbenchmark for operations. label Aug 5, 2025

aditew01 mentioned this pull request Aug 11, 2025

[DO NOT MERGE] ACL Version Upgrade v52.3.0 #160316

Draft

yanbing-j and others added 2 commits August 12, 2025 08:19

Upgrade oneDNN to v3.9

e35e194

Fix CI failures Fix bazel build failure

acl upgrade

1d51259

yanbing-j force-pushed the yanbing/update_onednn_v3.9 branch from 96d0c2e to 1d51259 Compare August 12, 2025 08:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[test][do not merge] Upgrade oneDNN to v3.9 #157994

[test][do not merge] Upgrade oneDNN to v3.9 #157994

yanbing-j commented Jul 10, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 10, 2025 •

edited

Loading

Uh oh!

yanbing-j commented Jul 10, 2025

Uh oh!

Skylion007 commented Jul 10, 2025

Uh oh!

vpirogov commented Jul 10, 2025

Uh oh!

yanbing-j commented Jul 17, 2025

Uh oh!

yanbing-j commented Jul 17, 2025 •

edited

Loading

Uh oh!

yanbing-j commented Jul 24, 2025

Uh oh!

pytorchmergebot commented Jul 24, 2025

Uh oh!

pytorchmergebot commented Jul 24, 2025

Uh oh!

yanbing-j commented Jul 31, 2025

Uh oh!

aditew01 commented Jul 31, 2025

Uh oh!

aditew01 commented Jul 31, 2025 •

edited

Loading

Uh oh!

yanbing-j commented Aug 1, 2025

Uh oh!

aditew01 commented Aug 1, 2025

Uh oh!

yanbing-j commented Aug 4, 2025

Uh oh!

Uh oh!

[test][do not merge] Upgrade oneDNN to v3.9 #157994

Are you sure you want to change the base?

[test][do not merge] Upgrade oneDNN to v3.9 #157994

Conversation

yanbing-j commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157994

❌ 3 New Failures, 53 Pending

Uh oh!

yanbing-j commented Jul 10, 2025

Uh oh!

Skylion007 commented Jul 10, 2025

Uh oh!

vpirogov commented Jul 10, 2025

Uh oh!

yanbing-j commented Jul 17, 2025

Uh oh!

yanbing-j commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yanbing-j commented Jul 24, 2025

Uh oh!

pytorchmergebot commented Jul 24, 2025

Uh oh!

pytorchmergebot commented Jul 24, 2025

Uh oh!

yanbing-j commented Jul 31, 2025

Uh oh!

aditew01 commented Jul 31, 2025

Uh oh!

aditew01 commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yanbing-j commented Aug 1, 2025

Uh oh!

aditew01 commented Aug 1, 2025

Uh oh!

yanbing-j commented Aug 4, 2025

Uh oh!

Uh oh!

yanbing-j commented Jul 10, 2025 •

edited

Loading

pytorch-bot bot commented Jul 10, 2025 •

edited

Loading

yanbing-j commented Jul 17, 2025 •

edited

Loading

aditew01 commented Jul 31, 2025 •

edited

Loading