sycl: Batched mulmat rework for oneDNN dispatch #14617

ShanoToni · 2025-07-10T15:20:44Z

This PR reworks how the SYCL backend dispatches batched mul_mat operations to oneDNN. The new dispatch makes better use of broadcasting when the inputs have non-matching batch sizes, and it handles non-contiguous data passed in the tensors. This reduces the number of calls to oneDNN matmul.
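To illustrate the idea of broadcasting over batch dimensions with explicit strides, here is a minimal CPU reference sketch. It is not the code from this PR and does not call oneDNN. The `Strided3D` struct and `batched_matmul_broadcast` function are hypothetical names introduced for illustration, and the index mapping `b / r` is assumed to mirror the ggml convention where the smaller batch is repeated across the larger one.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical descriptor (not a ggml or oneDNN type): a batch of matrices
// addressed through element strides, so the data need not be contiguous.
struct Strided3D {
    const float* data;
    std::size_t batch;         // number of matrices in the batch
    std::size_t batch_stride;  // elements between consecutive matrices
    std::size_t row_stride;    // elements between consecutive rows
    std::size_t col_stride;    // elements between consecutive columns
};

// Computes C[b] = A[b / r] * B[b], where r = B.batch / A.batch.
// A is (m x k), B is (k x n), and C is returned as contiguous row-major.
// One call covers the whole batch instead of one call per matrix.
std::vector<float> batched_matmul_broadcast(const Strided3D& a, const Strided3D& b,
                                            std::size_t m, std::size_t k, std::size_t n) {
    assert(a.batch > 0 && b.batch % a.batch == 0);
    const std::size_t r = b.batch / a.batch;  // broadcast ratio
    std::vector<float> c(b.batch * m * n, 0.0f);
    for (std::size_t bi = 0; bi < b.batch; ++bi) {
        const float* am = a.data + (bi / r) * a.batch_stride;  // reused A matrix
        const float* bm = b.data + bi * b.batch_stride;
        for (std::size_t i = 0; i < m; ++i) {
            for (std::size_t j = 0; j < n; ++j) {
                float sum = 0.0f;
                for (std::size_t p = 0; p < k; ++p) {
                    sum += am[i * a.row_stride + p * a.col_stride] *
                           bm[p * b.row_stride + j * b.col_stride];
                }
                c[(bi * m + i) * n + j] = sum;
            }
        }
    }
    return c;
}
```

Because every access goes through the strides, a transposed or otherwise non-contiguous operand can be described without copying it first, and a single A matrix can serve several B matrices.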

Additionally, the PR includes a small fix for the ggml_sycl_mul_mat_vec_nc case that allows src1 to be non-contiguous as well.
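For readers unfamiliar with the term, the sketch below shows what "non-contiguous" means for a tensor described by per-dimension sizes and byte strides. `TensorView`, `is_contiguous`, and `permute_01` are hypothetical stand-ins loosely modelled on ggml's `ne`/`nb` fields. This is a simplified check that assumes a non-quantized element type, not the backend's actual implementation.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <utility>

// Hypothetical stand-in for tensor metadata (not the real ggml_tensor).
struct TensorView {
    std::array<std::int64_t, 4> ne;  // number of elements per dimension
    std::array<std::size_t, 4> nb;   // stride in bytes per dimension
    std::size_t type_size;           // bytes per element (non-quantized type assumed)
};

// A view is contiguous when each stride equals the packed size of all
// lower dimensions. Dimensions of size 1 are skipped: their stride is never used.
bool is_contiguous(const TensorView& t) {
    std::size_t expected = t.type_size;
    for (std::size_t i = 0; i < 4; ++i) {
        if (t.ne[i] != 1 && t.nb[i] != expected) {
            return false;
        }
        expected *= static_cast<std::size_t>(t.ne[i]);
    }
    return true;
}

// Swapping the first two dimensions without moving data yields a
// non-contiguous view of the same buffer (unless it is trivially small).
TensorView permute_01(TensorView t) {
    std::swap(t.ne[0], t.ne[1]);
    std::swap(t.nb[0], t.nb[1]);
    return t;
}
```

A kernel that assumes contiguous input would read the wrong elements from such a permuted view, which is why a code path has to index through the strides if it wants to accept a non-contiguous src1.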

test-backend-ops passes all tests.

llama-bench with a qwen2 model shows no performance regression compared to master (run on Intel Battlemage).

Master

| model | size | params | backend | ngl | test | t/s |
| --- | --- | --- | --- | --- | --- | --- |
| qwen2 1.5B Q4_0 | 1013.62 MiB | 1.78 B | SYCL | 99 | pp512 | 8531.35 ± 62.72 |
| qwen2 1.5B Q4_0 | 1013.62 MiB | 1.78 B | SYCL | 99 | tg128 | 157.24 ± 0.27 |

PR branch

| model | size | params | backend | ngl | test | t/s |
| --- | --- | --- | --- | --- | --- | --- |
| qwen2 1.5B Q4_0 | 1013.62 MiB | 1.78 B | SYCL | 99 | pp512 | 8602.93 ± 39.79 |
| qwen2 1.5B Q4_0 | 1013.62 MiB | 1.78 B | SYCL | 99 | tg128 | 157.20 ± 0.76 |

Alcpz

Thanks for changing the semantics of the dnn calls

ggml/src/ggml-sycl/ggml-sycl.cpp

github-actions bot added the ggml and SYCL labels Jul 10, 2025

Alcpz reviewed Jul 11, 2025


ggml/src/ggml-sycl/ggml-sycl.cpp

sycl: Batched mulmat rework for oneDNN dispatch

b2785f8

ShanoToni force-pushed the sycl_mul_mat_batched_rework branch from 747c12e to b2785f8 Compare July 11, 2025 10:57

Alcpz approved these changes Jul 11, 2025


Alcpz merged commit 65a3ebb into ggml-org:master Jul 14, 2025
48 checks passed

Alcpz mentioned this pull request Jul 14, 2025

SYCL: use 1D kernel for set_rows #14618

Merged

ShanoToni mentioned this pull request Jul 14, 2025

sycl: Hotfix for non dnnl codepath #14677

Merged
