ggml: check if non-native endian model is being loaded #13943

taronaeo · 2025-05-31T13:08:06Z

This PR adds more descriptive error messages for when a non-native endian model is being loaded on the host system.

Verification

To ensure that this implementation did not break anything, this PR has been tested on the following systems:

IBM z15 Mainframe (16 IFLs / 160 RAIM / NOSMT / LPAR)
M3 MacBook Air (8 Cores / 16 GB / SMT)
Kindly request additional systems to be tested in this PR

System	Granite 3.1 2B Instruct LE Model	Granite 3.1 2B Instruct BE Model
IBM z15 Mainframe (BE System)	❌ Does not load (Expected)	✅ Loads (Expected)
M3 MacBook Air (LE System)	✅ Loads (Expected)	❌ Does not load (Expected)

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

ggml/src/gguf.cpp

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

taronaeo · 2025-06-01T09:26:08Z

@JohannesGaessler applied your suggested changes. I've moved the GGML_ASSERT just after the endianness check, otherwise it will trigger the assert before the check could produce any useful information for the end-user.

PTAL again.

P.S., GGML_ASSERT fails because the version field in non-native endian models are read as 50331648.

Edit: Also re-tested the new changes on ARM64 and s390x. Both worked as intended.

ggml/src/gguf.cpp

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

JohannesGaessler

I would say with this error message it's fine to remove the assert again but either way is fine I think.

taronaeo · 2025-06-01T13:37:06Z

Unfortunately that GGML_ASSERT is still required because when I change the version to 0x0001 FFFF, the assert catches it via

llama.cpp-s390x/ggml/src/gguf.cpp:363: GGML_ASSERT(ctx->version > 0 && ctx->version <= 65535) failed

while the endianness check did not; as expected.

If the CI passes, feel free to merge it into master :)

taronaeo added 2 commits May 31, 2025 20:59

gguf: prevent non-native endian models from being loaded

e2fdf28

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

gguf: update error message

cc1def7

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

taronaeo requested a review from JohannesGaessler as a code owner May 31, 2025 13:08

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label May 31, 2025

JohannesGaessler reviewed Jun 1, 2025

View reviewed changes

ggml/src/gguf.cpp Outdated Show resolved Hide resolved

taronaeo added 2 commits June 1, 2025 17:12

gguf: make the non-native endian check more verbose

25c971d

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

ggml: move ggml_assert location

65bf062

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

JohannesGaessler reviewed Jun 1, 2025

View reviewed changes

ggml/src/gguf.cpp Show resolved Hide resolved

ggml: reword the endianness check error message

c83208f

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

JohannesGaessler approved these changes Jun 1, 2025

View reviewed changes

JohannesGaessler merged commit e57bb87 into ggml-org:master Jun 1, 2025
46 checks passed

JohannesGaessler mentioned this pull request Jun 1, 2025

gguf: fix failure on version == 0 #13956

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml: check if non-native endian model is being loaded #13943

ggml: check if non-native endian model is being loaded #13943

Uh oh!

taronaeo commented May 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

taronaeo commented Jun 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

JohannesGaessler left a comment

Uh oh!

taronaeo commented Jun 1, 2025

Uh oh!

Uh oh!

Uh oh!

ggml: check if non-native endian model is being loaded #13943

ggml: check if non-native endian model is being loaded #13943

Uh oh!

Conversation

taronaeo commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Verification

Uh oh!

Uh oh!

taronaeo commented Jun 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

JohannesGaessler left a comment

Choose a reason for hiding this comment

Uh oh!

taronaeo commented Jun 1, 2025

Uh oh!

Uh oh!

Uh oh!

taronaeo commented May 31, 2025 •

edited

Loading

taronaeo commented Jun 1, 2025 •

edited

Loading