scripts: add bpw per layer and model #14703


Merged (1 commit) on Jul 15, 2025
Conversation

EAddario (Contributor)

Since llama-quantize allows users to select a wide range of quant types, it is not always obvious which weight encoding scheme is most appropriate when following the GGUF naming conventions.

This PR modifies gguf_dump.py to display the bits per weight (bpw) for each layer, and for the overall model, when using the --markdown option.
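The underlying computation is straightforward: bits per weight is a tensor's storage size in bits divided by its element count. A minimal sketch (not the PR's actual code; the `bits_per_weight` helper and the example figures are illustrative):

```python
def bits_per_weight(size_bytes: int, n_elements: int) -> float:
    """Average number of bits used to store each weight."""
    return size_bytes * 8 / n_elements

# Example: a 4096x4096 tensor quantized with a K-quant that packs
# 256 weights into 144-byte blocks (as Q4_K does in llama.cpp).
n_elements = 4096 * 4096
size_bytes = (n_elements // 256) * 144
print(f"{bits_per_weight(size_bytes, n_elements):.2f} bpw")  # 4.50 bpw
```

Summing sizes and element counts across all tensors before dividing gives the overall model bpw, which is what the `--markdown` output reports alongside the per-layer values.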

@github-actions github-actions bot added the python python script changes label Jul 15, 2025
@CISC CISC merged commit c81f419 into ggml-org:master Jul 15, 2025
4 checks passed
@EAddario EAddario deleted the gguf_dump branch July 16, 2025 06:39
@EAddario (Contributor, Author)
Thank you @CISC
