[safetensors] parameters count based on quantization config #1673

mishig25 · 2025-08-06T08:41:50Z

Enhance safetensors metadata parsing to support new data types and quantization configurations

Added support for sub-byte data types: F4, F6_E2M3, F6_E3M2, E8M0, FP4, and UE8.
Implemented fetching and parsing of model configuration for quantization information.
Updated parameter counting functions to consider quantization effects on tensor parameters.
Added tests to validate the handling of new data types and parameter counting for large models.
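
As a rough illustration of the counting change described above, here is a minimal sketch, not the code from this PR. It assumes that quantized weights are packed into `U8` tensors, so one stored byte holds `8 / bits` logical parameters. The names `DTYPE_BITS`, `QuantizationConfig`, `modulesToNotConvert`, and `countParameters` are hypothetical and chosen only for this example.

```typescript
// Hypothetical sketch: names and packing rule are assumptions, not the PR's API.
type Dtype = "F32" | "F16" | "BF16" | "U8" | "F4";

// Storage width in bits per element; F4 is a sub-byte type.
const DTYPE_BITS: Record<Dtype, number> = { F32: 32, F16: 16, BF16: 16, U8: 8, F4: 4 };

interface TensorInfo {
  dtype: Dtype;
  shape: number[];
}

// Assumed shape of the quantization section of a model config.
interface QuantizationConfig {
  bits: number; // logical bits per quantized weight, e.g. 4
  modulesToNotConvert?: string[]; // name fragments of tensors left unquantized
}

// Number of stored elements in a tensor (product of its dimensions).
function countElements(shape: number[]): number {
  return shape.reduce((acc, dim) => acc * dim, 1);
}

// Bytes needed to store a tensor, rounding up for sub-byte dtypes.
function tensorByteSize(t: TensorInfo): number {
  return Math.ceil((countElements(t.shape) * DTYPE_BITS[t.dtype]) / 8);
}

// Per-dtype parameter totals. Packed U8 tensors are scaled by 8 / bits
// when a quantization config applies to them.
function countParameters(
  tensors: Record<string, TensorInfo>,
  quant?: QuantizationConfig
): Partial<Record<Dtype, number>> {
  const counts: Partial<Record<Dtype, number>> = {};
  for (const [name, t] of Object.entries(tensors)) {
    const skipped = quant?.modulesToNotConvert?.some((m) => name.includes(m)) ?? false;
    const packed = quant !== undefined && !skipped && t.dtype === "U8" && quant.bits < 8;
    const multiplier = packed ? 8 / quant.bits : 1;
    counts[t.dtype] = (counts[t.dtype] ?? 0) + countElements(t.shape) * multiplier;
  }
  return counts;
}
```

With a 4-bit config, a `U8` tensor of shape `[4, 8]` would count as 64 parameters instead of 32, while an `F16` tensor is counted as stored.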

SunMarc

Wow, thanks for taking quantization into account in the parameter count!

mishig25 requested a review from coyotte508 as a code owner August 6, 2025 08:41

mishig25 requested review from julien-c and SunMarc August 6, 2025 08:42

[safetensors] parameters count based on quantization config

5070231

mishig25 force-pushed the st_quant_config branch from e126515 to 5070231 Compare August 6, 2025 08:47

mishig25 merged commit db10397 into main Aug 6, 2025
4 of 5 checks passed

mishig25 deleted the st_quant_config branch August 6, 2025 08:50

SunMarc reviewed Aug 6, 2025
