
Add MXFP4 GGUF QuantizationType #1677


Merged · 4 commits · Aug 7, 2025

Conversation

@CISC (Contributor) commented Aug 7, 2025

Added in GPT-OSS PR ggml-org/llama.cpp#15091
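For context, the change amounts to registering the new MXFP4 quant type in the package's enums. A minimal sketch of what that might look like, assuming the enums resemble those in `@huggingface/gguf`; the entries shown and the numeric values are illustrative placeholders, not the authoritative ones from the upstream PR:

```typescript
// Sketch: the gguf package keeps two distinct numeric enums. The tensor-level
// GGMLQuantizationType mirrors ggml's ggml_type, while the file-level
// GGMLFileQuantizationType mirrors llama.cpp's ftype numbering. The values
// below are hypothetical for illustration.
enum GGMLQuantizationType {
  F32 = 0,
  F16 = 1,
  Q4_0 = 2,
  // ...existing entries elided...
  MXFP4 = 39, // hypothetical value added for the new type
}

enum GGMLFileQuantizationType {
  F32 = 0,
  F16 = 1,
  // ...existing entries elided...
  MXFP4 = 38, // hypothetical value; the file-level enum has its own numbering
}

// Numeric enums support reverse lookup, which GGUF parsers use to turn the
// on-disk type ID back into a readable name:
console.log(GGMLQuantizationType[GGMLQuantizationType.MXFP4]); // "MXFP4"
console.log(GGMLFileQuantizationType[GGMLFileQuantizationType.MXFP4]); // "MXFP4"
```

Note the two enums use independent numberings, which is exactly what the CI discussion below turns on.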

@CISC (Contributor, Author) commented Aug 7, 2025

Hmmm, I think this test is wrong, it checks two different things against each other:
https://github.com/huggingface/huggingface.js/actions/runs/16801643530/job/47613401870?pr=1677#step:7:95

@CISC (Contributor, Author) commented Aug 7, 2025

It checks GGMLQuantizationType against GGUF_QUANT_ORDER which is GGMLFileQuantizationType, they are not equivalent:

export const GGUF_QUANT_ORDER: GGMLFileQuantizationType[] = [
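To illustrate why that comparison is wrong: the same quant name can map to different numeric values in each enum, so an ordering list declared as `GGMLFileQuantizationType[]` cannot be validated against `GGMLQuantizationType` members. A minimal sketch of the mismatch, with made-up values standing in for the real enums in `@huggingface/gguf`:

```typescript
// Illustrative values only; the real enums live in @huggingface/gguf.
// Tensor-level ggml type IDs:
enum GGMLQuantizationType {
  Q4_0 = 2,
  Q8_0 = 8,
}

// File-level ftype IDs use their own, unrelated numbering:
enum GGMLFileQuantizationType {
  Q4_0 = 2,
  Q8_0 = 7,
}

const GGUF_QUANT_ORDER: GGMLFileQuantizationType[] = [
  GGMLFileQuantizationType.Q8_0,
  GGMLFileQuantizationType.Q4_0,
];

// A test asserting "every GGMLQuantizationType value appears in
// GGUF_QUANT_ORDER" silently compares numbers from two unrelated enums:
const bogus = (GGUF_QUANT_ORDER as number[]).includes(GGMLQuantizationType.Q8_0);
console.log(bogus); // false: 8 is not in [7, 2], even though both mean Q8_0
```

The `as number[]` cast is needed because TypeScript treats the two enums as distinct nominal types, which is itself a hint that comparing them directly is a category error.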

@ngxson (Member) commented Aug 7, 2025

The CI seems to be broken, but this is quite a trivial change, so I guess it's OK to merge without the CI being green.

@ngxson ngxson merged commit e841a53 into huggingface:main Aug 7, 2025
3 of 4 checks passed
@CISC deleted the gguf-mxfp4 branch on August 7, 2025, 20:49