Support fp8 weight quant cache #10914

zhangbo9674 · 2025-08-07T03:06:51Z

Before submitting

Lint code. If there are lint issues, please format the code first.

# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --files XXXX.py

Add test cases to the tests folder. If there are codecov issues, please add test cases first.

PR types

Performance optimization

PR changes

Others

Description

Support fp8 weight quant cache
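
The description does not say how the cache works, so the following is only a rough sketch of the general idea behind a weight quantization cache, not the code in this PR. It assumes per-tensor scaling to the FP8 E4M3 range (maximum finite value 448) and a caller-supplied version counter that changes whenever the weight is updated. `WeightQuantCache`, `get`, `quantize_calls`, and `version` are hypothetical names chosen for illustration; none of them come from PaddleNLP.

```python
# Illustrative sketch only: every name here is hypothetical, not PaddleNLP API.
FP8_E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3


class WeightQuantCache:
    """Memoize (scaled_weight, scale) per weight until the weight changes."""

    def __init__(self):
        self._entries = {}  # name -> (version, scaled values, scale)
        self.quantize_calls = 0  # counts real quantization work, for inspection

    def _quantize(self, values):
        # Per-tensor scaling: map the largest magnitude onto the FP8 range.
        # A real kernel would also round to FP8; this sketch only scales and clamps.
        self.quantize_calls += 1
        amax = max((abs(v) for v in values), default=0.0)
        scale = amax / FP8_E4M3_MAX if amax > 0 else 1.0
        scaled = [max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v / scale)) for v in values]
        return scaled, scale

    def get(self, name, values, version):
        # Reuse the cached result while the weight version is unchanged.
        entry = self._entries.get(name)
        if entry is not None and entry[0] == version:
            return entry[1], entry[2]
        scaled, scale = self._quantize(values)
        self._entries[name] = (version, scaled, scale)
        return scaled, scale


cache = WeightQuantCache()
weight = [0.5, -2.0, 1.0]
cache.get("linear.weight", weight, version=0)  # quantizes
cache.get("linear.weight", weight, version=0)  # served from the cache
cache.get("linear.weight", weight, version=1)  # weight changed, quantizes again
```

Under these assumptions, a weight is requantized only when its version changes (for example after an optimizer step), so repeated forward passes between updates skip the quantization cost.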

paddle-bot · 2025-08-07T03:06:56Z

Thanks for your contribution!

support fp8 weight quant cache

eaa8a58

zhangbo9674 added 3 commits August 7, 2025 03:33

fix bug

34fef01

fix conflict

3ec8c4a

fix confilct

39f57d7

phlrain approved these changes Aug 7, 2025

View reviewed changes

phlrain merged commit e911a76 into PaddlePaddle:dsv3_dev Aug 7, 2025
2 of 5 checks passed
