-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Insights: PaddlePaddle/PaddleNLP
Overview
Could not load contribution data
Please try again later
9 Pull requests merged by 7 people
-
fp8_quant_cache_and_using_fp8_gemm
#10923 merged
Aug 11, 2025 -
support bw split
#10915 merged
Aug 9, 2025 -
Support fp8 weight quant cache
#10914 merged
Aug 7, 2025 -
Support magic send for mtp
#10916 merged
Aug 7, 2025 -
Update ppo
#10912 merged
Aug 7, 2025 -
Fix mtp bug when send_mtp_embed=True
#10909 merged
Aug 7, 2025 -
fix ordered_save func
#10896 merged
Aug 5, 2025 -
token_dispatcher support expert_num 64
#10905 merged
Aug 5, 2025 -
Fix 0 size bug
#10906 merged
Aug 5, 2025
7 Pull requests opened by 6 people
-
Revert dsv3_dev to runnable version
#10907 opened
Aug 5, 2025 -
Add vapo
#10908 opened
Aug 5, 2025 -
Update vapo
#10918 opened
Aug 8, 2025 -
[Auto-parallel] Improve usability of auto_dy pipeline parallel
#10920 opened
Aug 8, 2025 -
Fix bug when get shape attribute for PySafeSlice in Windows
#10924 opened
Aug 11, 2025 -
optimize reshard
#10925 opened
Aug 11, 2025 -
Add config control for quant cache
#10926 opened
Aug 11, 2025
3 Issues closed by 2 people
-
[Bug]: 5060显卡支持问题
#10664 closed
Aug 11, 2025 -
[Question]: 模型不能从本地目录加载吗?
#10917 closed
Aug 8, 2025 -
[Question]:
#10635 closed
Aug 5, 2025
5 Issues opened by 5 people
-
[Question]: No module named 'paddle.distributed.auto_parallel.intermediate'
#10922 opened
Aug 10, 2025 -
[Question]: 飞腾+昆仑R200,飞浆官网没有提供paddlepaddle3.0的昆仑芯 XPU 的 ARM 架构安装包
#10921 opened
Aug 9, 2025 -
[Question]: 关于开启stage3后,checkpoint保存失败的问题
#10919 opened
Aug 8, 2025 -
[Bug]: 单机运行DeepSeek_R1动态图 + MTP 动态图出现no attribute 'output_via_mq'
#10911 opened
Aug 6, 2025
47 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[Bug]: 在用Taskflow推理的时候,指定本地模型路径没生效
#10660 commented on
Aug 8, 2025 • 0 new comments -
[Question]: pp-uie使用taskflow无法加载自定义路径模型,只能加载默认路径:C:\Users\.paddlenlp\models\paddlenlp/PP-UIE-0.5B
#10409 commented on
Aug 9, 2025 • 0 new comments -
[Bug]: 安装完paddlenlp_ops运行不起来
#10719 commented on
Aug 10, 2025 • 0 new comments -
Add model VisualBERT
#1283 commented on
Aug 4, 2025 • 0 new comments -
Add byt5 Model
#1742 commented on
Aug 4, 2025 • 0 new comments -
Add question generation example
#2944 commented on
Aug 4, 2025 • 0 new comments -
Refactor training loop
#6098 commented on
Aug 4, 2025 • 0 new comments -
Moe
#6515 commented on
Aug 4, 2025 • 0 new comments -
[DO NOT MERGE][TaskFlow]Support llama & bloom taskflow
#6586 commented on
Aug 4, 2025 • 0 new comments -
[NPU] adaptation for LLaMA
#7262 commented on
Aug 4, 2025 • 0 new comments -
[AutoParallel] Test 3d SP acc
#7677 commented on
Aug 4, 2025 • 0 new comments -
[DO NOT Merge] Test dynamic auto parallel 3d sp acc
#7683 commented on
Aug 4, 2025 • 0 new comments -
vocabulary_expansion_pretraining
#7755 commented on
Aug 4, 2025 • 0 new comments -
[DO NOT MERGE] Dynamic Auto parallel Performance Test.
#7804 commented on
Aug 4, 2025 • 0 new comments -
ceval_quant_eval
#8220 commented on
Aug 7, 2025 • 0 new comments -
xxx. fix_ceval_quant_eval
#8221 commented on
Aug 6, 2025 • 0 new comments -
support Baichuan13B inference
#8287 commented on
Aug 6, 2025 • 0 new comments -
[DON'T NEED REVIEW] Mthreads llama 13 b 128 pp16
#9193 commented on
Aug 5, 2025 • 0 new comments -
Sci/benchmark iluvatar
#9369 commented on
Aug 5, 2025 • 0 new comments -
[DON'T NEED REVIEW] Mthreads llama 13 b 64 pp8
#9557 commented on
Aug 4, 2025 • 0 new comments -
[DON'T NEED REVIEW] Mthreads llama 13 b 64 pp16
#9558 commented on
Aug 4, 2025 • 0 new comments -
[LLM] Add fused attention in Qwen2MoE
#9767 commented on
Aug 8, 2025 • 0 new comments -
Support deepseek v3
#9835 commented on
Aug 4, 2025 • 0 new comments -
[Feature] Sageattn write 8 bit kv-cache
#10032 commented on
Aug 8, 2025 • 0 new comments -
2:4 sparse for int8/fp8/bf16/fp16 gemm
#10081 commented on
Aug 4, 2025 • 0 new comments -
Add fused_topk_to_multihot / fused_multihot_prob_backto_topk(grad) custom op to prevent CPU stall.
#10125 commented on
Aug 4, 2025 • 0 new comments -
[LLM] fix openai client and stream output bug
#10267 commented on
Aug 7, 2025 • 0 new comments -
Add record_stream for dispatch and combine output tensors
#10269 commented on
Aug 4, 2025 • 0 new comments -
Dsv3 dev
#10273 commented on
Aug 11, 2025 • 0 new comments -
Bf16 batch gemm dual gemm
#10281 commented on
Aug 8, 2025 • 0 new comments -
update pybind_H
#10299 commented on
Aug 10, 2025 • 0 new comments -
Deepseek xpu
#10340 commented on
Aug 4, 2025 • 0 new comments -
add_pybind for dyGraph predictor running
#10374 commented on
Aug 10, 2025 • 0 new comments -
int8 train test
#10451 commented on
Aug 9, 2025 • 0 new comments -
add append_attn ut
#10527 commented on
Aug 6, 2025 • 0 new comments -
Add llama-13b dynamic auto benchmark
#10654 commented on
Aug 5, 2025 • 0 new comments -
【Hackathon 8th No.28】在 PaddleNLP 中复现 Phi3
#10688 commented on
Aug 9, 2025 • 0 new comments -
Tmp llama13b benchmark
#10694 commented on
Aug 10, 2025 • 0 new comments -
add register for auto model
#10699 commented on
Aug 11, 2025 • 0 new comments -
add generate_expert_indices op
#10705 commented on
Aug 11, 2025 • 0 new comments -
Support wint2 unzip
#10706 commented on
Aug 5, 2025 • 0 new comments -
[Auto-Parallel] Add benchmark for fast_rms_norm in llama13b N4C32 dy_auto
#10713 commented on
Aug 9, 2025 • 0 new comments -
Add forward unittest for fused_transpose_split_quant
#10717 commented on
Aug 10, 2025 • 0 new comments -
[DeepGEMM] Print tuning and compilation time stat
#10725 commented on
Aug 11, 2025 • 0 new comments -
[Auto Parallel] Align the meaning of configuration items, decouple auto-parallel pipeline networking and trainer
#10775 commented on
Aug 6, 2025 • 0 new comments -
[CI]add workflow for llm&unittest-gpu
#10878 commented on
Aug 7, 2025 • 0 new comments -
add ppo
#10884 commented on
Aug 6, 2025 • 0 new comments