[Cherry-Pick][OP] cherry-pick #7073 support deepgemm for sm103#7081
[Cherry-Pick][OP] cherry-pick #7073 support deepgemm for sm103#7081EmmonsCurse merged 6 commits intoPaddlePaddle:release/2.5from
Conversation
|
Thanks for your contribution! |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## release/2.5 #7081 +/- ##
==============================================
Coverage ? 69.08%
==============================================
Files ? 390
Lines ? 54230
Branches ? 8553
==============================================
Hits ? 37464
Misses ? 14051
Partials ? 2715
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
fastdeploy-bot
left a comment
There was a problem hiding this comment.
🤖 AI Code Review
📋 Review 摘要
PR 概述:将 DeepGemm 的 SM 版本检查从 == 100 改为 >= 100,以支持 SM103 等更新的 Blackwell 架构 GPU
变更范围:model_executor/layers/quantization/
影响面 Tag:[OP] [Quantization]
问题
未发现阻塞性问题。
总体评价
变更逻辑正确且一致,将 SM 版本判断从 == 100(仅 SM100)改为 >= 100(SM100+),使得 DeepGemm 能够支持 SM103 等更新的 Blackwell 架构 GPU。代码库中其他类似逻辑(如 flash_attn_backend.py 中的 sm_version >= 100)已采用相同的 >= 模式,本次修改保持了一致性。
Motivation
deepgeem支持sm103
Modifications
deepgeem适配
Usage or Command
none
Accuracy Tests
none
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.