[Cherry-Pick] Add clear_grpah_opt_backend method to Qwen3VL (#7086) #7110
CSWYF3634076 merged 1 commit into PaddlePaddle:release/2.5
…Generation Add clear_grpah_opt_backend method that delegates to the underlying model to clear cuda graph optimization backend.
Thanks for your contribution!
Codecov Report
❌ Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## release/2.5 #7110 +/- ##
==============================================
Coverage ? 68.42%
==============================================
Files ? 390
Lines ? 54091
Branches ? 8515
==============================================
Hits ? 37012
Misses ? 14393
Partials ? 2686
fastdeploy-bot left a comment
🤖 AI Code Review
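📋 Review Summary
PR overview: Cherry-pick that adds the clear_grpah_opt_backend method to the Qwen3VLForConditionalGeneration class, to support clearing the CUDA graph optimization backend
Scope of changes: model_executor/models/qwen3_vl/
Impact tags: [Models] [Graph Optimization]
📝 PR Convention Check
The title uses [Cherry-Pick], which is not in the official tag list; a standard tag is recommended.
Suggested title (can be copied directly):
[Models] Add clear_grpah_opt_backend method to Qwen3VL (#7086 cherry-pick)
Issues
No blocking issues found.
Overall Assessment
The implementation is correct and follows the same delegation pattern used by other models in the project (such as qwen3.py, qwen2.py, and deepseek_v3.py). The method name clear_grpah_opt_backend contains a typo (grpah → graph), but it matches the project's existing naming, so keeping it is the right choice.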
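For readers unfamiliar with the delegation pattern mentioned in the review, here is a minimal sketch of the idea. It is not the code from this PR: the class names, the `model` attribute, and the `captured_graphs` dictionary are hypothetical stand-ins chosen for illustration, and only the method name `clear_grpah_opt_backend` comes from the PR itself.

```python
class InnerModel:
    """Hypothetical stand-in for the wrapped model that owns graph state."""

    def __init__(self):
        # Assumption: captured graphs are kept in a dict keyed by some shape id.
        self.captured_graphs = {"batch_1": object(), "batch_2": object()}

    def clear_grpah_opt_backend(self):
        # Drop all captured graph state held by this model.
        self.captured_graphs.clear()


class ConditionalGenerationWrapper:
    """Hypothetical top-level class that wraps the inner model."""

    def __init__(self):
        self.model = InnerModel()

    def clear_grpah_opt_backend(self):
        # Delegate: the wrapper holds no graph state of its own,
        # so it forwards the call to the underlying model.
        self.model.clear_grpah_opt_backend()


wrapper = ConditionalGenerationWrapper()
wrapper.clear_grpah_opt_backend()
remaining = len(wrapper.model.captured_graphs)
```

Keeping the misspelled method name on the wrapper means callers that already invoke `clear_grpah_opt_backend` on other models can treat this class the same way.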
Merged commit c55f2e0 into PaddlePaddle:release/2.5
Motivation
Cherry-pick from develop branch.
Add the clear_grpah_opt_backend method to Qwen3VLForConditionalGeneration to support clearing the CUDA graph optimization backend, enabling proper cleanup when switching inference backends.
Modifications
Add a clear_grpah_opt_backend method in Qwen3VLForConditionalGeneration that delegates to the underlying model to clear the CUDA graph optimization backend.
Usage or Command
N/A
Accuracy Tests
No model output changes involved.
Checklist