Fix integer overflow in RKNPU implicit bias allocation by GopalakrishnanN · Pull Request #29249 · microsoft/onnxruntime

GopalakrishnanN · 2026-06-24T20:37:41Z

Description

The RKNPU execution provider's ONNX converter allocates implicit (all-zero)
bias buffers with malloc(sizeof(T) * dim) when a Conv / Gemm /
QLinearConv node omits its bias input. dim comes from a model weight's
shape (an attacker-controlled uint32_t after RKNPU shape conversion), and the
allocation result was never checked before memset.

This PR hardens the converter in two places:

validates ONNX int64_t dimensions before narrowing them to the RKNPU
uint32_t shape representation, rejecting negative or out-of-range values;
routes all four implicit-bias allocation sites (AddLayerConvImpl,
AddLayerQLinearConvImpl, AddLayerDepthwiseConvImpl, AddLayerFC) through
a shared AllocZeroedBias helper that computes byte counts with
SafeInt<size_t> and null-checks malloc before zeroing.

Both overflow/OOM error paths include element_size, count, and byte-count
context where available.

Motivation and Context

A malicious ONNX model can provide dimensions that are not safe for the RKNPU
converter's 32-bit shape representation or for byte-size allocation arithmetic:

ONNX stores dimensions as int64_t, while the RKNPU converter/DDK uses
uint32_t shape values. Silently narrowing a large or negative int64_t
value can produce a misleading uint32_t dimension.
Even after a dimension is represented as uint32_t, sizeof(T) * dim can
overflow size_t on 32-bit RKNPU targets. For example, dim = 0x40000400
makes sizeof(float) * dim wrap to 4096 bytes while the created tensor
still advertises dim elements, causing a heap buffer overflow when the bias
is consumed by the driver.
On 64-bit hosts, very large dimensions can instead lead to a failing malloc
whose NULL result was previously dereferenced by memset.

SafeInt<size_t> is the ORT-standard idiom for memory-size arithmetic, and the
new dimension helper rejects invalid ONNX dimensions before they enter the
RKNPU uint32_t shape model.

Validation: the RKNPU EP requires the Rockchip DDK (RKNPU_DDK_PATH) and an
ARM target, so it does not build on a typical x64 dev box and has no GPU-free CI
leg. The change was validated with clang-format, git diff --check, and a
standalone test against ORT's SafeInt.hpp, confirming the guard throws on the
overflowing dimensions (0x40000400, 0xFFFFFFFF) while preserving normal and
zero-dimension behavior.

Testing: No unit test is included because the RKNPU EP is not compiled in
any CI leg (it requires the proprietary Rockchip DDK and an ARM target), and the
ToRknpuDim / AllocZeroedBias helpers have internal (file-static) linkage,
so they are not reachable from the gtest suites. The overflow/validation logic
was instead exercised with a standalone test against ORT's SafeInt.hpp as
noted above.

The RKNPU converter built implicit (all-zero) bias buffers with malloc(sizeof(T) * dim), where dim is an attacker-controlled uint32_t taken from a model weight's shape, and never checked the result before memset. On 32-bit RKNPU targets (size_t is 32-bit) a crafted dim such as 0x40000400 makes sizeof(float) * dim wrap to 4096 bytes while the tensor still advertises dim elements, yielding a heap buffer overflow when the bias is consumed. On 64-bit hosts a large dim instead produces a failing malloc whose NULL return was dereferenced by the following memset. Route all four bias sites (Conv, QLinearConv, DepthwiseConv, FC) through a single AllocZeroedBias helper that computes the byte count with SafeInt<size_t> (throws on overflow) and null-checks the allocation before zeroing.

Copilot

Pull request overview

This PR hardens the RKNPU execution provider’s ONNX-to-RKNN converter against attacker-controlled shape dimensions by preventing sizeof(T) * dim wraparound and by avoiding memset on a null allocation when creating implicit (all-zero) bias buffers.

Changes:

Add a shared AllocZeroedBias helper that computes allocation sizes with SafeInt<size_t> and checks malloc results before zeroing.
Route implicit-bias allocations in Conv/QLinearConv/DepthwiseConv/FC layer builders through the helper.
Include ORT’s safeint.h to use the standard overflow-checked arithmetic idiom.

Address review feedback: AllocZeroedBias now reports element_size, count, and the computed byte count in both the SafeInt-overflow and malloc-failure error paths, so an oversized/malicious model or OOM is easier to diagnose.

Reject negative and out-of-range ONNX dimensions before converting them to the uint32_t shape representation used by the RKNPU converter and DDK. This prevents malicious int64 ONNX dimensions from silently truncating at model ingestion.

Address review feedback by replacing the explicit invalid_argument throw in ToRknpuDim with ORT_ENFORCE and including core/common/common.h directly.

Keep the allocation overflow/OOM diagnostic context while replacing direct std::runtime_error throws in AllocZeroedBias with ORT_THROW and ORT_ENFORCE.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.

Address review feedback by replacing the raw try/catch in AllocZeroedBias with ORT_TRY/ORT_CATCH so the code remains portable to ORT_NO_EXCEPTIONS builds.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

GopalakrishnanN requested a review from Copilot June 24, 2026 23:29

Copilot started reviewing on behalf of GopalakrishnanN June 24, 2026 23:29 View session

GopalakrishnanN requested review from tianleiwu and yuslepukhin June 24, 2026 23:32

Copilot AI reviewed Jun 24, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/rknpu/onnx_converter.cc

Gopalakrishnan Nallasamy added 2 commits June 24, 2026 17:04

Add size/count context to RKNPU bias allocation errors

8be5a2e

Address review feedback: AllocZeroedBias now reports element_size, count, and the computed byte count in both the SafeInt-overflow and malloc-failure error paths, so an oversized/malicious model or OOM is easier to diagnose.

Validate RKNPU tensor dimensions before narrowing

b153a6d

Reject negative and out-of-range ONNX dimensions before converting them to the uint32_t shape representation used by the RKNPU converter and DDK. This prevents malicious int64 ONNX dimensions from silently truncating at model ingestion.

xadupre reviewed Jun 25, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/rknpu/onnx_converter.cc Outdated

Gopalakrishnan Nallasamy added 2 commits June 25, 2026 14:23

Use ORT_ENFORCE for RKNPU dimension validation

ead36d1

Address review feedback by replacing the explicit invalid_argument throw in ToRknpuDim with ORT_ENFORCE and including core/common/common.h directly.

Use ORT macros for RKNPU bias allocation errors

18bb9fe

Keep the allocation overflow/OOM diagnostic context while replacing direct std::runtime_error throws in AllocZeroedBias with ORT_THROW and ORT_ENFORCE.

tianleiwu requested a review from Copilot June 25, 2026 22:09

Copilot started reviewing on behalf of tianleiwu June 25, 2026 22:10 View session

Copilot AI reviewed Jun 25, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/rknpu/onnx_converter.cc

Use ORT_TRY for RKNPU bias size overflow handling

e9d8a06

Address review feedback by replacing the raw try/catch in AllocZeroedBias with ORT_TRY/ORT_CATCH so the code remains portable to ORT_NO_EXCEPTIONS builds.

GopalakrishnanN requested a review from Copilot June 25, 2026 22:27

Copilot started reviewing on behalf of GopalakrishnanN June 25, 2026 22:27 View session

Copilot AI reviewed Jun 25, 2026

View reviewed changes

GopalakrishnanN requested a review from xadupre June 25, 2026 23:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix integer overflow in RKNPU implicit bias allocation#29249

Fix integer overflow in RKNPU implicit bias allocation#29249
GopalakrishnanN wants to merge 6 commits into
mainfrom
GopalakrishnanN/rknpu-bias-overflow-fix

GopalakrishnanN commented Jun 24, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

GopalakrishnanN commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

GopalakrishnanN commented Jun 24, 2026 •

edited

Loading