ICM fixes (5/n) by hariharans29 · Pull Request #27971 · microsoft/onnxruntime

hariharans29 · 2026-04-03T17:57:53Z

Description

Fixes 3 ICM fixes:

https://portal.microsofticm.com/imp/v5/incidents/details/31000000572208/summary
https://portal.microsofticm.com/imp/v5/incidents/details/31000000573313/summary
https://portal.microsofticm.com/imp/v5/incidents/details/31000000575583/summary

Motivation and Context

Fix ICM issues

Copilot

Pull request overview

This PR addresses multiple ICM-reported robustness issues by adding missing input/attribute validation to CPU ML/math kernels and strengthening overflow handling in RotaryEmbedding, with new negative tests to prevent regressions.

Changes:

Add validation for LinearRegressor input rank and coefficient vector sizing, plus a regression test for undersized coefficients.
Tighten CumSum axis tensor validation to reject empty/multi-element axis inputs, plus a regression test.
Add overflow checks in RotaryEmbedding (ONNX + contrib) and add tests for overflow/invalid sizing scenarios.

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
onnxruntime/core/providers/cpu/ml/linearregressor.cc	Adds extra input/attribute validation and coefficient-size verification.
onnxruntime/test/providers/cpu/ml/linearregressor_test.cc	Adds failure test for undersized `coefficients`.
onnxruntime/core/providers/cpu/math/cumsum.h	Rejects axis tensors whose element-count is not exactly 1.
onnxruntime/test/providers/cpu/math/cumsum_test.cc	Adds failure test for empty axis tensor.
onnxruntime/core/providers/cpu/llm/rotary_embedding.cc	Adds ptrdiff overflow-checked loop bound calculations and safer iteration types.
onnxruntime/core/providers/cpu/llm/rotary_embedding_helper.h	Adds int32 narrowing/checked-multiply utilities and uses them during input validation.
onnxruntime/test/providers/cpu/llm/rotary_embedding_op_test.cc	Adds regression tests for overflow/shape-validation behavior.
onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc	Same overflow-checked loop bound updates for contrib RotaryEmbedding.
onnxruntime/contrib_ops/cpu/bert/rotary_embedding_helper.h	Same int32 narrowing/checked-multiply utilities for contrib input validation.
onnxruntime/test/contrib_ops/rotary_embedding_op_test.cc	Adds contrib regression tests for overflow/shape-validation behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

github-actions

You can commit the suggested changes from lintrunner.

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 7 comments.

Comments suppressed due to low confidence (2)

onnxruntime/core/providers/cpu/llm/rotary_embedding.cc:120

cost is computed using int arithmetic (head_size * sizeof(T) * 2 + rotary_emb_dim * 32) before being cast to double, which can overflow for large head_size/rotary_emb_dim and potentially yield a negative/incorrect cost passed to TryParallelFor. Consider promoting operands to double/int64_t before multiplying/adding so the cost calculation itself can’t overflow.

  // The cost is calculated as:
  //   - head_size * sizeof(T) for reading input
  //   - head_size * sizeof(T) for writing output
  //   - rotary_emb_dim * 32 for the rotary embedding operations (32 is an approximation of the number of CPU cycles)
  const double cost = static_cast<double>(head_size * sizeof(T) * 2 + rotary_emb_dim * 32);

onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc:142

cost is computed with int arithmetic (head_size * sizeof(T) * 2 + rotary_emb_dim * 32) before converting to double, so it can overflow for large shapes and pass an incorrect/negative cost into TryParallelFor. Consider promoting operands to double/int64_t before the arithmetic to avoid overflow in the cost computation itself.

  // The cost is calculated as:
  //   - head_size * sizeof(T) for reading input
  //   - head_size * sizeof(T) for writing output
  //   - rotary_emb_dim * 32 for the rotary embedding operations (32 is an approximation of the number of CPU cycles)
  const double cost = static_cast<double>(head_size * sizeof(T) * 2 + rotary_emb_dim * 32);

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

github-actions

You can commit the suggested changes from lintrunner.

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

…into hari/icm_5

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 6 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 12 out of 12 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tianleiwu

Review Summary

Solid hardening PR — the checked arithmetic approach (NarrowNonNegativeToInt32, CheckedMulToInt32, CheckedMulToPtrdiff) is thorough and well-structured. CumSum empty-axis fix and LinearRegressor validations are clean. Test coverage is good across all three fix areas.

One high-priority concern: the contrib RotaryEmbedding helper has a potential division-by-zero that the llm variant correctly guards against (see inline comment).

tianleiwu · 2026-04-09T20:29:01Z

+    ORT_RETURN_IF_ERROR(detail::NarrowNonNegativeToInt32(cos_cache_dims[1], "cache_width", cache_width));
+    ORT_RETURN_IF_ERROR(detail::CheckedMulToInt32(cache_width, 2, "head_size", head_size));
+  } else {
+    if (!transposed && hidden_size % num_heads != 0) {


⚠️ Division-by-zero when num_heads == 0: This hidden_size % num_heads can be undefined behavior if num_heads is 0 (its default attribute value). The equivalent code in the llm variant (core/providers/cpu/llm/rotary_embedding_helper.h) correctly adds a num_heads <= 0 guard before any division:

if (num_heads <= 0) { return ORT_MAKE_STATUS(ONNXRUNTIME, INVALID_ARGUMENT, "RotaryEmbedding: num_heads must be greater than 0 for rank-3 input"); }

The same guard should be added here before the modulo check, for rank-3 input when rotary_embedding_dim > 0.

tianleiwu · 2026-04-09T20:49:15Z

  const int rotary_emb_dim = parameters.rotary_embedding_dim;
  const int half_rotary_emb_dim = rotary_emb_dim / 2;

+  std::ptrdiff_t position_count = 0;


How about using SafeInt directly

std::ptrdiff_t position_count = SafeInt<std::ptrdiff_t>(batch_size) * sequence_length;

SafeInt will call ORT_THROW if there is overflow.

Icm fixes

c6a005a

hariharans29 requested a review from Copilot April 3, 2026 17:57

Copilot started reviewing on behalf of hariharans29 April 3, 2026 17:59 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

Copilot comments

0e6a670

hariharans29 requested a review from Copilot April 3, 2026 18:13

Copilot started reviewing on behalf of hariharans29 April 3, 2026 18:15 View session

github-actions bot reviewed Apr 3, 2026

View reviewed changes

Comment thread onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc Outdated

Update onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc

2e91a3c

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Copilot AI reviewed Apr 3, 2026

View reviewed changes

hariharans29 added 3 commits April 3, 2026 11:28

Fix builds and address copilot comments

fbc3ec5

Fix

ca967f6

Builds

12aeda4

github-actions bot reviewed Apr 3, 2026

View reviewed changes

Comment thread onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc Outdated

Comment thread onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc Outdated

Comment thread onnxruntime/core/providers/cpu/llm/rotary_embedding.cc Outdated

hariharans29 and others added 5 commits April 3, 2026 11:36

Update onnxruntime/core/providers/cpu/llm/rotary_embedding.cc

1939b19

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc

9534eb5

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update onnxruntime/contrib_ops/cpu/bert/rotary_embedding.cc

9791dfd

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Fix builds

6bc00a7

Merge branch 'hari/icm_5' of https://github.com/microsoft/onnxruntime …

ad9f077

…into hari/icm_5

hariharans29 requested a review from Copilot April 3, 2026 19:02

Copilot started reviewing on behalf of hariharans29 April 3, 2026 19:04 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

Comment thread onnxruntime/test/providers/cpu/ml/linearregressor_test.cc

Comment thread onnxruntime/core/providers/cpu/llm/rotary_embedding_helper.h Outdated

Comment thread onnxruntime/core/providers/cpu/llm/rotary_embedding.cc Outdated

hariharans29 added 2 commits April 3, 2026 12:15

Latest comments

b1117d4

Merge remote-tracking branch 'origin' into hari/icm_5

8e8a794

hariharans29 requested a review from Copilot April 3, 2026 19:26

Copilot started reviewing on behalf of hariharans29 April 3, 2026 19:28 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

hariharans29 added 3 commits April 3, 2026 14:12

Copilot comments

996ea6f

Cumcum failure

c592d34

Fix builds

7451bf5

hariharans29 requested a review from Copilot April 3, 2026 22:05

Copilot started reviewing on behalf of hariharans29 April 3, 2026 22:12 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

hariharans29 added 2 commits April 3, 2026 17:43

Copilot comments + Fix CUDA builds

bc29793

More changes

ea039d4

hariharans29 requested a review from Copilot April 4, 2026 00:44

Copilot started reviewing on behalf of hariharans29 April 4, 2026 00:45 View session

Copilot AI reviewed Apr 4, 2026

View reviewed changes

Comment thread onnxruntime/test/providers/cpu/llm/rotary_embedding_op_test.cc Outdated

Comment thread onnxruntime/test/contrib_ops/rotary_embedding_op_test.cc Outdated

fix stale tests

d209b25

hariharans29 requested a review from tianleiwu April 8, 2026 23:11

tianleiwu reviewed Apr 9, 2026

View reviewed changes

Conversation

hariharans29 commented Apr 3, 2026

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

tianleiwu left a comment

Choose a reason for hiding this comment

Review Summary

Uh oh!

tianleiwu Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

tianleiwu Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tianleiwu Apr 9, 2026 •

edited

Loading