Skip to content

vpto fix trowexpand op for col major tile#828

Open
Likai-19 wants to merge 1 commit into
hw-native-sys:mainfrom
Likai-19:vpto_row_expand_fix
Open

vpto fix trowexpand op for col major tile#828
Likai-19 wants to merge 1 commit into
hw-native-sys:mainfrom
Likai-19:vpto_row_expand_fix

Conversation

@Likai-19

Copy link
Copy Markdown

问题总结

trowexpand 系列 op 的 TileLang 模板及 DSL lowering 层在处理 src1 为 col major [M, 1] 时存在两类缺陷:

  1. 模板层trowexpandmul/trowexpandsub/trowexpanddiv 三个模板仅实现了 row_major 路径,对 src1[row, :] 使用 vlds 对齐加载。当 src1 为 col major 时,tile 切片地址不满足 512B 对齐要求,触发 A5 error 340。

  2. lowering 层vldas/vldus 指令的 IR 生成路径中,将 tile 先转为 memref 再通过 subview + castptr 获取指针的方式存在结构性问题——vldas 只接受 !pto.ptr 类型,memref 中间表示引入了不必要的类型转换,且对 col major tile 的地址计算逻辑不正确。

修改方案

1. 模板层 — 新增 col major 非对齐访问路径

在三个模板中分别添加 col_major 分支。通过 pto.constexpr(src1.config.b_layout == pto.BLayout.COL_MAJOR) 在编译期选择路径:

  • col major 路径:使用 vldas + vldus 非对齐加载流水线替代 vlds

    align_src1 = pto.vldas(src1[row, :])        # 非对齐加载请求
    scalar_vec, _ = pto.vldus(src1[row, :], align_src1)  # 非对齐数据读取
    broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

    vldas/vldus 每行仅调用一次,broadcast 到全掩码后复用于该行所有列迭代。

  • row major 路径:保持原有 vlds 对齐访问逻辑不变。

此修改同时覆盖了 trowexpandmul(f16/f32)、trowexpandsub(f16/f32)、trowexpanddiv(f16/f32 及 high_precision 变体)所有精度和模式组合。

2. lowering 层 — tile 直接转 ptr 并修正地址计算

_AuthoringRenderer 中:

  • 新增 _materialize_tile_ptr() 方法:通过 pto.tile_buf_addr 将 tile 直接转换为 !pto.ptr<...> 类型,跳过 memref 中间表示。结果缓存在 _tile_ptr_cache 中。

  • 修正 vldasvldus 的 2D tile 地址计算:不再使用 _materialize_rank2_tile_subview + _materialize_copy_buffer_ptr 的 memref 路径,改为直接用 arith.muli + arith.addi 计算元素偏移(offset = row * shape[1] + col),再通过 pto.addptr 获取目标地址。此公式对 row major [M, N] 和 col major [M, 1] 均适用。

  • 两个指令的 lowering 逻辑保持一致:tile → ptr → 偏移计算 → addptr → vldas/vldus。

3. 涉及文件

文件 变更内容
lib/TileOps/trowexpanddiv_template.py 添加 col major 非对齐加载路径,覆盖 f32/f16 × default/high_precision 共 4 条路径
lib/TileOps/trowexpandmul_template.py 添加 col major 非对齐加载路径
lib/TileOps/trowexpandsub_template.py 添加 col major 非对齐加载路径
tilelang-dsl/python/tilelang_dsl/lowering.py 新增 _materialize_tile_ptr();修正 vldas/vldus 的 2D tile 地址计算逻辑

@reedhecre

reedhecre commented Jun 16, 2026

Copy link
Copy Markdown

Codex Review

该评论由 review 机器人自动更新。

  • PR: vpto fix trowexpand op for col major tile #828 vpto fix trowexpand op for col major tile
  • Author: Likai-19
  • Base/Head: main / vpto_row_expand_fix
  • Head SHA: 3eaf1e3aa356
  • Trigger: 检测到新的 open PR
  • Generated At: 2026-06-16T13:36:07Z
  • Status: failed at codex-review (exit=1)

Summary

Review failed at stage codex-review: exit=1

Findings

未生成结构化 findings,因为 review 过程提前失败。

Log Tail

===== STAGE clone @ 2026-06-16 21:35:29 =====
set -euo pipefail
rm -rf '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
git clone --branch 'main' --depth 50 'https://github.com/hw-native-sys/PTOAS.git' '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
cd '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
git fetch origin 'refs/pull/828/head:pr-828' --depth 50
git fetch origin 'main' --depth 50 || true
git checkout -f 'pr-828'
git rev-parse HEAD
git diff --stat 'origin/main...HEAD' || true
Cloning into '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'...
From https://github.com/hw-native-sys/PTOAS
 * [new ref]         refs/pull/828/head -> pr-828
From https://github.com/hw-native-sys/PTOAS
 * branch            main       -> FETCH_HEAD
Switched to branch 'pr-828'
3eaf1e3aa356d4bfa67b87a01750980f7865c084
 lib/TileOps/trowexpanddiv_template.py        | 154 ++++++++++++++++++---------
 lib/TileOps/trowexpandmul_template.py        |  45 +++++---
 lib/TileOps/trowexpandsub_template.py        |  45 +++++---
 tilelang-dsl/python/tilelang_dsl/lowering.py | 115 ++++++++++++++++----
 4 files changed, 265 insertions(+), 94 deletions(-)
===== END STAGE clone rc=0 @ 2026-06-16 21:35:33 =====

===== STAGE codex-review @ 2026-06-16 21:35:33 =====
set -euo pipefail
cd '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
'codex' exec -C '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo' -s read-only -c 'model_provider="codereview"' -c 'model="gpt-5.4"' -c 'model_reasoning_effort="xhigh"' --output-schema '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/review_schema.json' -o '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/codex_last_message.json' --color never - < '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/review_prompt.txt'
OpenAI Codex v0.115.0 (research preview)
--------
workdir: /tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo
model: gpt-5.4
provider: codereview
approval: never
sandbox: read-only
reasoning effort: xhigh
reasoning summaries: none
session id: 019ed0a4-fc36-70c0-a5f8-abbaaae1bcd9
--------
user
你现在在审查 GitHub PR。

仓库:hw-native-sys/PTOAS
PR:#828 vpto fix trowexpand op for col major tile
作者:Likai-19
base branch:origin/main
head branch:HEAD(当前已 checkout 到 PR head)

要求:
1. 只审查这个 PR 相对 origin/main 的改动,必要时可以看上下文文件。
2. 重点找真实的 correctness / regression / contract mismatch / CI / runtime / compatibility 问题。
3. 不要提纯风格建议,不要提低价值猜测。
4. 严格按优先级输出:
   - P1:高概率会导致错误结果、编译/运行失败、严重回归、发布阻断
   - P2:重要缺陷、行为回归、遗漏校验/测试、较大兼容性问题
   - P3:次要但明确可改的问题
5. 如果没有问题,summary 直接写:未检查到 PR #828 存在问题,并返回 findings=[]。
6. 如果有问题,summary 简洁概括,findings 里每条都要给出:
   - severity
   - title
   - body(说明为什么是问题,尽量具体)
   - file(尽量给相对路径)
   - line(能确定就填整数,否则 null)

建议先查看:
- git status --short
- git diff --stat origin/main...HEAD
- git diff --unified=80 origin/main...HEAD

最终输出必须严格匹配 JSON schema。

mcp startup: no servers
Reconnecting... 1/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: a736196a-f9e7-4526-b0b9-ffb4908e4737)
Reconnecting... 2/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: 51ad90d6-4ce9-4e28-9627-e6f59a0c065e)
Reconnecting... 3/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: d7670e41-1f08-48f6-ace0-2422572c43f0)
Reconnecting... 4/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: c702a25d-2336-4468-a893-47428705f4eb)
Reconnecting... 5/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: 2931326e-fd15-4bc2-baf4-1d267e34ede0)
ERROR: unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: d6152518-6bbc-42cd-8c91-07cdc6904aa7
Warning: no last agent message; wrote empty content to /tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/codex_last_message.json
===== END STAGE codex-review rc=1 @ 2026-06-16 21:36:07 =====

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for column-major layouts in trowexpanddiv, trowexpandmul, and trowexpandsub templates by utilizing an unaligned load pipeline (vldas + vldus) to prevent unaligned memory access errors. Additionally, the lowering logic in tilelang_dsl/lowering.py is updated to materialize tile pointers directly via pto.tile_buf_addr instead of using the broken memref-to-subview path. The code reviewer identified critical issues where pto.pset_b32(pto.PAT.ALL) was incorrectly hardcoded for 16-bit float operations in trowexpanddiv_f16 and for generic types in trowexpandmul and trowexpandsub, which would lead to compilation or masking errors. They suggested using pto.pset_b16 for 16-bit types and dynamically resolving the mask size based on the element type in generic templates.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

for row in range(0, valid_rows, 1):
align_src1 = pto.vldas(src1[row, :])
scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

In template_trowexpanddiv_f16, the element type is f16 (16-bit). Therefore, the mask used for broadcasting should be pto.pset_b16(pto.PAT.ALL) instead of pto.pset_b32(pto.PAT.ALL). Using pset_b32 on f16 data will lead to incorrect masking or compilation errors.

Suggested change
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
broadcasted = pto.vdup(scalar_vec, pto.pset_b16(pto.PAT.ALL))

for row in range(0, valid_rows, 1):
align_src1 = pto.vldas(src1[row, :])
scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

In template_trowexpanddiv_f16, the element type is f16 (16-bit). Therefore, the mask used for broadcasting should be pto.pset_b16(pto.PAT.ALL) instead of pto.pset_b32(pto.PAT.ALL). Using pset_b32 on f16 data will lead to incorrect masking or compilation errors.

Suggested change
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
broadcasted = pto.vdup(scalar_vec, pto.pset_b16(pto.PAT.ALL))

# vldas+vldus once per row, broadcast across all col iterations
align_src1 = pto.vldas(src1[row, :])
scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

Since template_trowexpandmul is generic and supports multiple data types (including f16, i16, i8), hardcoding pto.pset_b32(pto.PAT.ALL) will cause incorrect masking or compilation failures for non-32-bit types. We should dynamically select the mask based on the element size of dtype using pto.constexpr and pto.get_lanes(dtype).

Suggested change
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
if pto.constexpr(pto.get_lanes(dtype) == 8):
mask_all = pto.pset_b32(pto.PAT.ALL)
elif pto.constexpr(pto.get_lanes(dtype) == 16):
mask_all = pto.pset_b16(pto.PAT.ALL)
else:
mask_all = pto.pset_b8(pto.PAT.ALL)
broadcasted = pto.vdup(scalar_vec, mask_all)

# vldas+vldus once per row, broadcast across all col iterations
align_src1 = pto.vldas(src1[row, :])
scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

Since template_trowexpandsub is generic and supports multiple data types (including f16, i16, i8), hardcoding pto.pset_b32(pto.PAT.ALL) will cause incorrect masking or compilation failures for non-32-bit types. We should dynamically select the mask based on the element size of dtype using pto.constexpr and pto.get_lanes(dtype).

Suggested change
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
if pto.constexpr(pto.get_lanes(dtype) == 8):
mask_all = pto.pset_b32(pto.PAT.ALL)
elif pto.constexpr(pto.get_lanes(dtype) == 16):
mask_all = pto.pset_b16(pto.PAT.ALL)
else:
mask_all = pto.pset_b8(pto.PAT.ALL)
broadcasted = pto.vdup(scalar_vec, mask_all)

@Likai-19

Copy link
Copy Markdown
Author

Fixes #826

@Likai-19

Copy link
Copy Markdown
Author

/run a5

@reedhecre

Copy link
Copy Markdown

已接收 /run a5,A5 板测器会处理这条请求。

页面会自动刷新,可以直接看当前阶段、排队情况和最近结果。

@reedhecre

Copy link
Copy Markdown

A5 板测失败

  • 触发方式:manual
  • 源码提交:f12a092580fe
  • 结果汇总:OK 155 / FAIL 68 / SKIP 1
  • 日志:/root/ptoas-board-monitor-a5/logs/20260616_214112_manual_pr828.log
  • 手动指令:/run a5
  • 触发人:Likai-19
  • 触发评论:vpto fix trowexpand op for col major tile #828 (comment)
  • 失败阶段:board-validation / exit=1

失败用例

  • test_tmov_row_major_1x16_control_a5 (run, exit=2)
  • test_tmov_col_major_16x1_align_a5 (run, exit=2)
  • test_dynamic_valid_shape (run, exit=2)
  • test_barrier_sync (run, exit=2)
  • test_auto_sync_tail_hint (run, exit=2)
  • rmsnorm_incore_0 (run, exit=2)
  • rar_optimization_test (run, exit=2)
  • nested_loop_confliect (run, exit=2)
  • matmul (run, exit=2)
  • decode_projection_incore_0 (run, exit=2)
  • compensation_test (run, exit=2)
  • add_double_dynamic (run, exit=2)
  • rowexpandsub (run, exit=139)
  • rems (run, exit=2)
  • rem (run, exit=2)
  • rope_kv_cache (run, exit=2)
  • rmsnorm (run, exit=2)
  • qwen3_decode_incore_7 (run, exit=2)
  • qwen3_decode_incore_6 (run, exit=2)
  • qwen3_decode_incore_5 (run, exit=2)
  • qwen3_decode_incore_4 (run, exit=2)
  • qwen3_decode_incore_2 (run, exit=2)
  • qwen3_decode_incore_1 (run, exit=2)
  • qwen3_decode_incore_12 (run, exit=2)
  • qwen3_decode_incore_11 (run, exit=2)
  • qwen3_decode_incore_10 (run, exit=2)
  • post_rmsnorm (run, exit=2)
  • vector_example_dag_kernel_mul (run, exit=2)
  • vector_example_dag_kernel_add_scalar (run, exit=2)
  • vector_example_dag_kernel_add (run, exit=2)
  • paged_attention_example_kernel_softmax_prepare (run, exit=2)
  • paged_attention_example_kernel_qk_matmul (run, exit=2)
  • paged_attention_example_kernel_pv_matmul (run, exit=2)
  • paged_attention_example_kernel_online_update (run, exit=2)
  • paged_attention_example_kernel_init_inplace (run, exit=2)
  • orchestration_example_kernel_mul (run, exit=2)
  • orchestration_example_kernel_add_scalar (run, exit=2)
  • orchestration_example_kernel_add (run, exit=2)
  • prelu (run, exit=2)
  • plan_memory_reuse_sequential (run, exit=2)
  • plan_memory_peak_exact_capacity (run, exit=2)
  • plan_memory_peak_8_overlapping (run, exit=2)
  • plan_memory_no_reuse_overlap (run, exit=2)
  • plan_memory_nested_loops (run, exit=2)
  • plan_memory_loop_no_reuse_outer_live (run, exit=2)
  • plan_memory_loop_in_if (run, exit=2)
  • plan_memory_if_yield (run, exit=2)
  • plan_memory_if_in_loop (run, exit=2)
  • plan_memory_fragmentation_two_holes (run, exit=2)
  • plan_memory_fragmentation_hole_fit (run, exit=2)
  • plan_memory_for_iter_args_yield (run, exit=2)
  • plan_memory_bind_tile_alias_liveness (run, exit=2)
  • partition_view_verify_valid (run, exit=2)
  • partition_view_verify_rank_mismatch_valid (run, exit=2)
  • partition5d_dynamic_a5 (run, exit=2)
  • partition5d_a5 (run, exit=2)
  • tensor_view_layout_dn (run, exit=2)
  • fillpad (run, exit=139)
  • sparse_attn_test_incore_7 (run, exit=2)
  • decode_swa_test_incore_40 (run, exit=2)
  • decode_hca_test_incore_54 (run, exit=2)
  • decode_csa_test_incore_81 (run, exit=2)
  • attention_swa_test_incore_40 (run, exit=2)
  • attention_hca_test_incore_54 (run, exit=2)
  • attention_csa_test_refresh_incore_81 (run, exit=2)
  • tgather_root_binding (run, exit=139)
  • cmps (run, exit=2)
  • cmp (run, exit=2)

@reedhecre

Copy link
Copy Markdown

A5 板测失败详情:PR #828

test_tmov_row_major_1x16_control_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_tmov_row_major_1x16_control_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_tmov_row_major_1x16_control_a5.dir/build.make:98: test_tmov_row_major_1x16_control_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_tmov_row_major_1x16_control_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:52:45] ERROR: testcase failed (exit 2): test_tmov_row_major_1x16_control_a5
test_tmov_col_major_16x1_align_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_tmov_col_major_16x1_align_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_tmov_col_major_16x1_align_a5.dir/build.make:98: test_tmov_col_major_16x1_align_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_tmov_col_major_16x1_align_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:52:47] ERROR: testcase failed (exit 2): test_tmov_col_major_16x1_align_a5
test_dynamic_valid_shape

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_dynamic_valid_shape_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_dynamic_valid_shape.dir/build.make:98: test_dynamic_valid_shape] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_dynamic_valid_shape.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:40] ERROR: testcase failed (exit 2): test_dynamic_valid_shape
test_barrier_sync

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_barrier_sync_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_barrier_sync.dir/build.make:98: test_barrier_sync] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_barrier_sync.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:43] ERROR: testcase failed (exit 2): test_barrier_sync
test_auto_sync_tail_hint

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_auto_sync_tail_hint_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_auto_sync_tail_hint.dir/build.make:98: test_auto_sync_tail_hint] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_auto_sync_tail_hint.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:45] ERROR: testcase failed (exit 2): test_auto_sync_tail_hint
rmsnorm_incore_0

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librmsnorm_incore_0_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rmsnorm_incore_0.dir/build.make:98: rmsnorm_incore_0] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rmsnorm_incore_0.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:43] ERROR: testcase failed (exit 2): rmsnorm_incore_0
rar_optimization_test

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librar_optimization_test_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rar_optimization_test.dir/build.make:98: rar_optimization_test] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rar_optimization_test.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:46] ERROR: testcase failed (exit 2): rar_optimization_test
nested_loop_confliect

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libnested_loop_confliect_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/nested_loop_confliect.dir/build.make:98: nested_loop_confliect] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/nested_loop_confliect.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:48] ERROR: testcase failed (exit 2): nested_loop_confliect
matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libmatmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/matmul.dir/build.make:98: matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:51] ERROR: testcase failed (exit 2): matmul
decode_projection_incore_0

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_projection_incore_0_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_projection_incore_0.dir/build.make:98: decode_projection_incore_0] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_projection_incore_0.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:54] ERROR: testcase failed (exit 2): decode_projection_incore_0
compensation_test

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libcompensation_test_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/compensation_test.dir/build.make:98: compensation_test] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/compensation_test.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:56] ERROR: testcase failed (exit 2): compensation_test
add_double_dynamic

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libadd_double_dynamic_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/add_double_dynamic.dir/build.make:98: add_double_dynamic] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/add_double_dynamic.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:59] ERROR: testcase failed (exit 2): add_double_dynamic
rowexpandsub

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1245055 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:04:25] ERROR: testcase failed (exit 139): rowexpandsub
rems

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librems_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rems.dir/build.make:98: rems] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rems.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:30] ERROR: testcase failed (exit 2): rems
rem

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librem_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rem.dir/build.make:98: rem] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rem.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:32] ERROR: testcase failed (exit 2): rem
rope_kv_cache

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librope_kv_cache_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rope_kv_cache.dir/build.make:98: rope_kv_cache] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rope_kv_cache.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:55] ERROR: testcase failed (exit 2): rope_kv_cache
rmsnorm

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librmsnorm_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rmsnorm.dir/build.make:98: rmsnorm] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rmsnorm.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:58] ERROR: testcase failed (exit 2): rmsnorm
qwen3_decode_incore_7

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_7_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_7.dir/build.make:98: qwen3_decode_incore_7] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_7.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:01] ERROR: testcase failed (exit 2): qwen3_decode_incore_7
qwen3_decode_incore_6

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_6_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_6.dir/build.make:98: qwen3_decode_incore_6] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_6.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:04] ERROR: testcase failed (exit 2): qwen3_decode_incore_6
qwen3_decode_incore_5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_5.dir/build.make:98: qwen3_decode_incore_5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:06] ERROR: testcase failed (exit 2): qwen3_decode_incore_5
qwen3_decode_incore_4

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_4_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_4.dir/build.make:98: qwen3_decode_incore_4] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_4.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:09] ERROR: testcase failed (exit 2): qwen3_decode_incore_4
qwen3_decode_incore_2

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_2_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_2.dir/build.make:98: qwen3_decode_incore_2] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_2.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:12] ERROR: testcase failed (exit 2): qwen3_decode_incore_2
qwen3_decode_incore_1

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_1_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_1.dir/build.make:98: qwen3_decode_incore_1] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_1.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:14] ERROR: testcase failed (exit 2): qwen3_decode_incore_1
qwen3_decode_incore_12

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_12_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_12.dir/build.make:98: qwen3_decode_incore_12] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_12.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:17] ERROR: testcase failed (exit 2): qwen3_decode_incore_12
qwen3_decode_incore_11

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_11_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_11.dir/build.make:98: qwen3_decode_incore_11] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_11.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:19] ERROR: testcase failed (exit 2): qwen3_decode_incore_11
qwen3_decode_incore_10

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_10_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_10.dir/build.make:98: qwen3_decode_incore_10] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_10.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:22] ERROR: testcase failed (exit 2): qwen3_decode_incore_10
post_rmsnorm

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpost_rmsnorm_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/post_rmsnorm.dir/build.make:98: post_rmsnorm] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/post_rmsnorm.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:25] ERROR: testcase failed (exit 2): post_rmsnorm
vector_example_dag_kernel_mul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_mul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_mul.dir/build.make:98: vector_example_dag_kernel_mul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_mul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:17] ERROR: testcase failed (exit 2): vector_example_dag_kernel_mul
vector_example_dag_kernel_add_scalar

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_add_scalar_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_add_scalar.dir/build.make:98: vector_example_dag_kernel_add_scalar] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_add_scalar.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:20] ERROR: testcase failed (exit 2): vector_example_dag_kernel_add_scalar
vector_example_dag_kernel_add

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_add_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_add.dir/build.make:98: vector_example_dag_kernel_add] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_add.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:22] ERROR: testcase failed (exit 2): vector_example_dag_kernel_add
paged_attention_example_kernel_softmax_prepare

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_softmax_prepare_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_softmax_prepare.dir/build.make:98: paged_attention_example_kernel_softmax_prepare] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_softmax_prepare.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:25] ERROR: testcase failed (exit 2): paged_attention_example_kernel_softmax_prepare
paged_attention_example_kernel_qk_matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_qk_matmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_qk_matmul.dir/build.make:98: paged_attention_example_kernel_qk_matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_qk_matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:28] ERROR: testcase failed (exit 2): paged_attention_example_kernel_qk_matmul
paged_attention_example_kernel_pv_matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_pv_matmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_pv_matmul.dir/build.make:98: paged_attention_example_kernel_pv_matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_pv_matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:30] ERROR: testcase failed (exit 2): paged_attention_example_kernel_pv_matmul
paged_attention_example_kernel_online_update

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_online_update_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_online_update.dir/build.make:98: paged_attention_example_kernel_online_update] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_online_update.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:33] ERROR: testcase failed (exit 2): paged_attention_example_kernel_online_update
paged_attention_example_kernel_init_inplace

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_init_inplace_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_init_inplace.dir/build.make:98: paged_attention_example_kernel_init_inplace] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_init_inplace.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:35] ERROR: testcase failed (exit 2): paged_attention_example_kernel_init_inplace
orchestration_example_kernel_mul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_mul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_mul.dir/build.make:98: orchestration_example_kernel_mul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_mul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:38] ERROR: testcase failed (exit 2): orchestration_example_kernel_mul
orchestration_example_kernel_add_scalar

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_add_scalar_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_add_scalar.dir/build.make:98: orchestration_example_kernel_add_scalar] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_add_scalar.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:41] ERROR: testcase failed (exit 2): orchestration_example_kernel_add_scalar
orchestration_example_kernel_add

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_add_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_add.dir/build.make:98: orchestration_example_kernel_add] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_add.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:43] ERROR: testcase failed (exit 2): orchestration_example_kernel_add
prelu

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libprelu_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/prelu.dir/build.make:98: prelu] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/prelu.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:04] ERROR: testcase failed (exit 2): prelu
plan_memory_reuse_sequential

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_reuse_sequential_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_reuse_sequential.dir/build.make:98: plan_memory_reuse_sequential] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_reuse_sequential.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:07] ERROR: testcase failed (exit 2): plan_memory_reuse_sequential
plan_memory_peak_exact_capacity

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_peak_exact_capacity_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_peak_exact_capacity.dir/build.make:98: plan_memory_peak_exact_capacity] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_peak_exact_capacity.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:10] ERROR: testcase failed (exit 2): plan_memory_peak_exact_capacity
plan_memory_peak_8_overlapping

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_peak_8_overlapping_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_peak_8_overlapping.dir/build.make:98: plan_memory_peak_8_overlapping] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_peak_8_overlapping.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:12] ERROR: testcase failed (exit 2): plan_memory_peak_8_overlapping
plan_memory_no_reuse_overlap

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_no_reuse_overlap_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_no_reuse_overlap.dir/build.make:98: plan_memory_no_reuse_overlap] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_no_reuse_overlap.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:15] ERROR: testcase failed (exit 2): plan_memory_no_reuse_overlap
plan_memory_nested_loops

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_nested_loops_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_nested_loops.dir/build.make:98: plan_memory_nested_loops] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_nested_loops.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:17] ERROR: testcase failed (exit 2): plan_memory_nested_loops
plan_memory_loop_no_reuse_outer_live

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_loop_no_reuse_outer_live_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_loop_no_reuse_outer_live.dir/build.make:98: plan_memory_loop_no_reuse_outer_live] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_loop_no_reuse_outer_live.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:20] ERROR: testcase failed (exit 2): plan_memory_loop_no_reuse_outer_live
plan_memory_loop_in_if

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_loop_in_if_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_loop_in_if.dir/build.make:98: plan_memory_loop_in_if] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_loop_in_if.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:22] ERROR: testcase failed (exit 2): plan_memory_loop_in_if
plan_memory_if_yield

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_if_yield_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_if_yield.dir/build.make:98: plan_memory_if_yield] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_if_yield.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:25] ERROR: testcase failed (exit 2): plan_memory_if_yield
plan_memory_if_in_loop

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_if_in_loop_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_if_in_loop.dir/build.make:98: plan_memory_if_in_loop] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_if_in_loop.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:27] ERROR: testcase failed (exit 2): plan_memory_if_in_loop
plan_memory_fragmentation_two_holes

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_fragmentation_two_holes_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_fragmentation_two_holes.dir/build.make:98: plan_memory_fragmentation_two_holes] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_fragmentation_two_holes.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:30] ERROR: testcase failed (exit 2): plan_memory_fragmentation_two_holes
plan_memory_fragmentation_hole_fit

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_fragmentation_hole_fit_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_fragmentation_hole_fit.dir/build.make:98: plan_memory_fragmentation_hole_fit] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_fragmentation_hole_fit.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:33] ERROR: testcase failed (exit 2): plan_memory_fragmentation_hole_fit
plan_memory_for_iter_args_yield

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_for_iter_args_yield_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_for_iter_args_yield.dir/build.make:98: plan_memory_for_iter_args_yield] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_for_iter_args_yield.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:35] ERROR: testcase failed (exit 2): plan_memory_for_iter_args_yield
plan_memory_bind_tile_alias_liveness

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_bind_tile_alias_liveness_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_bind_tile_alias_liveness.dir/build.make:98: plan_memory_bind_tile_alias_liveness] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_bind_tile_alias_liveness.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:38] ERROR: testcase failed (exit 2): plan_memory_bind_tile_alias_liveness
partition_view_verify_valid

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition_view_verify_valid_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition_view_verify_valid.dir/build.make:98: partition_view_verify_valid] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition_view_verify_valid.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:27] ERROR: testcase failed (exit 2): partition_view_verify_valid
partition_view_verify_rank_mismatch_valid

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition_view_verify_rank_mismatch_valid_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition_view_verify_rank_mismatch_valid.dir/build.make:98: partition_view_verify_rank_mismatch_valid] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition_view_verify_rank_mismatch_valid.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:30] ERROR: testcase failed (exit 2): partition_view_verify_rank_mismatch_valid
partition5d_dynamic_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition5d_dynamic_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition5d_dynamic_a5.dir/build.make:98: partition5d_dynamic_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition5d_dynamic_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:32] ERROR: testcase failed (exit 2): partition5d_dynamic_a5
partition5d_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition5d_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition5d_a5.dir/build.make:98: partition5d_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition5d_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:35] ERROR: testcase failed (exit 2): partition5d_a5
tensor_view_layout_dn

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtensor_view_layout_dn_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/tensor_view_layout_dn.dir/build.make:98: tensor_view_layout_dn] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/tensor_view_layout_dn.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:16:27] ERROR: testcase failed (exit 2): tensor_view_layout_dn
fillpad

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1295964 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:18:20] ERROR: testcase failed (exit 139): fillpad
sparse_attn_test_incore_7

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libsparse_attn_test_incore_7_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/sparse_attn_test_incore_7.dir/build.make:98: sparse_attn_test_incore_7] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/sparse_attn_test_incore_7.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:20:59] ERROR: testcase failed (exit 2): sparse_attn_test_incore_7
decode_swa_test_incore_40

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_swa_test_incore_40_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_swa_test_incore_40.dir/build.make:98: decode_swa_test_incore_40] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_swa_test_incore_40.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:02] ERROR: testcase failed (exit 2): decode_swa_test_incore_40
decode_hca_test_incore_54

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_hca_test_incore_54_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_hca_test_incore_54.dir/build.make:98: decode_hca_test_incore_54] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_hca_test_incore_54.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:05] ERROR: testcase failed (exit 2): decode_hca_test_incore_54
decode_csa_test_incore_81

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_csa_test_incore_81_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_csa_test_incore_81.dir/build.make:98: decode_csa_test_incore_81] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_csa_test_incore_81.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:07] ERROR: testcase failed (exit 2): decode_csa_test_incore_81
attention_swa_test_incore_40

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_swa_test_incore_40_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_swa_test_incore_40.dir/build.make:98: attention_swa_test_incore_40] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_swa_test_incore_40.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:10] ERROR: testcase failed (exit 2): attention_swa_test_incore_40
attention_hca_test_incore_54

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_hca_test_incore_54_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_hca_test_incore_54.dir/build.make:98: attention_hca_test_incore_54] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_hca_test_incore_54.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:13] ERROR: testcase failed (exit 2): attention_hca_test_incore_54
attention_csa_test_refresh_incore_81

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_csa_test_refresh_incore_81_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_csa_test_refresh_incore_81.dir/build.make:98: attention_csa_test_refresh_incore_81] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_csa_test_refresh_incore_81.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:15] ERROR: testcase failed (exit 2): attention_csa_test_refresh_incore_81
tgather_root_binding

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1321027 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:24:10] ERROR: testcase failed (exit 139): tgather_root_binding
cmps

stage=run info=exit=2

[ERROR] Packed mask mismatch: golden_v2.bin vs v2.bin, idx=4 (golden=98, out=0)
[ERROR] compare failed
[2026-06-16 22:29:10] ERROR: testcase failed (exit 2): cmps
cmp

stage=run info=exit=2

[ERROR] Packed mask mismatch: golden_v3.bin vs v3.bin, idx=4 (golden=49, out=0)
[ERROR] compare failed
[2026-06-16 22:29:20] ERROR: testcase failed (exit 2): cmp

constraints=[_constraint_trowexpanddiv_row_major],
)
def template_trowexpanddiv_f32(src0: pto.Tile, src1: pto.Tile, dst: pto.Tile):
"""Template for pto.trowexpanddiv with f32 dtype and optional high-precision mode."""

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

请补一个st用例,看护这个场景

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants