vpto fix trowexpand op for col major tile by Likai-19 · Pull Request #828 · hw-native-sys/PTOAS

Likai-19 · 2026-06-16T13:34:50Z

问题总结

trowexpand 系列 op 的 TileLang 模板及 DSL lowering 层在处理 src1 为 col major [M, 1] 时存在两类缺陷：

模板层：trowexpandmul/trowexpandsub/trowexpanddiv 三个模板仅实现了 row_major 路径，对 src1[row, :] 使用 vlds 对齐加载。当 src1 为 col major 时，tile 切片地址不满足 512B 对齐要求，触发 A5 error 340。
lowering 层：vldas/vldus 指令的 IR 生成路径中，将 tile 先转为 memref 再通过 subview + castptr 获取指针的方式存在结构性问题——vldas 只接受 !pto.ptr 类型，memref 中间表示引入了不必要的类型转换，且对 col major tile 的地址计算逻辑不正确。

修改方案

1. 模板层 — 新增 col major 非对齐访问路径

在三个模板中分别添加 col_major 分支。通过 pto.constexpr(src1.config.b_layout == pto.BLayout.COL_MAJOR) 在编译期选择路径：

col major 路径：使用 vldas + vldus 非对齐加载流水线替代 vlds：

align_src1 = pto.vldas(src1[row, :])        # 非对齐加载请求
scalar_vec, _ = pto.vldus(src1[row, :], align_src1)  # 非对齐数据读取
broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

vldas/vldus 每行仅调用一次，broadcast 到全掩码后复用于该行所有列迭代。

row major 路径：保持原有 vlds 对齐访问逻辑不变。

此修改同时覆盖了 trowexpandmul（f16/f32）、trowexpandsub（f16/f32）、trowexpanddiv（f16/f32 及 high_precision 变体）所有精度和模式组合。

2. lowering 层 — tile 直接转 ptr 并修正地址计算

在 _AuthoringRenderer 中：

新增 _materialize_tile_ptr() 方法：通过 pto.tile_buf_addr 将 tile 直接转换为 !pto.ptr<...> 类型，跳过 memref 中间表示。结果缓存在 _tile_ptr_cache 中。
修正 vldas 和 vldus 的 2D tile 地址计算：不再使用 _materialize_rank2_tile_subview + _materialize_copy_buffer_ptr 的 memref 路径，改为直接用 arith.muli + arith.addi 计算元素偏移（offset = row * shape[1] + col），再通过 pto.addptr 获取目标地址。此公式对 row major [M, N] 和 col major [M, 1] 均适用。
两个指令的 lowering 逻辑保持一致：tile → ptr → 偏移计算 → addptr → vldas/vldus。

3. 涉及文件

文件	变更内容
`lib/TileOps/trowexpanddiv_template.py`	添加 col major 非对齐加载路径，覆盖 f32/f16 × default/high_precision 共 4 条路径
`lib/TileOps/trowexpandmul_template.py`	添加 col major 非对齐加载路径
`lib/TileOps/trowexpandsub_template.py`	添加 col major 非对齐加载路径
`tilelang-dsl/python/tilelang_dsl/lowering.py`	新增 `_materialize_tile_ptr()`；修正 `vldas`/`vldus` 的 2D tile 地址计算逻辑

reedhecre · 2026-06-16T13:35:28Z

Codex Review

该评论由 review 机器人自动更新。

PR: vpto fix trowexpand op for col major tile #828 vpto fix trowexpand op for col major tile
Author: Likai-19
Base/Head: main / vpto_row_expand_fix
Head SHA: 3eaf1e3aa356
Trigger: 检测到新的 open PR
Generated At: 2026-06-16T13:36:07Z
Status: failed at codex-review (exit=1)

Summary

Review failed at stage codex-review: exit=1

Findings

未生成结构化 findings，因为 review 过程提前失败。

Log Tail

===== STAGE clone @ 2026-06-16 21:35:29 =====
set -euo pipefail
rm -rf '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
git clone --branch 'main' --depth 50 'https://github.com/hw-native-sys/PTOAS.git' '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
cd '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
git fetch origin 'refs/pull/828/head:pr-828' --depth 50
git fetch origin 'main' --depth 50 || true
git checkout -f 'pr-828'
git rev-parse HEAD
git diff --stat 'origin/main...HEAD' || true
Cloning into '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'...
From https://github.com/hw-native-sys/PTOAS
 * [new ref]         refs/pull/828/head -> pr-828
From https://github.com/hw-native-sys/PTOAS
 * branch            main       -> FETCH_HEAD
Switched to branch 'pr-828'
3eaf1e3aa356d4bfa67b87a01750980f7865c084
 lib/TileOps/trowexpanddiv_template.py        | 154 ++++++++++++++++++---------
 lib/TileOps/trowexpandmul_template.py        |  45 +++++---
 lib/TileOps/trowexpandsub_template.py        |  45 +++++---
 tilelang-dsl/python/tilelang_dsl/lowering.py | 115 ++++++++++++++++----
 4 files changed, 265 insertions(+), 94 deletions(-)
===== END STAGE clone rc=0 @ 2026-06-16 21:35:33 =====

===== STAGE codex-review @ 2026-06-16 21:35:33 =====
set -euo pipefail
cd '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo'
'codex' exec -C '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo' -s read-only -c 'model_provider="codereview"' -c 'model="gpt-5.4"' -c 'model_reasoning_effort="xhigh"' --output-schema '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/review_schema.json' -o '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/codex_last_message.json' --color never - < '/tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/review_prompt.txt'
OpenAI Codex v0.115.0 (research preview)
--------
workdir: /tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/repo
model: gpt-5.4
provider: codereview
approval: never
sandbox: read-only
reasoning effort: xhigh
reasoning summaries: none
session id: 019ed0a4-fc36-70c0-a5f8-abbaaae1bcd9
--------
user
你现在在审查 GitHub PR。

仓库：hw-native-sys/PTOAS
PR：#828 vpto fix trowexpand op for col major tile
作者：Likai-19
base branch：origin/main
head branch：HEAD（当前已 checkout 到 PR head）

要求：
1. 只审查这个 PR 相对 origin/main 的改动，必要时可以看上下文文件。
2. 重点找真实的 correctness / regression / contract mismatch / CI / runtime / compatibility 问题。
3. 不要提纯风格建议，不要提低价值猜测。
4. 严格按优先级输出：
   - P1：高概率会导致错误结果、编译/运行失败、严重回归、发布阻断
   - P2：重要缺陷、行为回归、遗漏校验/测试、较大兼容性问题
   - P3：次要但明确可改的问题
5. 如果没有问题，summary 直接写：未检查到 PR #828 存在问题，并返回 findings=[]。
6. 如果有问题，summary 简洁概括，findings 里每条都要给出：
   - severity
   - title
   - body（说明为什么是问题，尽量具体）
   - file（尽量给相对路径）
   - line（能确定就填整数，否则 null）

建议先查看：
- git status --short
- git diff --stat origin/main...HEAD
- git diff --unified=80 origin/main...HEAD

最终输出必须严格匹配 JSON schema。

mcp startup: no servers
Reconnecting... 1/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: a736196a-f9e7-4526-b0b9-ffb4908e4737)
Reconnecting... 2/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: 51ad90d6-4ce9-4e28-9627-e6f59a0c065e)
Reconnecting... 3/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: d7670e41-1f08-48f6-ace0-2422572c43f0)
Reconnecting... 4/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: c702a25d-2336-4468-a893-47428705f4eb)
Reconnecting... 5/5 (unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: 2931326e-fd15-4bc2-baf4-1d267e34ede0)
ERROR: unexpected status 503 Service Unavailable: Service temporarily unavailable, url: https://codex.0u0o.com/responses, request id: d6152518-6bbc-42cd-8c91-07cdc6904aa7
Warning: no last agent message; wrote empty content to /tmp/ptoas-pr-review-monitor/runs/20260616_213527_pr828/codex_last_message.json
===== END STAGE codex-review rc=1 @ 2026-06-16 21:36:07 =====

gemini-code-assist

Code Review

This pull request introduces support for column-major layouts in trowexpanddiv, trowexpandmul, and trowexpandsub templates by utilizing an unaligned load pipeline (vldas + vldus) to prevent unaligned memory access errors. Additionally, the lowering logic in tilelang_dsl/lowering.py is updated to materialize tile pointers directly via pto.tile_buf_addr instead of using the broken memref-to-subview path. The code reviewer identified critical issues where pto.pset_b32(pto.PAT.ALL) was incorrectly hardcoded for 16-bit float operations in trowexpanddiv_f16 and for generic types in trowexpandmul and trowexpandsub, which would lead to compilation or masking errors. They suggested using pto.pset_b16 for 16-bit types and dynamically resolving the mask size based on the element type in generic templates.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-16T13:36:48Z

+            for row in range(0, valid_rows, 1):
+                align_src1 = pto.vldas(src1[row, :])
+                scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
+                broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))


In template_trowexpanddiv_f16, the element type is f16 (16-bit). Therefore, the mask used for broadcasting should be pto.pset_b16(pto.PAT.ALL) instead of pto.pset_b32(pto.PAT.ALL). Using pset_b32 on f16 data will lead to incorrect masking or compilation errors.

Suggested change

broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

broadcasted = pto.vdup(scalar_vec, pto.pset_b16(pto.PAT.ALL))

gemini-code-assist · 2026-06-16T13:36:48Z

+            for row in range(0, valid_rows, 1):
+                align_src1 = pto.vldas(src1[row, :])
+                scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
+                broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))


In template_trowexpanddiv_f16, the element type is f16 (16-bit). Therefore, the mask used for broadcasting should be pto.pset_b16(pto.PAT.ALL) instead of pto.pset_b32(pto.PAT.ALL). Using pset_b32 on f16 data will lead to incorrect masking or compilation errors.

Suggested change

broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

broadcasted = pto.vdup(scalar_vec, pto.pset_b16(pto.PAT.ALL))

gemini-code-assist · 2026-06-16T13:36:48Z

+            # vldas+vldus once per row, broadcast across all col iterations
+            align_src1 = pto.vldas(src1[row, :])
+            scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
+            broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))


Since template_trowexpandmul is generic and supports multiple data types (including f16, i16, i8), hardcoding pto.pset_b32(pto.PAT.ALL) will cause incorrect masking or compilation failures for non-32-bit types. We should dynamically select the mask based on the element size of dtype using pto.constexpr and pto.get_lanes(dtype).

Suggested change

broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

if pto.constexpr(pto.get_lanes(dtype) == 8):

mask_all = pto.pset_b32(pto.PAT.ALL)

elif pto.constexpr(pto.get_lanes(dtype) == 16):

mask_all = pto.pset_b16(pto.PAT.ALL)

else:

mask_all = pto.pset_b8(pto.PAT.ALL)

broadcasted = pto.vdup(scalar_vec, mask_all)

gemini-code-assist · 2026-06-16T13:36:48Z

+            # vldas+vldus once per row, broadcast across all col iterations
+            align_src1 = pto.vldas(src1[row, :])
+            scalar_vec, _ = pto.vldus(src1[row, :], align_src1)
+            broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))


Since template_trowexpandsub is generic and supports multiple data types (including f16, i16, i8), hardcoding pto.pset_b32(pto.PAT.ALL) will cause incorrect masking or compilation failures for non-32-bit types. We should dynamically select the mask based on the element size of dtype using pto.constexpr and pto.get_lanes(dtype).

Suggested change

broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))

if pto.constexpr(pto.get_lanes(dtype) == 8):

mask_all = pto.pset_b32(pto.PAT.ALL)

elif pto.constexpr(pto.get_lanes(dtype) == 16):

mask_all = pto.pset_b16(pto.PAT.ALL)

else:

mask_all = pto.pset_b8(pto.PAT.ALL)

broadcasted = pto.vdup(scalar_vec, mask_all)

Likai-19 · 2026-06-16T13:39:00Z

Fixes #826

Likai-19 · 2026-06-16T13:40:03Z

/run a5

reedhecre · 2026-06-16T13:40:49Z

已接收 /run a5，A5 板测器会处理这条请求。

进度页：http://154.9.227.233/ptoas-board-dashboard/#board-a5
当前状态：板测器空闲，这条请求会在本轮轮询启动。

页面会自动刷新，可以直接看当前阶段、排队情况和最近结果。

reedhecre · 2026-06-16T14:32:09Z

A5 板测失败

触发方式：manual
源码提交：f12a092580fe
结果汇总：OK 155 / FAIL 68 / SKIP 1
日志：/root/ptoas-board-monitor-a5/logs/20260616_214112_manual_pr828.log
手动指令：/run a5
触发人：Likai-19
触发评论：vpto fix trowexpand op for col major tile #828 (comment)
失败阶段：board-validation / exit=1

失败用例

test_tmov_row_major_1x16_control_a5 (run, exit=2)
test_tmov_col_major_16x1_align_a5 (run, exit=2)
test_dynamic_valid_shape (run, exit=2)
test_barrier_sync (run, exit=2)
test_auto_sync_tail_hint (run, exit=2)
rmsnorm_incore_0 (run, exit=2)
rar_optimization_test (run, exit=2)
nested_loop_confliect (run, exit=2)
matmul (run, exit=2)
decode_projection_incore_0 (run, exit=2)
compensation_test (run, exit=2)
add_double_dynamic (run, exit=2)
rowexpandsub (run, exit=139)
rems (run, exit=2)
rem (run, exit=2)
rope_kv_cache (run, exit=2)
rmsnorm (run, exit=2)
qwen3_decode_incore_7 (run, exit=2)
qwen3_decode_incore_6 (run, exit=2)
qwen3_decode_incore_5 (run, exit=2)
qwen3_decode_incore_4 (run, exit=2)
qwen3_decode_incore_2 (run, exit=2)
qwen3_decode_incore_1 (run, exit=2)
qwen3_decode_incore_12 (run, exit=2)
qwen3_decode_incore_11 (run, exit=2)
qwen3_decode_incore_10 (run, exit=2)
post_rmsnorm (run, exit=2)
vector_example_dag_kernel_mul (run, exit=2)
vector_example_dag_kernel_add_scalar (run, exit=2)
vector_example_dag_kernel_add (run, exit=2)
paged_attention_example_kernel_softmax_prepare (run, exit=2)
paged_attention_example_kernel_qk_matmul (run, exit=2)
paged_attention_example_kernel_pv_matmul (run, exit=2)
paged_attention_example_kernel_online_update (run, exit=2)
paged_attention_example_kernel_init_inplace (run, exit=2)
orchestration_example_kernel_mul (run, exit=2)
orchestration_example_kernel_add_scalar (run, exit=2)
orchestration_example_kernel_add (run, exit=2)
prelu (run, exit=2)
plan_memory_reuse_sequential (run, exit=2)
plan_memory_peak_exact_capacity (run, exit=2)
plan_memory_peak_8_overlapping (run, exit=2)
plan_memory_no_reuse_overlap (run, exit=2)
plan_memory_nested_loops (run, exit=2)
plan_memory_loop_no_reuse_outer_live (run, exit=2)
plan_memory_loop_in_if (run, exit=2)
plan_memory_if_yield (run, exit=2)
plan_memory_if_in_loop (run, exit=2)
plan_memory_fragmentation_two_holes (run, exit=2)
plan_memory_fragmentation_hole_fit (run, exit=2)
plan_memory_for_iter_args_yield (run, exit=2)
plan_memory_bind_tile_alias_liveness (run, exit=2)
partition_view_verify_valid (run, exit=2)
partition_view_verify_rank_mismatch_valid (run, exit=2)
partition5d_dynamic_a5 (run, exit=2)
partition5d_a5 (run, exit=2)
tensor_view_layout_dn (run, exit=2)
fillpad (run, exit=139)
sparse_attn_test_incore_7 (run, exit=2)
decode_swa_test_incore_40 (run, exit=2)
decode_hca_test_incore_54 (run, exit=2)
decode_csa_test_incore_81 (run, exit=2)
attention_swa_test_incore_40 (run, exit=2)
attention_hca_test_incore_54 (run, exit=2)
attention_csa_test_refresh_incore_81 (run, exit=2)
tgather_root_binding (run, exit=139)
cmps (run, exit=2)
cmp (run, exit=2)

reedhecre · 2026-06-16T14:32:12Z

A5 板测失败详情：PR #828

test_tmov_row_major_1x16_control_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_tmov_row_major_1x16_control_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_tmov_row_major_1x16_control_a5.dir/build.make:98: test_tmov_row_major_1x16_control_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_tmov_row_major_1x16_control_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:52:45] ERROR: testcase failed (exit 2): test_tmov_row_major_1x16_control_a5

test_tmov_col_major_16x1_align_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_tmov_col_major_16x1_align_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_tmov_col_major_16x1_align_a5.dir/build.make:98: test_tmov_col_major_16x1_align_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_tmov_col_major_16x1_align_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:52:47] ERROR: testcase failed (exit 2): test_tmov_col_major_16x1_align_a5

test_dynamic_valid_shape

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_dynamic_valid_shape_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_dynamic_valid_shape.dir/build.make:98: test_dynamic_valid_shape] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_dynamic_valid_shape.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:40] ERROR: testcase failed (exit 2): test_dynamic_valid_shape

test_barrier_sync

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_barrier_sync_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_barrier_sync.dir/build.make:98: test_barrier_sync] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_barrier_sync.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:43] ERROR: testcase failed (exit 2): test_barrier_sync

test_auto_sync_tail_hint

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtest_auto_sync_tail_hint_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/test_auto_sync_tail_hint.dir/build.make:98: test_auto_sync_tail_hint] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/test_auto_sync_tail_hint.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:56:45] ERROR: testcase failed (exit 2): test_auto_sync_tail_hint

rmsnorm_incore_0

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librmsnorm_incore_0_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rmsnorm_incore_0.dir/build.make:98: rmsnorm_incore_0] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rmsnorm_incore_0.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:43] ERROR: testcase failed (exit 2): rmsnorm_incore_0

rar_optimization_test

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librar_optimization_test_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rar_optimization_test.dir/build.make:98: rar_optimization_test] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rar_optimization_test.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:46] ERROR: testcase failed (exit 2): rar_optimization_test

nested_loop_confliect

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libnested_loop_confliect_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/nested_loop_confliect.dir/build.make:98: nested_loop_confliect] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/nested_loop_confliect.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:48] ERROR: testcase failed (exit 2): nested_loop_confliect

matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libmatmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/matmul.dir/build.make:98: matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:51] ERROR: testcase failed (exit 2): matmul

decode_projection_incore_0

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_projection_incore_0_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_projection_incore_0.dir/build.make:98: decode_projection_incore_0] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_projection_incore_0.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:54] ERROR: testcase failed (exit 2): decode_projection_incore_0

compensation_test

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libcompensation_test_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/compensation_test.dir/build.make:98: compensation_test] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/compensation_test.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:56] ERROR: testcase failed (exit 2): compensation_test

add_double_dynamic

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libadd_double_dynamic_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/add_double_dynamic.dir/build.make:98: add_double_dynamic] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/add_double_dynamic.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 21:57:59] ERROR: testcase failed (exit 2): add_double_dynamic

rowexpandsub

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1245055 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:04:25] ERROR: testcase failed (exit 139): rowexpandsub

rems

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librems_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rems.dir/build.make:98: rems] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rems.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:30] ERROR: testcase failed (exit 2): rems

rem

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librem_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rem.dir/build.make:98: rem] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rem.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:32] ERROR: testcase failed (exit 2): rem

rope_kv_cache

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librope_kv_cache_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rope_kv_cache.dir/build.make:98: rope_kv_cache] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rope_kv_cache.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:55] ERROR: testcase failed (exit 2): rope_kv_cache

rmsnorm

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by librmsnorm_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/rmsnorm.dir/build.make:98: rmsnorm] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/rmsnorm.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:07:58] ERROR: testcase failed (exit 2): rmsnorm

qwen3_decode_incore_7

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_7_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_7.dir/build.make:98: qwen3_decode_incore_7] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_7.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:01] ERROR: testcase failed (exit 2): qwen3_decode_incore_7

qwen3_decode_incore_6

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_6_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_6.dir/build.make:98: qwen3_decode_incore_6] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_6.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:04] ERROR: testcase failed (exit 2): qwen3_decode_incore_6

qwen3_decode_incore_5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_5.dir/build.make:98: qwen3_decode_incore_5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:06] ERROR: testcase failed (exit 2): qwen3_decode_incore_5

qwen3_decode_incore_4

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_4_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_4.dir/build.make:98: qwen3_decode_incore_4] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_4.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:09] ERROR: testcase failed (exit 2): qwen3_decode_incore_4

qwen3_decode_incore_2

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_2_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_2.dir/build.make:98: qwen3_decode_incore_2] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_2.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:12] ERROR: testcase failed (exit 2): qwen3_decode_incore_2

qwen3_decode_incore_1

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_1_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_1.dir/build.make:98: qwen3_decode_incore_1] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_1.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:14] ERROR: testcase failed (exit 2): qwen3_decode_incore_1

qwen3_decode_incore_12

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_12_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_12.dir/build.make:98: qwen3_decode_incore_12] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_12.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:17] ERROR: testcase failed (exit 2): qwen3_decode_incore_12

qwen3_decode_incore_11

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_11_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_11.dir/build.make:98: qwen3_decode_incore_11] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_11.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:19] ERROR: testcase failed (exit 2): qwen3_decode_incore_11

qwen3_decode_incore_10

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libqwen3_decode_incore_10_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/qwen3_decode_incore_10.dir/build.make:98: qwen3_decode_incore_10] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/qwen3_decode_incore_10.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:22] ERROR: testcase failed (exit 2): qwen3_decode_incore_10

post_rmsnorm

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpost_rmsnorm_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/post_rmsnorm.dir/build.make:98: post_rmsnorm] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/post_rmsnorm.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:08:25] ERROR: testcase failed (exit 2): post_rmsnorm

vector_example_dag_kernel_mul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_mul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_mul.dir/build.make:98: vector_example_dag_kernel_mul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_mul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:17] ERROR: testcase failed (exit 2): vector_example_dag_kernel_mul

vector_example_dag_kernel_add_scalar

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_add_scalar_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_add_scalar.dir/build.make:98: vector_example_dag_kernel_add_scalar] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_add_scalar.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:20] ERROR: testcase failed (exit 2): vector_example_dag_kernel_add_scalar

vector_example_dag_kernel_add

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libvector_example_dag_kernel_add_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/vector_example_dag_kernel_add.dir/build.make:98: vector_example_dag_kernel_add] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/vector_example_dag_kernel_add.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:22] ERROR: testcase failed (exit 2): vector_example_dag_kernel_add

paged_attention_example_kernel_softmax_prepare

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_softmax_prepare_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_softmax_prepare.dir/build.make:98: paged_attention_example_kernel_softmax_prepare] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_softmax_prepare.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:25] ERROR: testcase failed (exit 2): paged_attention_example_kernel_softmax_prepare

paged_attention_example_kernel_qk_matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_qk_matmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_qk_matmul.dir/build.make:98: paged_attention_example_kernel_qk_matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_qk_matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:28] ERROR: testcase failed (exit 2): paged_attention_example_kernel_qk_matmul

paged_attention_example_kernel_pv_matmul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_pv_matmul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_pv_matmul.dir/build.make:98: paged_attention_example_kernel_pv_matmul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_pv_matmul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:30] ERROR: testcase failed (exit 2): paged_attention_example_kernel_pv_matmul

paged_attention_example_kernel_online_update

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_online_update_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_online_update.dir/build.make:98: paged_attention_example_kernel_online_update] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_online_update.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:33] ERROR: testcase failed (exit 2): paged_attention_example_kernel_online_update

paged_attention_example_kernel_init_inplace

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpaged_attention_example_kernel_init_inplace_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/paged_attention_example_kernel_init_inplace.dir/build.make:98: paged_attention_example_kernel_init_inplace] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/paged_attention_example_kernel_init_inplace.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:35] ERROR: testcase failed (exit 2): paged_attention_example_kernel_init_inplace

orchestration_example_kernel_mul

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_mul_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_mul.dir/build.make:98: orchestration_example_kernel_mul] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_mul.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:38] ERROR: testcase failed (exit 2): orchestration_example_kernel_mul

orchestration_example_kernel_add_scalar

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_add_scalar_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_add_scalar.dir/build.make:98: orchestration_example_kernel_add_scalar] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_add_scalar.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:41] ERROR: testcase failed (exit 2): orchestration_example_kernel_add_scalar

orchestration_example_kernel_add

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by liborchestration_example_kernel_add_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/orchestration_example_kernel_add.dir/build.make:98: orchestration_example_kernel_add] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/orchestration_example_kernel_add.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:09:43] ERROR: testcase failed (exit 2): orchestration_example_kernel_add

prelu

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libprelu_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/prelu.dir/build.make:98: prelu] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/prelu.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:04] ERROR: testcase failed (exit 2): prelu

plan_memory_reuse_sequential

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_reuse_sequential_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_reuse_sequential.dir/build.make:98: plan_memory_reuse_sequential] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_reuse_sequential.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:07] ERROR: testcase failed (exit 2): plan_memory_reuse_sequential

plan_memory_peak_exact_capacity

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_peak_exact_capacity_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_peak_exact_capacity.dir/build.make:98: plan_memory_peak_exact_capacity] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_peak_exact_capacity.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:10] ERROR: testcase failed (exit 2): plan_memory_peak_exact_capacity

plan_memory_peak_8_overlapping

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_peak_8_overlapping_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_peak_8_overlapping.dir/build.make:98: plan_memory_peak_8_overlapping] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_peak_8_overlapping.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:12] ERROR: testcase failed (exit 2): plan_memory_peak_8_overlapping

plan_memory_no_reuse_overlap

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_no_reuse_overlap_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_no_reuse_overlap.dir/build.make:98: plan_memory_no_reuse_overlap] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_no_reuse_overlap.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:15] ERROR: testcase failed (exit 2): plan_memory_no_reuse_overlap

plan_memory_nested_loops

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_nested_loops_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_nested_loops.dir/build.make:98: plan_memory_nested_loops] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_nested_loops.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:17] ERROR: testcase failed (exit 2): plan_memory_nested_loops

plan_memory_loop_no_reuse_outer_live

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_loop_no_reuse_outer_live_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_loop_no_reuse_outer_live.dir/build.make:98: plan_memory_loop_no_reuse_outer_live] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_loop_no_reuse_outer_live.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:20] ERROR: testcase failed (exit 2): plan_memory_loop_no_reuse_outer_live

plan_memory_loop_in_if

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_loop_in_if_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_loop_in_if.dir/build.make:98: plan_memory_loop_in_if] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_loop_in_if.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:22] ERROR: testcase failed (exit 2): plan_memory_loop_in_if

plan_memory_if_yield

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_if_yield_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_if_yield.dir/build.make:98: plan_memory_if_yield] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_if_yield.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:25] ERROR: testcase failed (exit 2): plan_memory_if_yield

plan_memory_if_in_loop

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_if_in_loop_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_if_in_loop.dir/build.make:98: plan_memory_if_in_loop] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_if_in_loop.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:27] ERROR: testcase failed (exit 2): plan_memory_if_in_loop

plan_memory_fragmentation_two_holes

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_fragmentation_two_holes_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_fragmentation_two_holes.dir/build.make:98: plan_memory_fragmentation_two_holes] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_fragmentation_two_holes.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:30] ERROR: testcase failed (exit 2): plan_memory_fragmentation_two_holes

plan_memory_fragmentation_hole_fit

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_fragmentation_hole_fit_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_fragmentation_hole_fit.dir/build.make:98: plan_memory_fragmentation_hole_fit] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_fragmentation_hole_fit.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:33] ERROR: testcase failed (exit 2): plan_memory_fragmentation_hole_fit

plan_memory_for_iter_args_yield

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_for_iter_args_yield_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_for_iter_args_yield.dir/build.make:98: plan_memory_for_iter_args_yield] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_for_iter_args_yield.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:35] ERROR: testcase failed (exit 2): plan_memory_for_iter_args_yield

plan_memory_bind_tile_alias_liveness

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libplan_memory_bind_tile_alias_liveness_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/plan_memory_bind_tile_alias_liveness.dir/build.make:98: plan_memory_bind_tile_alias_liveness] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/plan_memory_bind_tile_alias_liveness.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:10:38] ERROR: testcase failed (exit 2): plan_memory_bind_tile_alias_liveness

partition_view_verify_valid

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition_view_verify_valid_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition_view_verify_valid.dir/build.make:98: partition_view_verify_valid] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition_view_verify_valid.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:27] ERROR: testcase failed (exit 2): partition_view_verify_valid

partition_view_verify_rank_mismatch_valid

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition_view_verify_rank_mismatch_valid_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition_view_verify_rank_mismatch_valid.dir/build.make:98: partition_view_verify_rank_mismatch_valid] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition_view_verify_rank_mismatch_valid.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:30] ERROR: testcase failed (exit 2): partition_view_verify_rank_mismatch_valid

partition5d_dynamic_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition5d_dynamic_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition5d_dynamic_a5.dir/build.make:98: partition5d_dynamic_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition5d_dynamic_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:32] ERROR: testcase failed (exit 2): partition5d_dynamic_a5

partition5d_a5

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libpartition5d_a5_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/partition5d_a5.dir/build.make:98: partition5d_a5] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/partition5d_a5.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:11:35] ERROR: testcase failed (exit 2): partition5d_a5

tensor_view_layout_dn

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libtensor_view_layout_dn_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/tensor_view_layout_dn.dir/build.make:98: tensor_view_layout_dn] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/tensor_view_layout_dn.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:16:27] ERROR: testcase failed (exit 2): tensor_view_layout_dn

fillpad

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1295964 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:18:20] ERROR: testcase failed (exit 139): fillpad

sparse_attn_test_incore_7

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libsparse_attn_test_incore_7_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/sparse_attn_test_incore_7.dir/build.make:98: sparse_attn_test_incore_7] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/sparse_attn_test_incore_7.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:20:59] ERROR: testcase failed (exit 2): sparse_attn_test_incore_7

decode_swa_test_incore_40

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_swa_test_incore_40_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_swa_test_incore_40.dir/build.make:98: decode_swa_test_incore_40] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_swa_test_incore_40.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:02] ERROR: testcase failed (exit 2): decode_swa_test_incore_40

decode_hca_test_incore_54

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_hca_test_incore_54_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_hca_test_incore_54.dir/build.make:98: decode_hca_test_incore_54] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_hca_test_incore_54.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:05] ERROR: testcase failed (exit 2): decode_hca_test_incore_54

decode_csa_test_incore_81

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libdecode_csa_test_incore_81_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/decode_csa_test_incore_81.dir/build.make:98: decode_csa_test_incore_81] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/decode_csa_test_incore_81.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:07] ERROR: testcase failed (exit 2): decode_csa_test_incore_81

attention_swa_test_incore_40

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_swa_test_incore_40_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_swa_test_incore_40.dir/build.make:98: attention_swa_test_incore_40] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_swa_test_incore_40.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:10] ERROR: testcase failed (exit 2): attention_swa_test_incore_40

attention_hca_test_incore_54

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_hca_test_incore_54_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_hca_test_incore_54.dir/build.make:98: attention_hca_test_incore_54] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_hca_test_incore_54.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:13] ERROR: testcase failed (exit 2): attention_hca_test_incore_54

attention_csa_test_refresh_incore_81

stage=run info=exit=2

ld.lld: error: undefined reference due to --no-allow-shlib-undefined: kernel
>>> referenced by libattention_csa_test_refresh_incore_81_kernel.so
cceld: Linker ReturnCode: 1
cceld: ExecutionFailed: 0
cceld: ErrMsg:
bisheng: error: linker command failed with exit code 1 (use -v to see invocation)
gmake[2]: *** [CMakeFiles/attention_csa_test_refresh_incore_81.dir/build.make:98: attention_csa_test_refresh_incore_81] Error 1
gmake[1]: *** [CMakeFiles/Makefile2:111: CMakeFiles/attention_csa_test_refresh_incore_81.dir/all] Error 2
gmake: *** [Makefile:91: all] Error 2
[2026-06-16 22:21:15] ERROR: testcase failed (exit 2): attention_csa_test_refresh_incore_81

tgather_root_binding

stage=run info=exit=139

./test/npu_validation/scripts/run_remote_npu_validation.sh: line 380: 1321027 Segmentation fault      (core dumped) LD_LIBRARY_PATH="${LD_LIBRARY_PATH_NPU}" ./build/${testcase}
[2026-06-16 22:24:10] ERROR: testcase failed (exit 139): tgather_root_binding

cmps

stage=run info=exit=2

[ERROR] Packed mask mismatch: golden_v2.bin vs v2.bin, idx=4 (golden=98, out=0)
[ERROR] compare failed
[2026-06-16 22:29:10] ERROR: testcase failed (exit 2): cmps

cmp

stage=run info=exit=2

[ERROR] Packed mask mismatch: golden_v3.bin vs v3.bin, idx=4 (golden=49, out=0)
[ERROR] compare failed
[2026-06-16 22:29:20] ERROR: testcase failed (exit 2): cmp

Zhendong404 · 2026-06-17T07:28:19Z

    constraints=[_constraint_trowexpanddiv_row_major],
 )
 def template_trowexpanddiv_f32(src0: pto.Tile, src1: pto.Tile, dst: pto.Tile):
-    """Template for pto.trowexpanddiv with f32 dtype and optional high-precision mode."""


请补一个st用例，看护这个场景

vpto fix trowexpand op for col major tile

3eaf1e3

gemini-code-assist Bot reviewed Jun 16, 2026

View reviewed changes

Zhendong404 approved these changes Jun 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vpto fix trowexpand op for col major tile#828

vpto fix trowexpand op for col major tile#828
Likai-19 wants to merge 1 commit into
hw-native-sys:mainfrom
Likai-19:vpto_row_expand_fix

Likai-19 commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Uh oh!

Likai-19 commented Jun 16, 2026

Uh oh!

Likai-19 commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026

Uh oh!

Zhendong404 Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
	broadcasted = pto.vdup(scalar_vec, pto.pset_b16(pto.PAT.ALL))

-            broadcasted = pto.vdup(scalar_vec, pto.pset_b32(pto.PAT.ALL))
+            if pto.constexpr(pto.get_lanes(dtype) == 8):
+                mask_all = pto.pset_b32(pto.PAT.ALL)
+            elif pto.constexpr(pto.get_lanes(dtype) == 16):
+                mask_all = pto.pset_b16(pto.PAT.ALL)
+            else:
+                mask_all = pto.pset_b8(pto.PAT.ALL)
+            broadcasted = pto.vdup(scalar_vec, mask_all)

Conversation

Likai-19 commented Jun 16, 2026

问题总结

修改方案

1. 模板层 — 新增 col major 非对齐访问路径

2. lowering 层 — tile 直接转 ptr 并修正地址计算

3. 涉及文件

Uh oh!

reedhecre commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codex Review

Summary

Findings

Log Tail

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

Likai-19 commented Jun 16, 2026

Uh oh!

Likai-19 commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026

Uh oh!

reedhecre commented Jun 16, 2026

A5 板测失败

失败用例

Uh oh!

reedhecre commented Jun 16, 2026

A5 板测失败详情：PR #828

Uh oh!

Zhendong404 Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

reedhecre commented Jun 16, 2026 •

edited

Loading