Skip to content

feat(hygon-gemm): add Hygon backend support for Add/Gemm#31

Open
gongchensu wants to merge 2 commits intoInfiniTensor:feat/dev-infrafrom
gongchensu:feat/hygon-gemm
Open

feat(hygon-gemm): add Hygon backend support for Add/Gemm#31
gongchensu wants to merge 2 commits intoInfiniTensor:feat/dev-infrafrom
gongchensu:feat/hygon-gemm

Conversation

@gongchensu
Copy link

No description provided.

@gongchensu gongchensu self-assigned this Mar 24, 2026
- Add `WITH_HYGON` build support and a Hygon `Add` backend that reuses the shared CUDA implementation.
- Detect DTK `nvcc` from the Hygon toolkit layout and auto-detect the GPU arch from `rocminfo`.
- Treat Hygon as a CUDA-like backend in shared data type, cast, and kernel helper headers.
- Skip the Hygon `gemm` example for now and ignore `build-*` temporary directories.
- Verified with `pip install -e .[dev]` and `pytest tests/test_add.py`.
- add a Hygon `Gemm` backend on top of the shared CUDA BLAS path
- use DTK-friendly compute and algo settings for fp32/fp16 gemm
- fall back to `cublasGemmEx` for single-batch Hygon gemm to avoid DTK crashes
- release Hygon cublas handles after each call and re-enable the `gemm` example
- verified with `pip install -e .[dev]`, `pytest tests/test_gemm.py -k cuda`, and `pytest tests/test_gemm.py`
@gongchensu
Copy link
Author

海光编译及算子测试:
image
A100编译及算子测试:
image
摩尔编译及算子测试:
image
image
沐曦编译及算子测试:
image
寒武纪编译:
image
天数编译及算子测试:
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant