MCHEP is a highly parallelizable Monte Carlo integration routine. Specifically, it supports multi-threads/cores parallelization, Single Instruction Multiple Data (SIMD) instructions, and GPU acceleration. Currently, it implements two adaptive multidimensional integrations, namely VEGAS and VEGAS+ (with adaptive stratified sampling) as presented in the paper arXiv:2009.05112.
The following benchmark compares MCHEP against the CUBA library across different integrand complexities, simulating typical High Energy Physics (HEP) workloads:
The left plot shows the integration throughput (evaluations per millisecond) vs. computational cost per evaluation. MCHEP with SIMD+AVX consistently outperforms both MCHEP scalar and CUBA across all complexity levels. The right plot shows the speedup factor of MCHEP implementations compared to CUBA Vegas. MCHEP SIMD+AVX achieves 4-6x speedup over CUBA for typical HEP workloads.
Significant improvements can be obtained using SIMD+AVX instructions. See benchmark details for comprehensive comparisons.
| QCD Complexity | Typical Cost (FLOPS) | MCHEP Scalar (evals/ms) | MCHEP SIMD+AVX (evals/ms) | CUBA (evals/ms) | Speedup vs CUBA |
|---|---|---|---|---|---|
| LO | ~10k | 2,500 | 7,500 | 1,800 | 4.2x |
| NLO | ~1M | 48 | 153 | 34 | 4.5x |
| NNLO | ~10M | 5 | 18 | 5 | 3.9x |
| Feature | Rust API | C/C++ API | Python API | |||
|---|---|---|---|---|---|---|
| Vegas | Vegas+ | Vegas | Vegas+ | Vegas | Vegas+ | |
| Multi-threaded (Rayon) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| SIMD | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| GPU | ✓ | ✓ | ||||
| MPI | ✓ | ✓ | ✓ | |||
| MPI+SIMD | ✓ | ✓ | ||||
