High-Performance Cross-Platform GPU Computing Framework
-
Updated
Apr 7, 2026 - C
High-Performance Cross-Platform GPU Computing Framework
GPU-accelerated Sobel edge detection using OpenCL. Features a high-performance "Universal Tiling" implementation with local memory (SRAM) caching and strided loading, achieving a 138x speedup on NVIDIA K20 hardware.
Système d'exploitation Exokernel "Bare-Metal" et langage dédié (Neuro-Lang) pour l'IA. Élimination de la "Taxe d'Abstraction" : exécution Ring-0, mémoire unifiée SASOS et accès GPU direct sans latenc
Add a description, image, and links to the cuda-alternative topic page so that developers can more easily learn about it.
To associate your repository with the cuda-alternative topic, visit your repo's landing page and select "manage topics."