Skip to content
Change the repository type filter

All

    Repositories list

    • Distributed Deep Learning (DL) proxy benchmark.
      C++
      GNU General Public License v3.0
      1030Updated May 5, 2026May 5, 2026
    • Repository for the laboratory lessons of the GPU Computing course (Academic year 2025/2026). Prof. Flavio Vella, Assistant Prof. Kaijie Fan.
      C++
      2900Updated May 5, 2026May 5, 2026
    • JAX-NG

      Public
      Python
      MIT License
      0000Updated May 4, 2026May 4, 2026
    • Python
      0000Updated Apr 30, 2026Apr 30, 2026
    • SPARTA

      Public
      SParse AcceleRation on Tensor Architecture
      Emacs Lisp
      71870Updated Apr 15, 2026Apr 15, 2026
    • JobPlacer

      Public
      A simple tool to obtain a graph/tree from the topology of a systems and place jobs under certain constraints
      Rust
      GNU General Public License v3.0
      0000Updated Apr 4, 2026Apr 4, 2026
    • NNEQ

      Public
      Model Checking Equivalence
      BSD 3-Clause "New" or "Revised" License
      0000Updated Apr 1, 2026Apr 1, 2026
    • Trident

      Public
      New repo for the distribuited SpGEMM project
      Cuda
      0200Updated Mar 24, 2026Mar 24, 2026
    • A lightweight Python tool that converts PyTorch Profiler traces (.json) into clean, structured CSV datasets. It intelligently filters composite operations to is…
      Python
      0000Updated Feb 26, 2026Feb 26, 2026
    • A customization of the official Graph500 benchmark oriented to interconnects performance.
      C
      Other
      0000Updated Feb 9, 2026Feb 9, 2026
    • C++
      3030Updated Feb 4, 2026Feb 4, 2026
    • Blink-GPU

      Public
      A suite of microbenchmarks developed for systems with multi-GPU per node.
      Cuda
      3940Updated Jan 22, 2026Jan 22, 2026
    • Extract the shape of the weight matrices of some Transformer-based models
      Python
      0000Updated Jan 20, 2026Jan 20, 2026
    • Popcorn

      Public
      Cuda
      1900Updated Dec 10, 2025Dec 10, 2025
    • ADS

      Public
      Lab of Algorithms and Data Structures of University of Trento
      C
      1200Updated Nov 24, 2025Nov 24, 2025
    • 0100Updated Oct 27, 2025Oct 27, 2025
    • MergedCSR

      Public
      Optimized implementations of BFS yielding up to 2x speedup over the GAP benchmark.
      Jupyter Notebook
      1100Updated Oct 21, 2025Oct 21, 2025
    • This mini-repo aims to precompute and normalize large graphs.
      Cuda
      0050Updated Oct 17, 2025Oct 17, 2025
    • PAul

      Public
      PArallel Unsupervised Learning
      C
      0000Updated Oct 6, 2025Oct 6, 2025
    • Code accompanying our ICML 2025 paper “Riemann Tensor Neural Networks: Learning Conservative Systems with Physics-Constrained Networks.”
      Python
      0100Updated Jul 17, 2025Jul 17, 2025
    • Blink: A Benchmark for Large-Scale Interconnection Networks
      C
      MIT License
      2000Updated Jun 21, 2025Jun 21, 2025
    • Breadth-First Search acceleration with CUDA
      Cuda
      0000Updated Jun 10, 2025Jun 10, 2025
    • Repository for the laboratory lessons of the GPU Computing course (Academic year 2024/2025). Prof. Flavio Vella, Teaching assistant Lorenzo Pichetti.
      C++
      3700Updated May 29, 2025May 29, 2025
    • UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.
      Python
      MIT License
      15000Updated Apr 29, 2025Apr 29, 2025
    • dpu_kmeans

      Public template
      Implementation of the K-means algorithm on UPMEM PIM architecture
      Python
      MIT License
      4000Updated Mar 25, 2025Mar 25, 2025
    • Parallel Expected Force
      Cuda
      5200Updated Jan 9, 2025Jan 9, 2025
    • C++
      3000Updated Dec 19, 2024Dec 19, 2024
    • A private fork of 'Blink-GPU' done for the energy mesurements part
      Cuda
      3000Updated Dec 17, 2024Dec 17, 2024
    • Python
      MIT License
      0300Updated Dec 15, 2024Dec 15, 2024
    • smat

      Public
      Code for High Performance Unstructured SpMM Computation Using Tensor Cores
      C++
      3000Updated Nov 3, 2024Nov 3, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.