Skip to content
Change the repository type filter

All

    Repositories list

    • Minuet

      Public
      [EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
      Cuda
      Apache License 2.0
      48030Updated Jun 7, 2024Jun 7, 2024
    • hfta

      Public
      Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
      Python
      MIT License
      113261Updated May 15, 2024May 15, 2024
    • Sylva

      Public
      0000Updated Apr 19, 2024Apr 19, 2024
    • This repository contains the source code for Grape.
      Python
      1600Updated Sep 15, 2023Sep 15, 2023
    • hotline

      Public
      Python
      Apache License 2.0
      33330Updated Jun 6, 2023Jun 6, 2023
    • Python
      Other
      77000Updated May 29, 2023May 29, 2023
    • A domain specific language to express machine learning workloads.
      C++
      Apache License 2.0
      213001Updated Apr 28, 2023Apr 28, 2023
    • CSCD70

      Public
      CSCD70 Compiler Optimization
      C++
      6626100Updated Apr 17, 2023Apr 17, 2023
    • Tempo

      Public
      Memory footprint reduction for transformer models
      Python
      21100Updated Jan 24, 2023Jan 24, 2023
    • Benchmarking using MXNet GPU Memory Profiler
      Python
      2311Updated Nov 19, 2022Nov 19, 2022
    • tvm

      Public
      Open deep learning compiler stack for cpu, gpu and specialized accelerators
      Python
      Apache License 2.0
      3.9k000Updated Aug 26, 2022Aug 26, 2022
    • Python
      Apache License 2.0
      1010Updated Aug 25, 2022Aug 25, 2022
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k000Updated Aug 19, 2022Aug 19, 2022
    • HTML
      1500Updated Aug 15, 2022Aug 15, 2022
    • DietCode

      Public
      DietCode Code Release
      Cuda
      96500Updated Jul 21, 2022Jul 21, 2022
    • skyline

      Public archive
      🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
      Python
      Apache License 2.0
      0200Updated May 18, 2022May 18, 2022
    • An Open Source Machine Learning Framework for Everyone
      C++
      Apache License 2.0
      75k000Updated May 17, 2022May 17, 2022
    • Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascrip…
      C++
      Apache License 2.0
      6.7k000Updated May 17, 2022May 17, 2022
    • A collection of Twitter's anonymized production cache traces.
      Shell
      Creative Commons Attribution 4.0 International
      37000Updated Dec 13, 2021Dec 13, 2021
    • brax

      Public
      Massively parallel rigidbody physics simulation on accelerator hardware.
      Jupyter Notebook
      Apache License 2.0
      338000Updated Nov 17, 2021Nov 17, 2021
    • jaxviz

      Public
      Rich interactive visualization of Jax and XLA computational graph IRs.
      Python
      1000Updated Jul 24, 2021Jul 24, 2021
    • Some results of visualizing jax
      HTML
      1000Updated Jul 16, 2021Jul 16, 2021
    • The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".
      Python
      MIT License
      11310Updated Jun 7, 2021Jun 7, 2021
    • MoIL

      Public
      MoIL: Enabling Efficient Incremental Training on Edge Devices
      Apache License 2.0
      0200Updated May 1, 2021May 1, 2021
    • rlscope

      Public
      RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
      Python
      Apache License 2.0
      14810Updated Apr 7, 2021Apr 7, 2021
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      C++
      Other
      28k000Updated Apr 7, 2021Apr 7, 2021
    • Fork of https://github.com/hill-a/stable-baselines with RL-Scope annotations added.
      Python
      MIT License
      0000Updated Feb 4, 2021Feb 4, 2021
    • Fork of https://github.com/araffin/rl-baselines-zoo with RL-Scope annotations added.
      Python
      MIT License
      0000Updated Feb 4, 2021Feb 4, 2021
    • Fork of https://github.com/facebookresearch/ReAgent with RL-Scope annotations added.
      Python
      BSD 3-Clause "New" or "Revised" License
      0000Updated Feb 4, 2021Feb 4, 2021
    • Fork of https://github.com/tensorflow/agents with RL-Scope annotations added.
      Python
      Apache License 2.0
      0000Updated Feb 4, 2021Feb 4, 2021
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.