Skip to content

x4la/einsum

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

einsum (development version)

This repository contains the development work for a lightweight, high-performance einsum implementation based on a Transpose–Transpose–BMM–Transpose strategy (see paper ). It includes all experimental versions (v1v8) of the C++ BMM kernels and the Python interface, as well as the benchmarking and test scripts used during optimization.

The project was created for the Algorithm-Engineering course at FSU Jena and explores how view-based layouts together with a tuned BMM backend can significantly outperform NumPy’s einsum and achieve strong performance on memory-bound cases compared to PyTorch.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors