Skip to content
#

systolic-array

Here are 22 public repositories matching this topic...

Parameterized N×N output-stationary systolic array accelerator for INT8 neural network inference. Full RTL-to-GDS flow on ASAP7 7nm using Cadence Genus + Innovus. 667 MHz, 42.7 GOPS peak throughput, 0.33 mW/GOP. SystemVerilog RTL, synthesis, place-and-route and self-checking testbench included.

  • Updated Feb 18, 2026
  • Verilog

An 8×8 systolic array AI accelerator implemented in SystemVerilog on Zynq UltraScale+ ZCU104, achieving 1.7 GOPS at 6 mW PL logic power (~283 GOPS/W efficiency) with full AXI-Stream PS-PL integration. Targets INT8 matrix multiplication for transformer inference acceleration, verified across behavioral, post-synthesis and implementation simulation.

  • Updated Apr 21, 2026
  • SystemVerilog

Improve this page

Add a description, image, and links to the systolic-array topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the systolic-array topic, visit your repo's landing page and select "manage topics."

Learn more