Getting peak TOPS on a Ryzen AI 7 350 NPU
Summary
Explains how to reach peak TOPS on the Ryzen AI 7 350 NPU using int8 8x8 matrix multiply-accumulate on the AIE-MLv2 engine. Describes architecture, development stack (mlir-aie, IRON, Peano, XRT), and benchmarking results with around 56-59 TOPS at 1.8 GHz, plus tracing with Perfetto.