Why Your CPU Is Fast But Your Program Is Slow: Understanding the Memory Wall
Summary
The article explains the memory wall: CPU speed often outpaces memory bandwidth, a gap shaped by DRAM architecture and cache hierarchies. It uses the Aletheia experiments and the Roofline model to show that memory access patterns, not raw compute, often limit performance, and it gives practical guidance on making data movement the focus of optimization.
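As a rough illustration of the Roofline model named above, the sketch below computes the attainable throughput of a kernel as the smaller of peak compute and peak bandwidth times arithmetic intensity (FLOPs per byte moved). The machine figures (1000 GFLOP/s, 100 GB/s) and the function names are hypothetical, chosen only for the example; they are not taken from the article or from the Aletheia experiments.

```python
def attainable_gflops(peak_gflops, peak_bandwidth_gbs, arithmetic_intensity):
    """Roofline bound: min(peak compute, bandwidth * arithmetic intensity)."""
    return min(peak_gflops, peak_bandwidth_gbs * arithmetic_intensity)


def is_memory_bound(peak_gflops, peak_bandwidth_gbs, arithmetic_intensity):
    """A kernel is memory-bound when its intensity is below the ridge point."""
    ridge_point = peak_gflops / peak_bandwidth_gbs  # FLOP/byte
    return arithmetic_intensity < ridge_point


# Hypothetical machine, for illustration only.
PEAK_GFLOPS = 1000.0   # peak compute, GFLOP/s
PEAK_BW_GBS = 100.0    # peak memory bandwidth, GB/s

# Streaming update y[i] = a * x[i] + y[i] on 8-byte floats:
# 2 FLOPs per element; 24 bytes moved (read x, read y, write y).
streaming_intensity = 2 / 24

bound = attainable_gflops(PEAK_GFLOPS, PEAK_BW_GBS, streaming_intensity)
print(f"streaming kernel bound: {bound:.2f} GFLOP/s")
print("memory-bound:", is_memory_bound(PEAK_GFLOPS, PEAK_BW_GBS, streaming_intensity))
```

Under these assumed numbers the streaming kernel is capped far below peak compute, which is the sense in which data movement, not arithmetic, sets the limit.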