Every byte matters
Summary
This article explains how CPU cache organization and memory access patterns influence software performance, with practical math and examples. It compares AoS vs SoA layouts and shows how data layout and working-set size can dramatically affect latency, sometimes by tens of times. It emphasizes designing data structures with cache locality in mind to maximize throughput.