Efficient C++ Programming for Modern C++ CPUs, Chapter 4/part 2
Summary
This article analyzes CPU clock cycle costs relevant to modern CPUs in the context of C++ performance. It covers multiplication/division latencies, RTTI costs, and the cost of exceptions versus return-based error handling, plus topics like atomics, inlining, TLS, and thread context switches, with numerous references.