Branchless Quicksort
Summary
The piece introduces a branchless quicksort approach implemented in the blqsort library, including C/C++ headers and single- and multi-threaded variants. It highlights a branchless partitioning technique using sorting networks and provides benchmark results on Apple M1 and Ryzen, showing performance advantages over std::sort and pdqsort. It also points to source code and usage guidance.