Equivalence of Unicode strings is strange
Summary
Blog post exploring how Unicode string equivalence under collations in databases can produce surprising results. It explains that case-insensitive and accent-insensitive collations can make distinct strings compare as equal, discusses the Unicode Collation Algorithm and ICU, and highlights implications for sorting, grouping, and hash-based operations in DBMSs.