Efficient String Compression for Modern Database Systems
Summary
CedarDB introduces FSST-based string compression to halve text-column storage and speed up queries. The post compares FSST with dictionary compression, explains combining FSST with a dictionary, and presents a penalty-based decision rule and benchmark results on ClickBench and TPC-H, highlighting storage gains and mixed effects on query latency due to decompression.