Diagnosing Random MariaDB Freezes on Frappe Cloud
Summary
A detailed case study on diagnosing random MariaDB freezes on Frappe Cloud, highlighting observability challenges under heavy I/O, the use of eBPF for kernel-level tracing, and a multi-step approach to mitigate I/O spikes without compromising live workloads. The article provides practical lessons for tuning and safer metric collection in cloud-hosted databases.