Full-Text Search with DuckDB
Summary
The post investigates using DuckDB's full-text search capabilities to query large text datasets, with a focus on practical setup, indexing, and tuning using Okapi BM25. It provides a hands-on workflow for preprocessing emails with Python, creating an FTS index, and executing sample queries, while noting current limitations and potential future enhancements like phrase queries and vector search.