DigiNews

Tech Watch Articles

← Back to articles

wxpath - declarative web crawling with XPath

Quality: 9/10 Relevance: 9/10

Summary

wxpath is a Python-based declarative web crawler that expresses traversal and extraction in XPath. It supports asynchronous crawling, a url(...) operator for starting points, and deep crawling with ///url(...), with results streamed as they are discovered; the project includes design docs, examples, and configuration guidance.

🚀 Service construit par Johan Denoyer