defuddle: Get the main content of any page as Markdown
Summary
Defuddle is a tool that extracts the main content from web pages and returns cleaned HTML or Markdown, aiming to replace Readability with more metadata and formatting options. It targets browser and Node.js usage (including a CLI) and emphasizes a configurable pipeline for content extraction and HTML standardization, while noting it is a work in progress.