People cannot "just pay attention" to (boring, routine) things
Summary
The article discusses how overly generic User-Agent strings from bots trigger blocking by site operators to reduce server load and preserve bandwidth. It argues for clearer bot identification and responsible data collection practices, highlighting the surge of crawlers in early 2025 and the impact on AI training data harvesting. The piece underscores ongoing tensions between data access for model development and site/resource protection.