AIs can generate near-verbatim copies of novels from training data
Summary
This article reports on research showing that top AI models memorize training data and can generate near-verbatim copies of novels, challenging claims that models do not store copyrighted works. It discusses implications for copyright liability, privacy, and the need for governance as AI models become more capable of reproducing protected text.