AIs can generate near-verbatim copies of novels from training data

February 23, 2026 at 15:38

Quality: 9/10 Relevance: 9/10

Summary

This article reports on research showing that top AI models memorize training data and can generate near-verbatim copies of novels, challenging claims that models do not store copyrighted works. It discusses implications for copyright liability, privacy, and the need for governance as AI models become more capable of reproducing protected text.

Read Original Article