Talkie: a 13B vintage language model from 1930

April 27, 2026 at 21:55

Quality: 8/10 Relevance: 9/10

Summary

Talkie introduces a 13B vintage language model trained on pre-1931 text and investigates how such models perform relative to modern counterparts. The article covers data quality challenges (OCR), leakage prevention, post-training pipelines, and plans for scaling, highlighting how historical data shapes model behavior and research insights.

LLM & Prompting AI Research

Read Original Article