Linear Representations and Superposition

February 15, 2026 at 04:29

Quality: 7/10 Relevance: 9/10

Summary

The post surveys linear representations and superposition as frameworks for interpreting LLMs. It explains the embedding and unembedding spaces, how concepts are represented within them, and the role of nonlinearity in managing interference. It cites Park et al. and Anthropic and includes notes on Llama 2 experiments.
