Teaching LLMs to Be Funny
Summary
The article documents an experiment in making the trillion-parameter Kimi K2 model funny through rubric-based reinforcement learning. It details decomposing humor into verifiable rubrics, building a data pipeline from social media and humor publications, and iterating with supervised fine-tuning (SFT) and RL to improve humorous outputs, noting what worked and what didn't.
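The core idea of rubric-based RL — decompose a fuzzy goal like "funny" into independently checkable criteria, score each, and use the aggregate as the reward signal — can be sketched as follows. The rubric names and the keyword-style checks here are illustrative stand-ins; the article's actual rubrics would be graded by an LLM judge, not simple heuristics.

```python
# Hypothetical sketch of rubric-based reward scoring: humor is decomposed
# into verifiable rubrics, each scored independently, and the mean score
# becomes the scalar RL reward. Rubric names and checks are invented
# examples, not the article's actual criteria.

from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Rubric:
    name: str
    check: Callable[[str], float]  # returns a score in [0.0, 1.0]

def reward(text: str, rubrics: List[Rubric]) -> float:
    """Mean rubric score, used as the scalar reward during RL."""
    return sum(r.check(text) for r in rubrics) / len(rubrics)

# Toy rubrics; a real pipeline would replace these with an LLM grader
# applying each rubric's written criteria.
rubrics = [
    Rubric("has_complete_sentences", lambda t: 1.0 if "." in t else 0.0),
    Rubric("brevity", lambda t: 1.0 if len(t.split()) <= 30 else 0.0),
]

joke = "I told my model a joke about recursion. It's still laughing about it."
print(round(reward(joke, rubrics), 2))  # → 1.0 (both toy rubrics pass)
```

Scoring each rubric separately, rather than asking a judge "is this funny?" outright, is what makes the reward verifiable and less gameable during RL.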