What color are your bits? (2004)
Summary
The article describes slime, an LLM post-training framework from THUDM built for scaling reinforcement learning (RL), with a focus on high-performance training and flexible data generation. It covers the architecture (training, rollout, and data buffer), the quick-start guides, and notable projects built with slime, and it highlights the framework's developer guidance and its use in research-to-production workflows.