Power consumption of LLM's
Summary
A community discussion on the power consumption of large language models, questioning whether major vendors publish per-token energy metrics and training costs. The thread cites a GPT-OSS energy post and a JP Morgan data-center study, discusses rough energy per token estimates, and debates context size, efficiency, and environmental impact.