LLMCap – A proxy that hard-stops LLM API calls when you hit a dollar cap
Summary
LLMCap offers a proxy-based solution to enforce hard dollar caps on LLM API usage across multiple providers. When a configured cap is reached, the proxy returns HTTP 429 and ensures no tokens are billed, helping teams avoid surprise bills. It supports streaming, per-model/per-key caps, a 3-day trial, and both hosted and planned self-hosted options.