Where can i learn about concepts like prompt cache misses? I don't have a mental model how that interacts with my context of 1M or 400k tokens... I can cargo cult follow instructions of course but help us understand if you can so we can intelligently adapt our behavior. Thanks.
Thanks. Just noting that those docs say the cache duration is 5 min and not 1 hour as stated in sibling comment:
> By default, the cache has a 5-minute lifetime. The cache is refreshed for no additional cost each time the cached content is used.
>
> If you find that 5 minutes is too short, Anthropic also offers a 1-hour cache duration at additional cost.
Apparently Anthropic downgraded cache TTL to 5 min without telling anyone. My biggest issue with the recent issues with Claude Code is the lack transparency, although it looks like even Boris doesn't know about one:
https://news.ycombinator.com/item?id=47736476