> 1hr -> 5min on March 6th This is not accurate. The main agent typically uses a...

throwdbaaway · 2026-04-12T15:28:10 1776007690

https://github.com/anthropics/claude-code/issues/46829#issue... - Have you checked with your colleague? (and his AI, of course)

fluidcruft · 2026-04-12T16:08:08 1776010088

Doesn't what's said at the link approximately agree? The 5m bug was said to be isolated to use of overage (API billing).

mvkel · 2026-04-12T20:47:25 1776026845

Then my original question stands: why did this become an issue seemingly overnight if nothing changed?

aaronblohowiak · 2026-04-12T15:36:31 1776008191

So if I run a test suite or compile my rust program in a sub agent I’m going to get cache misses? Boo.

skeledrew · 2026-04-12T17:04:49 1776013489

Sub agents don't have much context and don't stay around for long, so misses in that case are trivial.

HumanOstrich · 2026-04-12T22:55:38 1776034538

As of yesterday subagents were often getting the entire session copied to them. Happened to me when 2 turns with Claude spawned a subagent, caused 2 compactions, and burned 15% of my 5-hour limit (Max 5x).

aaronblohowiak · 2026-04-13T00:16:33 1776039393

how long they stay around after the cache miss is irrelevant if I am burning all the prior tokens again. also, how much context they have depends entirely on the task and your workflow. I you have a subagent implement a feature and use the compile + test loop to ensure it is implemented correctly before a supervisor agent reviews what was implemented vs asked then yes, subagents do have a lot of context.

skeledrew · 2026-04-14T12:12:41 1776168761

I'd say it's next to impossible to have a subagent doing a compile+test loop where at least 1 call doesn't get made to the API over multiple 5-minute stretches to keep the cache warm. In such a case it may just be the same as doing the compile+test manually and then having the agent troubleshoot any issues before iterating.

highd · 2026-04-12T19:34:02 1776022442

... so how do API users enable 1hr caching? I haven't found a setting anywhere.

g4cg54g54 · 2026-04-12T22:58:28 1776034708

would like to know this too ;D

there is env.ENABLE_PROMPT_CACHING_1H_BEDROCK - but that is - as the name says "when using Bedrock"

for the raw API the docs are also clear -> "ttl": "1h" https://platform.claude.com/docs/en/build-with-claude/prompt...

but how to make claude-code send that when paying by API-key? or when using a custom ANTHROPIC_BASE_URL? (requests will contain cache_control, but no ttl!)