Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Not exactly true ... KV and prompt caching is a thing


Assuming you include the same prompts in the new request that were cached in the previous ones.


As far as I understand, the entire chat is the prompt. So at the each round, the previous chat up to that point could already be cached. If I'm not wrong, Claude APIs require an explicit request to cache the prompt, while OpenAI's handle this automatically.


I don't understand how you are downvoted…




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: