
Not sure I understand: wouldn't permissions prevent this? The user runs with `--dangerously-skip-permissions`, so they can expect wild behaviour. They should run with permissions enabled and a ruleset.


You could prevent this even with `--dangerously-skip-permissions` via a simple PreToolUse hook.
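For anyone unfamiliar: hooks are registered in the settings file, and a PreToolUse entry runs an arbitrary command before each matching tool call. Roughly like this (the matcher and script path are illustrative, not official; check the hooks docs for the exact schema):

```json
{
  "hooks": {
    "PreToolUse": [
      {
        "matcher": "Bash",
        "hooks": [
          { "type": "command", "command": "python3 /path/to/guard.py" }
        ]
      }
    ]
  }
}
```

As I understand the docs, if the guard command exits with code 2, the tool call is blocked and its stderr is fed back to the model, regardless of permission mode.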


Who knows whether permissions would prevent this? Anthropic's documentation on permissions (https://code.claude.com/docs/en/permissions) does not describe how permissions are enforced; a slightly uncharitable reading of "How permissions interact with sandboxing" suggests that they are not really enforced and any prompt injection can circumvent them.


With hooks you can enforce permissions much more concretely.


Perhaps they're more functional. Hooks are configured in the same settings file, which makes me pretty skeptical in the absence of explicit confirmation that they represent a stronger security boundary. (But of course, this is a fundamental challenge with LLM agent security - if you're using a well-aligned model that doesn't want to be prompt injected, how do you go about auditing something like this?)


Ya, they definitely can't stop everything. Nothing can be stopped if you allow Python, honestly. But hooks are guaranteed to fire on every tool use, so you can bake in explicit rejections for different patterns based on regex, which can catch a lot of nonsense.


The rules and permissions are no longer program flags, but plain text for the agent to "obey".


That's not what tool use permissions are. The LLM doesn't just magically spawn processes or run code. The Claude Code program itself does those things when the LLM indicates that it wants to. The program has checks and permissions that determine whether those things will actually be done or not.


Claude Code has a sandboxing functionality that works the way you're describing when you opt into it, but my understanding is that the Claude Code program in the default configuration does not second-guess the LLM's decisions on what it'd like to run. Has Anthropic said something to the contrary?



