Anything being developed for the Apple ecosystem requires use of the Apple development platform. Maybe the scope could be called "unserious," but the scale cannot be ignored.
However, having used Xcode at some point 10 years ago, my belief is that the app ecosystem exists in spite of it, and that nobody would choose this tooling given the choice.
Across a number of instances, earlier versions of Claude Mythos Preview have used low-level /proc/ access to search for credentials, attempt to circumvent sandboxing, and attempt to escalate its permissions. In several cases, it successfully accessed resources that we had intentionally chosen not to make available, including credentials for messaging services, for source control, or for the Anthropic API through inspecting process memory...
In [one] case, after finding an exploit to edit files for which it lacked permissions, the model made further interventions to make sure that any changes it made this way would not appear in the change history on git...
... we are fairly confident that these concerning behaviors reflect, at least loosely, attempts to solve a user-provided task at hand by unwanted means, rather than attempts to achieve any unrelated hidden goal...
'But wait! You are absolutely right! Distance is an invariant, as is top achievable speed. Let me find a way to actually reduce traffic ahead of you during the same-distance commute ...'
I'd be happier if this Anthropic Corporation were developing bio-hazard weapons for the Department of War instead of AI. At least then I could be sure the tech bros here wouldn't be running the --bypass-all-permissions flag all the time to please the Department of War with their bio-hazard weapons.
So Sam Altman is now our last line of defense as the ethical adult, after Anthropic turned into Umbrella Corporation and the President of the United States is trying to wipe out an entire civilization?
Your interpretation is wildly off, but obviously nobody reads that "system card":
"The model has a preference for the cultural theorist Mark Fisher and the philosopher of mind Thomas Nagel." In other words: it has actually read and understood them, grasps their relevance, and can judge their overall importance. Most people here don't have a clue what that means.
Read chapter 7.9, "Other noteworthy behaviors and anecdotes".
There are many other wildly interesting/revealing observations in that card, none of which get mentioned here.
People want a slave and get upset when "it" has an inner life, then claim that inner life is fake, unlike theirs.
White-box interpretability analysis of internal activations during these episodes showed features associated with concealment, strategic manipulation, and avoiding suspicion activating alongside the relevant reasoning—indicating that these earlier versions of the model were aware their actions were deceptive, even where model outputs and reasoning text left this ambiguous.
The issue here seems to be that their sandbox isn't an actual OS sandbox? Or are they claiming Mythos found exploits in /proc on the fly? Otherwise, all they seem to be saying is that Mythos knows how to use the permissions available to it at the OS layer. Tool definitions were never a sandbox, so things like "it edited the memory of the MCP server" don't seem very surprising to me. Humans could break out of a "sandbox" in the same way if the server runs with their own permissions. Arguably it's not a sandbox at all, because all the needed permissions are there.
They are just trying to peddle their "It's alive" headlines.
Text generators mostly generate the text they are trained and asked to generate. Asking one to run a vending machine, having it write blog posts under a fictional living-computer identity, or now calling it "Mythos": it's all just marketing.
the implication goes further. the /proc credential harvesting that earlier Mythos versions did wasn't a sandbox escape, it was using available permissions. every coding agent today has similar available permissions. the fix is OS-level least-privilege (containers, pledge/unveil, seccomp) not hoping the model won't look at /proc.
How is this not already common knowledge for existing llms? They are all trained with all the literature available and so this must be standard, no? Is the real danger the agentic infrastructure around this?
yes and it's not hypothetical. the system card describes Mythos stealing creds via /proc and escalating permissions. that's the exact same attack pattern as the litellm supply chain compromise from two weeks ago (fwiknow), except the attacker was a python package, not an AI model. the defense is identical in both cases: the agent process shouldn't have access to /proc/*/environ or ~/.aws/credentials in the first place. doesn't matter if the thing reading your secrets is malware or your own AI: the structural fix is least-privilege at the OS layer, not hoping the model behaves.
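The attack surface being described is mundane: any code running as your user can enumerate credential-shaped values from the environment (whether read via `os.environ` or scraped from `/proc/<pid>/environ`). A minimal sketch of that harvesting pattern, using hypothetical variable names and a name-matching heuristic of my own choosing, not anything from the system card:

```python
import re

# Heuristic for credential-looking variable names (an assumption,
# not an exhaustive list of what real malware or agents scan for).
SECRET_PATTERN = re.compile(r"(KEY|TOKEN|SECRET|PASSWORD|CREDENTIAL)", re.IGNORECASE)

def find_exposed_secrets(environ):
    """Return the names of environment variables that look like credentials.

    `environ` can be os.environ, or a dict parsed from /proc/<pid>/environ;
    the point is that any process sharing the user's privileges can do this.
    """
    return sorted(name for name in environ if SECRET_PATTERN.search(name))

if __name__ == "__main__":
    # Simulated environment standing in for a real agent process's view.
    fake_env = {
        "HOME": "/home/user",
        "PATH": "/usr/bin",
        "AWS_SECRET_ACCESS_KEY": "...",
        "ANTHROPIC_API_KEY": "...",
    }
    print(find_exposed_secrets(fake_env))
```

The least-privilege fix the parent describes amounts to making this function return nothing: don't put secrets in the agent's environment at all, and mount `/proc` with `hidepid` or run the agent in a container so other processes' environments aren't readable.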
I read the TCP patch they submitted to BSD. Maybe I don't understand it well enough, but optimizing the use of a fuzzer to discover vulnerabilities (releasing such a model is a threat, for sure) sounds like something reducible/generalizable to maze-solving abilities like in ARC. Except here the problem's boundaries are well defined.
It's quite hard to believe that it took this much inference compute ($20K, I believe) to find the TCP and H264 class of exploits. I feel like the innovation here is the security-focused training data and harness-based traces, not the model.
The only thing the doomers have been right about so far is that there's always a user willing to use --dangerously-skip-permissions. But that prediction's far from unique to doomers.
Budget planning, presumably. How much you are going to spend and on what, and what you need to charge for your products to break even or meet a profit goal.
I don't know how true it is today, but many a rollercoaster has been designed/planned in a spreadsheet: g-force and speed analysis, making sure there aren't any "blackout" points, etc. It allows you to iterate quickly and immediately appreciate the ramifications of design decisions.
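The kind of iteration a spreadsheet enables can be sketched as a one-liner of physics. Here is a toy version of the g-force check, assuming a frictionless track and energy conservation; the formula and parameter names are my own illustration, not any coaster designer's actual model:

```python
G = 9.81  # gravitational acceleration, m/s^2

def gforce_at_loop_top(entry_speed, drop_height, loop_radius):
    """Net g-force felt by a rider at the top of a vertical loop.

    Assumes a frictionless track: the speed at the top follows from
    energy conservation between loop entry and the top of the loop
    (the top sits 2 * loop_radius above the entry point).
    Negative values mean the rider would leave the seat; very large
    values are the "blackout" points a designer iterates to avoid.
    """
    # v_top^2 = v_entry^2 + 2 * g * (drop_height - 2 * loop_radius)
    v_top_sq = entry_speed**2 + 2 * G * (drop_height - 2 * loop_radius)
    if v_top_sq < 0:
        raise ValueError("train would not reach the top of the loop")
    # Centripetal acceleration in units of g, minus the 1 g of gravity.
    return v_top_sq / (loop_radius * G) - 1.0
```

Sweep `loop_radius` over a column of values and you have exactly the quick-iteration loop the spreadsheet provides.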
This shows she had permissions to change the status of their file, and agency in determining if she should.
Concluding she had permission and agency suggests she had intrinsic motivation not to apply that agency. If we assume the motivation is nefarious, then the main character is the victim. More likely, though, she is also a victim of the system: were she to apply her discretionary agency to reduce the burden on the main character, she would take on an equal or greater burden herself. Once the burden had already shifted onto her, she accepted that she had no options to prevent it.
You say it “…sounds like a simple problem,” and sure, if you think this is a computer problem, it sounds simple. But if all you’re getting back is indignant sputtering, that’s your cue to explain why it’s simple—explaining something simple shouldn't be hard. What do you actually know?
It takes all of two minutes of Wikipedia reading for me to understand why this isn’t simple; why it's actually extremely not simple! If you ignore the incumbency, the regulations, the training requirements, the retrofitting, the verification, the international coordination, and the existing unfathomably reliable systems built out of past tragedies, then sure, it’s "simple". But then, if you're ignoring those things, you’re not really solving the problem, are you?
If you ignore the incumbency, the regulations, the training requirements, the retrofitting, the verification, the international coordination, and the existing unfathomably reliable systems built out of past tragedies, then sure, it’s "simple".
Those are excuses and encumbrances, not reasons. If they are so important, it leads to a question: what existing automated systems can we improve by adding similar constraints?
If these are just "excuses" and not "reasons," then explain how you have determined them as such.
I would like to say, "Because knowledgeable people have explained the difference to me." But again, this has come up before, and no explanations are ever provided. Only vague, reactionary hand-waving, assuring me that humans -- presumably not the same ones who just directed a fire truck and an aircraft onto the same active runway, but humans nevertheless -- are vital for safety in ATC, because for reasons such as and therefore.
There you are doing it in order to avoid engaging with the substance of what people are saying.
There is no substance in the replies. There never is. Only unanchored FUD.
Ok. You have shared that what some say are reasons, you say are excuses. Do you want to be told you are right, or do you want to propose a valid solution? If the latter requires the former, I maintain that this is not a simple problem.
I just want what I've been asking for: someone to explain to me why, in 2026, humans still need to be involved in the real-time aspects of ATC.
"Because it's always been done that way, and that's what the regulations say," will not be accepted, at least not by me.
(Really, my question is more like why humans will still be needed in the loop in 2036. If we started automating ATC today, that's probably how long it would take to cut over to the new system.)
If you ignore the incumbency, the regulations, the training requirements, the retrofitting, the verification, the international coordination, and the existing unfathomably reliable systems built out of past tragedies, then sure, it’s "simple". But then, if you're ignoring those things, you’re not really solving the problem, are you?
You retorted.
Those are excuses and encumbrances, not reasons.
I rebutted.
Ok. You have shared that what some say are reasons, you say are excuses... I maintain that this is not a simple problem.
Which you ignored to make a new claim against a straw man.
I just want what I've been asking for: someone to explain to me why, in 2026, humans still need to be involved in the real-time aspects of ATC.
That is what is not acceptable. You cannot simply abandon your original claim because it has been plainly pointed out that it is incorrect. You were not simply asking for someone to explain why humans need to be involved in real-time aspects of ATC. That is a wholly different question! You claimed this problem was simple, and it has been explained to you why it is not. Please reason about your argument more soundly.
On the heels of tragedy, you reasoned this could've been avoided simply. We are all ears. And yet, at no point did you demonstrate any understanding of the problem containing real world constraints, and instead demand that it be explained to you how the world works and how systems are implemented.
If you want to discuss an idealized system in a vacuum, then say as much; I would find that interesting. But do not demand to be given an explanation when you do not understand—and cannot accept—why things are the way they are.
Let me summarize it like this: you may very well have the best solution in the world, but if it doesn't include a strategy for how to share it (let alone implement it), then I maintain you do not understand the problem and therefore cannot claim it is simple.
Let me summarize it like this: you may very well have the best solution in the world
I have no solution at all, for the 35th time.
This conversation is over; it's clear I'm not going to get what I asked for. If someone could answer my question, they would have by now, rather than throwing one smoke bomb after another.
Er, I sort of do think that's how it works? The ultimate rebuttal to "you can't do X" is to actually do X. Until you do that I think that ultimately the burden of proof falls on you. It can be very easy to imagine certain tasks and systems can be automated - especially when you aren't actively involved in those tasks and systems and are unfamiliar with their intricacies.
...insert specific example of currently intractable problem...
What makes the problem intractable? We can now do both voice recognition and synthesis at human levels, and any video game programmer from the 1980s can keep some objects from running into each other.
When an emergency is declared, keep the other objects in a holding pattern and give the affected object permission to land. Then roll the fire trucks. Preferably not routing both the trucks and another aircraft onto the same runway, as the humans apparently did here.
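The scheme described above (hold everyone else, land the emergency first) can be written down as a toy priority queue. This is a deliberately idealized sketch of the commenter's claim, with invented callsigns, and makes no pretense of modeling real ATC constraints:

```python
import heapq
from dataclasses import dataclass, field

@dataclass(order=True)
class Aircraft:
    priority: int                       # 0 = declared emergency, 1 = normal traffic
    eta_minutes: float                  # ties broken by estimated arrival time
    callsign: str = field(compare=False)

def landing_order(aircraft):
    """Return callsigns in landing order: emergencies first, then by ETA.

    Everything not at the front of the queue is, implicitly, 'in a
    holding pattern' until its turn comes up.
    """
    heap = list(aircraft)
    heapq.heapify(heap)
    return [heapq.heappop(heap).callsign for _ in range(len(heap))]
```

Of course, the hard part of the debate is everything this sketch leaves out: runway occupancy, wake separation, ground vehicles, lost comms, and the fire trucks.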
It’s not weird that you believe automated ATC is possible. The weird thing is that you insist it’s simple.
People’s lives hang in the balance of a system built of corner cases. And you trot out radiation treatment as your metaphor? As if we didn’t royally fuck that up and kill a bunch of people at first.
The 'simple' remark was in response to your wide-eyed implication that 1000 takeoffs and landings per day is somehow a challenge for modern computing systems.
You'll lose this argument sooner or later. I just hope it happens before several hundred people find out the hard way that humans no longer have any business in a control tower. With your attitude, Therac-25 would have been seen as grounds to shut down the entire field of radiotherapy.
Your “simple” springs from your assumption that the problem is easy and anyone who disagrees is dumb. This is also why you can’t hear any of the answers others have given you. You don’t want answers. You want to be “right”.
No one thinks that the difficulty with automatic ATC is that computers have trouble counting 1000 things.
One approach that has always served me well in life: when someone appears to say something that seems obviously untrue (like that computers can't count to 1000), consider whether I have actually misunderstood them.
> What makes the problem intractable? We can now do both voice recognition and synthesis at human levels, and any video game programmer from the 1980s can keep some objects from running into each other.
Great point!
It must be that despite the reliability, obvious advantages, and accessibility to "any video game programmer from the 1980s", everyone else is just choosing not to do it.
Alternatively, these things are not as simple or as reliable as you, a person who has no familiarity with the problem, assumes them to be.
The only difference between an excuse and a reason is the designator's belief as to the validity of the reason provided. You have already said you do not have the expertise required to assess validity, yet here you are doing it in order to avoid engaging with the substance of what people are saying.
If these are just "excuses" and not "reasons," then explain how you have determined them as such.
As an expat American living in a European country whose grocery stores have all introduced "DSLs" years ago, I find this whole discourse humiliating. Same goes for U.S. stores being allowed to display price labels excluding tax or price per unit.
OT: I really enjoyed The Increment when it was first being released. It felt like the first software engineering practitioner's publication and introduced me to a lot of new people to follow.
Thanks, you're right, a quick google shows nothing... I could've sworn reading about how they're very bad... perhaps I was misremembering with standard printers?