Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The Chinese models are distilled from GPT and Claude, so it's not like China would pull ahead if those companies went away for six months. They really are at the forefront of innovation right now, as much as I hate to think of the consequences of this (a single company owning a superintelligence is basically a nightmare scenario for me).
 help



Don't worry, if someone truly achieves superintelligence it won't be controlled by anyone for long.

There will be a blinding flash which signals the superintelligence singularity. When the smoke clears, you'll see a 50-foot tall Altman/Borg hybrid. He is about to destroy humanity with his death ray. Suddenly, a 50-foot tall Musk/Borg hybrid appears out of nowhere, and stops Altman just in time. Then they work together to destroy all humans.

Seems our best hedge in that case is Levi Ackerman.

That's my other nightmare scenario :P

Just imagine how inexpensive paperclips will become, there is always a silver lining.

We will finally have achieved abundance.


Not just abundance, we will have the maximum amount of paperclips possible.

I think that’s the realm of conspiracy theories. There are also not only Chinese alternatives- Mistral in Europe is doing pretty good in several categories they’ve opted to focus on.

This kind of reiterates the parent’s question I think - people are maybe too focused on the gpt/claude model and forget about all the other ways of using the tech.


Is it? I thought it was pretty well established that open models were distilled from the proprietary, frontier ones. Maybe I'm wrong.

It's well established that the companies who own the proprietary frontier models complain loudly that open models are distilled from theirs.

There's surely some truth to it (and it's well deserved), but it's happening in every direction.


No, that is not well established at all, and generalizing all open models under that inaccurate umbrella doesn't really help anyone.

i don't buy this. distilled how? you don't get access to logprobs, and the thinking traces are fake and compressed. it's an expensive way to get potentially substandard training data.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: