Yeah, this seems like a textbook case where one could apply Jevons Paradox.

OK well now you have to look at changing the economy-wide energy mix or embracing de-growth. Switching data centers to 100% solar or nuclear or … solves Wired’s complaint but not this one.

> Also avoid their object store.

Curious as to why you say this. I’m using litestream to backup to Hetzner object storage, and it’s been working well so far.
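
For reference, the setup is roughly this (bucket name and the Hetzner endpoint are placeholders for your own values; litestream picks up the endpoint from the host part of the replica URL):

    # credentials for the S3-compatible target (placeholders)
    export LITESTREAM_ACCESS_KEY_ID=<key>
    export LITESTREAM_SECRET_ACCESS_KEY=<secret>

    # continuously replicate the SQLite DB to the bucket
    litestream replicate /var/lib/app/db.sqlite \
        s3://my-bucket.fsn1.your-objectstorage.com/db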

I guess it’s probably more expensive than just a Storage Box?

Not sure but I also don’t have to set up cron jobs and the like.


Historical reliability and compatibility. They claimed to be S3-compatible, but they required deprecated S3 SDKs, and advanced S3 features are unimplemented (at least they document that [0]). There were constant timeouts on object creation and updates, very slow speeds, and overall instability. Even now, if you check out r/hetzner on Reddit, you'll see it's a reliability nightmare (take that with a grain of salt, though; nobody reports the absence of problems). Not as relevant for DB backups, but billing is dumb too: even if you upload a 1KB file, they charge you for 64KB.

At least with Storage Box you know it's just a dumb storage box. And you can SSH, SFTP, Samba and rsync to it reliably.
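
For example (hostnames are placeholders, and check Hetzner's Storage Box docs for the SSH port for your box):

    # object storage: any S3 client works if you override the endpoint
    aws s3 ls s3://my-bucket --endpoint-url https://fsn1.your-objectstorage.com

    # Storage Box: plain rsync over SSH
    rsync -av -e "ssh -p 23" ./backups/ uXXXXXX@uXXXXXX.your-storagebox.de:backups/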

[0] https://docs.hetzner.com/storage/object-storage/supported-ac...


Is a Framework Desktop with >48GB of RAM a good machine to try this out?


Only for chat sessions, not for agentic coding. It's just too slow to be practical (10 minutes to answer a simple question about a 2k LoC project - and that's with a 5070 add-on card).


This article is about a MoE model with only 4B active parameters; it shouldn't take 10 minutes to answer a question about a small project.

I measured a 4-bit quant of this model at 1300 t/s prefill and ~60 t/s decode on a Ryzen AI Max+ 395.
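
If you want to reproduce numbers like that, llama.cpp's llama-bench reports prefill (pp) and decode (tg) throughput separately; the model file here is just an example:

    # prefill 512 tokens, then generate 128; prints t/s for each phase
    llama-bench -m ./model-q4_k_m.gguf -p 512 -n 128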


Doesn't the Framework Desktop have a Ryzen AI Max+ 395? That's a unified memory architecture, like the Macs.


Ah, forgot to add: it's not really "unified", you have to explicitly specify your allocations. You may have a reasonably good 48GB chunk assigned to the GPU, but that DDR5 is 5-10 times slower than GDDR/HBM, and the GPU itself isn't stellar.

So, Framework laptops are great for chatting but nearly useless for agentic coding.

My Radeon W7900 answers a question ("what is this project?") in 2 minutes; my Framework 16 with the 5070 add-on takes around 11 minutes, and without the add-on around 23 (Qwen 3.5 27B, Claude Code).


That's discrete DDR5; it's not as fast as regular VRAM.
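
Back-of-the-envelope: dual-channel DDR5-5600 is about 2 channels × 8 bytes × 5600 MT/s ≈ 90 GB/s, while a W7900's GDDR6 does around 864 GB/s - roughly a 10x gap, which matches the timings above.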


Nix is definitely taking off though ;)


For me it’s org-mode. Although now that I think of it, there’s a Neovim implementation I’ve been meaning to try.


And so it begins.


Go on...


I'm curious, what do you think the future of the car industry is, then?


Have you tried it? I’ve been meaning to.


Yes. Somewhat expensive given it's web-only (no API), but it works very well and new features are added continuously.


> there are at least a dozen companies that provide non-Anthropic/non-OpenAI models in the cloud

Do you have some links?

Also I assume the privacy implications are vastly different compared to running locally?


Throw a rock and you'll hit one... Groq (not Grok; Elon took the name), Mistral, SiliconFlow, Clarifai, Hyperbolic, Databricks, Together AI, Fireworks AI, CompactifAI, Nebius, Featherless AI, Hugging Face (they do inference too), Cohere, Baseten, DeepInfra, DeepSeek, Novita AI, OpenRouter, xAI, Perplexity Labs, AI21, OctoAI, Reka, Cerebras, Fal AI, Nscale, OVHcloud AI, Public AI, Replicate, SambaNova, Scaleway, WaveSpeedAI, Z.ai, GMI Cloud, Tensorwave, Lamini, Predibase, FriendliAI, Shadeform, Qualcomm Cloud, Alibaba Cloud AI, Poe, Bento LLM, BytePlus ModelArk, InferenceAI, IBM watsonx.ai, AWS Bedrock, Microsoft, Google
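
Most of them expose an OpenAI-compatible endpoint, so trying one is usually just a base URL and an API key. E.g. with OpenRouter (the model ID here is just an example):

    curl https://openrouter.ai/api/v1/chat/completions \
      -H "Authorization: Bearer $OPENROUTER_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{"model": "mistralai/mistral-small", "messages": [{"role": "user", "content": "Hello"}]}'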


I use Ollama Cloud. $20/mo and I never come close to hitting quota (YMMV obviously).

They don't log anything, and they use US datacenters.


For privacy-preserving direct inference: Fireworks AI, Nebius.

Otherwise, OpenRouter for routing to lots of different providers.


OpenRouter, for example; it has both open and closed models.


The ideas in the update were previously explored by Gwern 2 years ago: https://www.lesswrong.com/posts/PQaZiATafCh7n5Luf/gwern-s-sh...


Specifically, Cochrane wrote:

> On reflection I have started to worry again. In 10 to 20 years nobody will read anything any more, they just will read LLM digests. So, the single most important task of a writer starting right now is to get your efforts wired in to the LLMs. Nothing you write will matter if it is not quickly adopted to the training dataset. As the art of pushing your results to the top of the google search was the 1990s game, getting your ideas into the LLMs is today’s. Refine is no different. It’s so good, everyone will use it. So whether refine and its cousins take a FTPL or new Keynesian view in evaluating papers is now all determining for where the consensus of the profession goes.

For more recent comments, see https://dwarkesh.com/p/gwern-branwen https://gwern.net/llm-writing https://www.lesswrong.com/posts/34J5qzxjyWr3Tu47L/is-buildin... https://gwern.net/blog/2025/ai-cannibalism https://gwern.net/blog/2025/good-ai-samples https://gwern.net/style-guide

The scaling will continue until morale improves. I advise people to skate to where the puck will be, and to ask themselves: "if I knew for a fact that LLMs could do something I am doing in 1-2 years, would I still want to do it? If not, what should I be doing now instead?"

