I was the only one who handed in a solution for that particular problem; it was scored 70 out of 100. I no longer have my solution, but I doubt it was very accurate, and I didn't have time for experiments.
For long agent sessions, I would expect a very high cache hit rate unless you're editing the system prompt, tools, or history between turns, or some turns take longer than the cache timeout.
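A rough way to see why: these caches key on the prompt prefix, so appending new turns preserves the cached span, while editing anything earlier in the prompt (system prompt, tool definitions, prior history) invalidates everything after the edit. A minimal sketch, character-level for simplicity (real caches work on tokens and block boundaries; the function name here is made up):

```python
def cached_fraction(prev_prompt: str, curr_prompt: str) -> float:
    """Fraction of the current prompt covered by the longest prefix
    shared with the previously-cached prompt."""
    shared = 0
    for a, b in zip(prev_prompt, curr_prompt):
        if a != b:
            break
        shared += 1
    return shared / len(curr_prompt) if curr_prompt else 0.0

# Appending a turn keeps the entire previous prompt as a shared prefix,
# so the cached fraction stays high as the session grows:
turn1 = "SYSTEM: be helpful\nUSER: hi\n"
turn2 = turn1 + "ASSISTANT: hello\nUSER: next question\n"
print(cached_fraction(turn1, turn2))  # high: everything before the new turn hits

# Editing the system prompt between turns breaks the shared prefix early,
# so almost nothing after the edit can be served from cache:
edited = turn2.replace("be helpful", "be terse")
print(cached_fraction(turn2, edited))  # low: mismatch occurs near the start
```

The same reasoning explains the timeout caveat: if a turn takes longer than the cache's TTL, the prefix is evicted and the next request pays full price even though nothing changed.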
So long as perceived LLM skill is still "spiky" - i.e. showing relatively high variation in ability within a domain (often depending on the task or user, to be fair) - people will continue to dismiss it.
In the general case I think this is a great idea - if we do a good job of documenting intent etc. in commit messages, agents have an easier time understanding why lines of code exist, with no additional specs/mechanisms/etc.
Interested to see what techniques in this area pull ahead and gain traction!
Interesting that non-salty water didn't make the string conductive(?) enough - I'd have thought there might be enough soluble material in the string.
Also, I believe this person runs the ISP I use (Andrews & Arnold - I couldn't speak more highly of it).
For me, a lot of the draw is that it's cheaper than managed DB services for my small/toy projects (the ones I don't want to use DynamoDB for). That, and in a previous job it was useful as relatively temporary multi-tenant storage.
The partner for these projects has a benchmark that the top frontier LLM labs seem to be running on their new model releases - I think there's _some_ value to these numbers in helping people compare and contrast model performance.
Seems like pure profit-maximizing/greed!