gobdovan's comments

You've got the blues.

Very interesting, especially the harness point: how much of performance is in the wrapper tools. (When I almost run out of credits, I switch to a smaller model and give it more structured prompts; very often gpt-5.4-mini with structure works better than gpt-5.4 with vibes.)

This inspired me to start a "skill distillery" [0] where I take good agent-workflow ideas and turn them into small, inspectable/installable skills.

The first one is dirac-workflow, based on Dirac's structural code workflow. It's not a Dirac clone, though: it has no runtime, persistent AST index, hash-anchor editing engine, or benchmark harness. Just a small AST helper and the workflow discipline as a portable skill.
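
For flavor, this is the kind of thing "a small AST helper" can mean; an illustrative sketch, not the actual helper from the repo (names and output format are made up):

    import ast
    import sys

    # Illustrative sketch: print a structural outline (classes, functions,
    # line spans) so an agent can navigate a file without reading all of it.
    def outline(path: str) -> list[str]:
        with open(path) as f:
            tree = ast.parse(f.read(), filename=path)
        rows = []
        for node in ast.walk(tree):
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
                kind = "class" if isinstance(node, ast.ClassDef) else "def"
                rows.append(f"{path}:{node.lineno}-{node.end_lineno} {kind} {node.name}")
        return rows

    if __name__ == "__main__":
        for row in outline(sys.argv[1]):
            print(row)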

I also dogfooded it on the Dirac repo itself and included a short report.

Would appreciate feedback from the original author on whether the prompts and tools [1] are representative.

[0] https://github.com/ouatu-ro/skill-distillery

[1] https://github.com/ouatu-ro/skill-distillery/blob/main/skill...


> asking a coding agent “why did you do that” to be illustrating a misunderstanding in the users mind about how the agent works

I think the same thing, but about agents in general. I am not saying that we humans are automata, but most of the time explanation diverges profoundly from motivation, since motivation is what generated our actions, while explanation is the process of observing our actions and giving ourselves, and others around us, plausible mechanics for what generated them.


> Agents also often respond well to user frustration signs, like threatening to not continue your subscription.

From the phrasing, I can't help but imagine you as a very calm, completely unemotional person who only emulates user frustration signs, strategically threatening the AI that you'll cancel your subscription when it nukes your code.


Pretty cool. Would be nice to have the equation system in a recap as well, and the math not collapsed by default. I also had to look up other resources to understand that the time correction refers to correcting a relatively short window of time, as it was not clear that the receiver clock is actually accurate enough over short periods (milliseconds) to be treated as affine.

So the trick, as always, boils down to engineering approximations, haha.


The receiver frequency is generally assumed to be accurate. In practice the time quantization is the far bigger error and masks any frequency fault. That time quantization is why most receivers report an accuracy of 3 m or 10 ns.

The time correction is better thought of as the fixed offset between the receiver clock and the GPS clock.
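
For anyone wanting the recap mentioned above: the standard system has four unknowns, the receiver position (x, y, z) and the clock offset b. A sketch of the usual formulation (not the article's exact notation):

    \rho_i = \sqrt{(x_i - x)^2 + (y_i - y)^2 + (z_i - z)^2} + c\,b,
        \quad i = 1, \dots, 4

Here the rho_i are the measured pseudoranges, the (x_i, y_i, z_i) are the known satellite positions, and c is the speed of light; solving the four equations yields both the position and b, the fixed offset in question.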


These kinds of articles make you feel like there are specific, actionable problems that just need an adjustment and then they disappear. However, the system is much worse than you'd expect. Studies like this are extremely valuable, but they don't address the systemic problem affecting all signaling channels: most signals themselves have been manufactured into a product.

Build a SaaS and you'll have "journalists" asking if they can include you in their new "Top [your category] Apps in [current year]"; you just have to pay $5k for first place, $3k for second, and so on (with a promotional discount for first place, since it's your first interaction).

You'll get "promoters" offering to grow your social media following, which is one reason companies may not even realize that some of their own top accounts and GitHub stars are mostly bots.

You'll get "talent scouts" claiming they can find you experts exactly in your niche, but in practice they just scrape and spam profiles with matching keywords on platforms like LinkedIn once you show interest, while simultaneously telling candidates that they work with companies that want them.

And in hiring, you'll see candidates sitting in interview farms (quite clearly in East Asia) connecting through Washington D.C. IPs, presenting themselves under generic European names in front of synthetic camera backgrounds, somehow acing every question, with every technology you mention in the job post already listed in their CVs (not hyperbole; I've seen exactly this happen).

If a metric or signal matters, there is already an ecosystem built to fake it, and faking it becomes operationalized, just another part of doing business.


Well put!

Have an upvote. The first one is free.


> If a metric or signal matters, there is already an ecosystem built to fake it, and faking it becomes operationalized, just another part of doing business.

https://www.xkcd.com/2899/


It all boils down to making more money.

That money spoils it is half right, but I think the more interesting part is where the money ends up and how that influences the system.

I'm increasingly convinced the issue isn't feedback itself, but centralized, global, aggregated feedback that becomes game-able without stronger identity signals.

Right now the incentives are tied (correctly or not) to these global metrics, so you get a market for faking them, with money flowing to whoever is best at juicing that signal.

If instead the signal were based on actual usage and attribution by actual developers, the incentives shift. With localized insight (think "yeah, I like Golang") the signal becomes both harder to fake and harder to game at the rollup level.

Useful reputation on the web is actually much more localized and personal. I gladly receive updates on, and would support, the repos I've starred. If I could choose where to put my dollars (I'm not an investor), the list would likely include the repos I've personally curated.

This suggests a different direction: instead of asking "how many stars does this have?", ask "who is actually depending on this, and in what context?", or better, retroactively compare your top-n repos to mine and we'll get a metric seen through our lenses. If you include everyone in that aggregation you'll end up where we are now, but if instead you choose the list, well, the stars could align as a good metric once more.

The interesting part is that the web already contains most of that information; we just don't treat identity as part of the signal (yet? universally?).
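
A toy sketch of what such an identity-weighted star metric could look like; all data, names, and the weighting scheme are made up:

    # Toy sketch: weight each stargazer by how much their own starred set
    # overlaps with mine, so bot accounts with random profiles count ~nothing.
    my_stars = {"tla-plus/examples", "ouatu-ro/skill-distillery", "golang/go"}

    stargazers = {  # hypothetical stargazers of the repo being scored
        "alice": {"golang/go", "tla-plus/examples", "rust-lang/rust"},
        "bob": {"some/botnet-target"},  # no overlap: likely noise or a bot
    }

    def jaccard(a: set, b: set) -> float:
        return len(a & b) / len(a | b) if a | b else 0.0

    def localized_score(mine: set, gazers: dict) -> float:
        return sum(jaccard(mine, theirs) for theirs in gazers.values())

    print(localized_score(my_stars, stargazers))  # 0.5: alice counts, bob doesn't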


Tangential, but I have more than 5k repos starred (according to my GitHub profile), organized into lists, yet the way I discover interesting stuff on GitHub these days is through people I follow. Follow interesting people, find interesting stuff. Sometimes it's that easy.

What's more, it became obvious to me two or so years ago that GitHub is slowly but surely going the way of LinkedIn. Lots of professionals are on there just because it's expected of them, some interact occasionally with the "social media" aspect of it, and fewer still really thrive on that part. Time will tell how this pans out, but just look how many developer and Linux influencers became huge on YouTube and other places this last year. Most of them barely had more than 10k subscribers 3 years ago, and now people look to them for their next tech stack and hot framework/tool/library/distro and so on.


Yeah, but it's not a great way to do it.

Short term, you pay the cost of fake signaling, which is simply deadweight loss. People spend resources to inflate signals instead of improving the actual thing.

Medium term, I suppose you could argue it increases consumption: users will sooner try something with 100k stars than with 2, GitHub wants to seem more used than it really is, and the repo owner benefits as well.

Long term, the correspondence between how important a (distorted) system is perceived to be (GitHub, OSS, IT in general) and how important it really is collapses quite abruptly and unnecessarily, and you end up with a market for lemons [0] where signals stop being reliable at all.

[0] https://en.wikipedia.org/wiki/The_Market_for_Lemons


Of course - money is a good proxy for value in these instances. Not perfect, but good.

Yeah, imagine how nature feels with all of the fake eyes and other fake predator signals like bright colors. Evolution finds a way.

A way for what? Evolution does not have a will, does not work for anybody in particular, and it's statistically likely to work against you: almost all species that ever existed went extinct, and almost no individual from just 15 generations ago has passed DNA down to people alive today. [0] [1]

[0] http://www.stat.yale.edu/~jtc5/papers/Ancestors.pdf [1] https://pubmed.ncbi.nlm.nih.gov/11542058/


In the end it's a company choice: you either buy BS metrics or you don't.

We've recently decided to complicate the lives of AI bots in our repo https://archestra.ai/blog/only-responsible-ai, hoping they will just move on to the AI startups that are easier to engage with.


About protocols in this vicinity: I've been noticing a missing piece in OSS around transport as well. In Python, you often need incompatible dependency sets in one app, and the usual choices are either ad-hoc subprocess RPC that gets messy over time, or HTTP / containers that are overkill and force you to change your deployment strategy.

I ended up building a protocol for my own use around a very strict subprocess boundary for Python (initially at least; the protocol is meant to be universal). It has an explicit payload shape plus timeout and error semantics. I already went a little too far beyond my use case with deterministic canonicalization for some common-pitfall data types (I think pickle users would understand, though). It still needs some documentation polish, but if anyone would actually use it, I can document it properly and publish it.
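
Not the actual protocol, but a minimal sketch of the subprocess-boundary idea it grew from: length-prefixed JSON frames over stdin/stdout, with an explicit timeout and error field (the framing and field names here are illustrative only):

    import json
    import struct
    import subprocess

    def call(worker_cmd: list[str], payload: dict, timeout: float = 5.0) -> dict:
        # One framed request/response: 4-byte big-endian length, then JSON.
        frame = json.dumps(payload).encode()
        proc = subprocess.Popen(worker_cmd, stdin=subprocess.PIPE,
                                stdout=subprocess.PIPE)
        try:
            out, _ = proc.communicate(struct.pack(">I", len(frame)) + frame,
                                      timeout=timeout)
        except subprocess.TimeoutExpired:
            proc.kill()
            raise
        (length,) = struct.unpack(">I", out[:4])
        reply = json.loads(out[4:4 + length])
        if reply.get("error"):  # explicit error semantics
            raise RuntimeError(reply["error"])
        return reply["result"]

    # The worker lives in its own venv with its own (conflicting) deps, e.g.:
    #   call(["other-venv/bin/python", "worker.py"], {"op": "embed", "text": "hi"})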


There are a few more semantic families: Verilog, Petri nets and variants, Kahn process networks and dataflow machines, process calculi, reactive programming, term rewriting, constraint solvers/theorem provers (not the same as Prolog), probabilistic programming.

Plus up-and-coming (actually production-ready) languages that don't fit perfectly in the 7 categories: Unison, Darklang, temporal dataflow, DBSP.

It may feel like a bit of cheating to mention the above ones, as most are parallel to the regular von Neumann machine setup, but I've been meaning for a while to do an article on 'all ways we know how to compute (beyond von Neumann)'.


> I've been meaning for a while to do an article on 'all ways we know how to compute (beyond von Neumann)'.

Would be very glad to read this.

In the meantime, I reproduce a part of an article by Steve Yegge:

---

What Computers Really Are

Another realization I had while reading the book is that just about every course I took in my CS degree was either invented by Johnny von Neumann, or it's building on his work in mostly unintelligent ways.

Where to start? Before von Neumann, the only electronic computing devices were calculators. He invented the modern computer, effectively simulating a Universal Turing Machine because he felt a sequential device would be cheaper and faster to manufacture than a parallel one. I'd say at least 80% of what we learned in our undergrad machine-architecture course was straight out of his first report on designing a programmable computer. It really hasn't changed much.

He created a sequential-instruction device with a fast calculation unit but limited memory and slow data transfer (known as the infamous "von Neumann bottleneck", as if he's somehow responsible for everyone else being too stupid in the past 60 years to come up with something better. In fact, Johnny was well on his way to coming up with a working parallel computer based on neuron-like cellular automata; he probably would have had one in production by 1965 if he hadn't tragically died of cancer in 1957, at age 54.)

Von Neumann knew well the limitations of his sequential computer, but needed to solve real problems with it, so he invented everything you'd need to do so: encoding machine instructions as numbers, fixed-point arithmetic, conditional branching, iteration and program flow control, subroutines, debugging and error checking (both hardware and software), algorithms for converting binary to decimal and back, and mathematical and logical systems for modelling problems so they could be solved (or approximated) on his computing machine.

-Steve Yegge, Math Every Day

https://archive.ph/6tOQF


Von Neumann may possibly have been the smartest man to ever live, but giving him credit for all of this is too much; it brushes aside many other inventors (often working independently, to their credit).

> term rewriting

In uni we had to make a spreadsheet software.

I volunteered to do the formula parser, thinking it sounded like a fun challenge.

I was stumped for a week, until I realized I could rewrite the formulas into a form I knew how to parse. So it would rewrite 1+1 into ADD(1,1) and so on.

I also refused to learn regex, so the parsing code was "interesting" ;)

I recall a comment from a colleague. "Okay, Andy says it works. Don't touch it." XD

Guy from another group used regex and his solution was 20x shorter than mine.


Regular expressions are probably not enough for parsing formulas (depending, of course, on the exact task given); formula languages are usually at least context-free.

Regular expressions are definitely enough for turning characters into tokens, after which a simple recursive descent parser is vastly more straightforward to write. Lexing is optional, but generally advised.
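
To make that concrete, a compact sketch in Python (the anecdote's actual language is unknown): a regex lexer plus recursive descent, emitting the ADD(1,1)-style prefix form from the story above:

    import re

    TOKEN = re.compile(r"\s*(\d+|[()+\-*/])")  # the regex does the lexing

    def tokenize(s):
        tokens, pos = [], 0
        while pos < len(s):
            m = TOKEN.match(s, pos)
            if not m:
                raise ValueError(f"bad character at {pos}: {s[pos]!r}")
            tokens.append(m.group(1))
            pos = m.end()
        return tokens

    def rewrite(tokens):
        # Recursive descent: one function per precedence level.
        pos = 0
        def peek():
            return tokens[pos] if pos < len(tokens) else None
        def eat():
            nonlocal pos
            pos += 1
            return tokens[pos - 1]
        def atom():
            tok = eat()
            if tok == "(":
                inner = expr()
                if eat() != ")":
                    raise ValueError("missing )")
                return inner
            return tok
        def term():
            left = atom()
            while peek() in ("*", "/"):
                op = {"*": "MUL", "/": "DIV"}[eat()]
                left = f"{op}({left}, {atom()})"
            return left
        def expr():
            left = term()
            while peek() in ("+", "-"):
                op = {"+": "ADD", "-": "SUB"}[eat()]
                left = f"{op}({left}, {term()})"
            return left
        return expr()

    print(rewrite(tokenize("1+2*3")))  # ADD(1, MUL(2, 3))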

> plus up-and-coming (actually production-ready)

Plus up-and-coming (not quite production-ready IMO, but used in production anyway): ChatGPT and the like.

Of course, it's debatable whether they are programming languages, but why wouldn't they be? They aren't deterministic, but I don't think determinism is a must for a programming language, and they are used to let humans tell computers what to do.


Logic programming in S9 Scheme:

https://www.t3x.org/amk/index.html

You can just get the code without buying the book, learn with Simply Scheme or any other book, and apply the functions from the code; the solvers are really easy to understand.


Also, Sussman's propagators are nice to check out [0].

[0] The Art of the Propagator (MIT URL down for the moment)


Great list of languages that don't fit the conventional families. I've been curious about some of them, like Petri nets and term rewriting, and will enjoy exploring the others.

Found a working link to the paper about propagators.

The Art of the Propagator, Alexey Radul and Gerald Jay Sussman. https://groups.csail.mit.edu/mac/users/gjs/6.945/readings/ar... (PDF)


Pure [https://agraef.github.io/pure-lang/] is probably the most "practical" term rewriting language, though Mathematica is the most used one by far.

<snark>So they reinvented spreadsheets?</snark>

When I present TLA+ [0], I reference game pauses (the pause-buffer / item-duplication exploit in Legend of Zelda, Dark Souls menuing to cancel animations) and deliberate crashes as mechanics for exploiting games: those are still valid actions that can be modeled, and not modeling them is what lets such exploits happen.

A system is only correct relative to the transition system you wrote down. If the real system admits extra transitions that you care about (pause, crash, re-entry, partial commits), and you didn't model them, then you proved correctness of the wrong system.

[0] https://lamport.azurewebsites.net/video/videos.html
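
As a toy illustration of that point (in Python rather than TLA+, with made-up shop semantics): an invariant that "holds" only because the model omits the pause-quit transition:

    from collections import deque

    # Toy shop transaction: the item is handed over first, then gold is debited.
    # Invariant: you never end up idle holding both your gold and the item.
    def successors(state, allow_pause_quit):
        gold, item, phase = state
        if phase == "idle" and gold >= 1 and item == 0:
            yield (gold, item + 1, "granted")   # step 1: item handed over
        if phase == "granted":
            yield (gold - 1, item, "idle")      # step 2: gold debited, committed
            if allow_pause_quit:
                yield (gold, item, "idle")      # pause-quit: abort sale, keep item

    def check(allow_pause_quit):
        init = (1, 0, "idle")
        seen, frontier = {init}, deque([init])
        while frontier:
            s = frontier.popleft()
            gold, item, phase = s
            if phase == "idle" and gold == 1 and item == 1:
                return s                        # invariant violated
            for nxt in successors(s, allow_pause_quit):
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append(nxt)
        return None

    print(check(False))  # None: the idealized model "proves" the invariant
    print(check(True))   # (1, 1, 'idle'): the duplication exploit state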


Do you have any of these presentations available publicly? I'm always amused by the glitch names people come up with (force quit wrong warps, skirtless parry-walks...) and it'd be fun to see them in a TLA+ context.

Disclaimer for the book, and for category theory in general: most books are optimized for people who already master mathematics at an undergraduate level. If you're not familiar with algebraic structures, linear algebra, or topology, be prepared to learn them along the way from different resources.

Category theory is also not that impressive unless you already understand some of the semantics it is trying to unify. In this regard, the book presents, for example, the initial-object property as trivial at first glance, when in fact it does not simply hold for arbitrary structures.
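
Concretely (standard facts, not the book's exact presentation): an object 0 is initial when every object receives exactly one arrow from it,

    \forall A \in \mathrm{Ob}(\mathcal{C}) \quad \exists!\, f : 0 \to A

and whether such an object exists depends on the category: in Set the empty set is initial, in Grp the trivial group is, but the category of fields has no initial object at all, since there are no homomorphisms between fields of different characteristic.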


_Conceptual Mathematics_ is basically category theory for high school students.

https://www.amazon.com/Conceptual-Mathematics-First-Introduc...


That looks fantastic actually. Thanks very much for the recommendation.
