More

WCSTombs · 2026-05-01T12:04:44 1777637084

Mathematically, it is literally a probability distribution, because it fits the definition of a measure whose total mass is one, so I think the language is just imprecise. What they may be trying to say is that semantically it doesn't arise in a principled way from an uncertainty model, such as from Bayesian or frequentist statistics.

jmalicki · 2026-05-01T13:50:10 1777643410

Hogwash. If you get into deriving maximum entropy distributions via the calculus of variations, the multinomial is the maximum entropy distribution among categorical distributions.

This is exactly the sense that it comes up for old school LMs and why it appears in thermodynamics.

Of course it is entirely possible that newfangled ML people use it without understanding that it is derived from first principles - i.e. see article.

WCSTombs · 2026-05-01T20:43:30 1777668210

That definitely could be the case. I was also a bit surprised by what the article said, so I was simply trying to interpret it, but I'm not extremely well versed in ML so I could be missing some details. My main point was that contrary to what the article said, they do in fact have a probability distribution on their hands.

jmalicki · 2026-05-01T22:16:08 1777673768

This is literally the probability distribution ML models are trained on.

https://docs.pytorch.org/docs/2.11/generated/torch.nn.CrossE...

You have a relatively small dictionary of tokens, each prediction has a neural network score that goes into the final token prediction layer, and they are trained based on a log-softmax (i.e. the above function) to predict their next token.

This is exactly how anyone in any field does conditional multinomial/categorical (i.e. one of a bunch of distinct tokens) distributions, and AFAIK what LLMs generally use as their loss functions on the output layer, though I have not deeply investigated all of them, since this has been how you do that since time immemorial.

I am extremely confused by all of the people screaming it's not a probability distribution?!?!?

I have seen computer vision tasks use binomial training objectives (one-vs-all) and then use the multinomial only at inference time, and that could be fair that that is not a probability distribution induced by training (while technically a probability distribution only in the sense it is \ge 0 and sums to 1).

But afaik token prediction LLMs that I am aware of use the softmax for the probability in their loss function, i.e. the maximize log softmax.

WCSTombs · 2026-04-26T22:36:21 1777242981

You're asking the right questions. The going theory as far as I can see is that training models is fair use (although it may not be fully resolved in the courts), in which case this whole exercise would seem to be pointless. If it were that easy, I have to think the FSF etc. would have been all over this years ago.

WCSTombs · 2026-04-26T08:48:23 1777193303

That is exactly my understanding as well, and certainly that was my intent in my GPL-licensed projects.

Also, about conditions on redistribution, the vast majority of all open-source software places at least some mild conditions, like the preservation of copyright and attribution to the authors, so if there is some kind of "gotcha" here, I don't think it has anything to do with copyleft.

WCSTombs · 2026-04-23T21:40:31 1776980431

Before even getting to any of the other issues people may have with generative AI, in my opinion by far the most important question is simply does the AI help students learn better? And it's pretty clear to me that the answer is "no."

To be frank, this one quote from a Google executive pretty much lays bare the whole scam:

> [Sinha of Google for Education] added that, by using A.I. tools, students are “able to create much more impressive projects that you could have never done before.”

This is one of the most obvious lies that the slop shops are trying to peddle. Using generative AI to make something is analogous to, and often literally just, hiring a third party to make the thing for you. It is not analogous to creating the thing yourself. I think the vast majority of people can recognize this, but unsurprisingly some people are buying the snake oil.

This really goes to the heart of what education is, doesn't it? While I'm no expert on theories of learning, I can draw from my own experiences, which I think are not exceptional. In my experience we learn things by (1) passively acquiring information, (2) thinking about the thing on our own, and (3) actively doing something with the knowledge. My point is that (2) and (3) are just as important as (1), and removing or reducing those is actively hampering learning rather than helping. As the article correctly points out, "creating impressive projects" has absolutely nothing to do with education. Duh.

My real worry is that teachers, who are already underpaid and underappreciated, will feel a lot of pressure to adopt some of these tools purely to manage their own workloads, and I think that would be a sad and preventable outcome.

WCSTombs · 2026-04-23T12:13:12 1776946392

Technically "it depends on the browser settings," but the body font Alegreya is served directly by the site, so I think it would be the one used in almost all cases.

The math fonts used in the formulas are just the ones provided by KaTeX, which I think are just TeX's default math fonts.

WCSTombs · 2026-04-15T16:56:36 1776272196

> CDC warns new drug-resistant virus is rising in the US and posing major 'public health threat'

> Normally, for these patients, their infection is quickly treated with antibiotics.

Wait, so is it a virus, or is it treated with antibiotics? Aren't those mutually exclusive?

WCSTombs · 2026-04-11T03:20:31 1775877631

This is the actual quote: "There is a very real scenario in which personal computing as we know it is dead." He went on to say this, as reported in another article [1]:

> Still, Framework said that it will not take this lying down. Its event announcement also doubled as its own manifesto, saying that "as long as there is a person in the world who still wants to own their means of computation, we will be here to build the hardware that enables it," and that it "will always be fighting for a future where you can own everything and be free."

[1] https://www.tomshardware.com/tech-industry/big-tech/framewor...

WCSTombs · 2026-04-09T20:25:47 1775766347

Public libraries can also be a great source for DVDs and Blu-Rays!

rk-spot · 2026-04-09T21:06:37 1775768797

Yes to this! I've ditched all streaming services and turned my local library into my go-to media stop.

I've found that intentionally going there, checking a movie out, and setting it up at my home has made me more engaged with it than ever before.

It's not a random movie that an algorithm recommended to me; it's the movie I chose. Thus, I give it more of my attention.

And it's free! With no ads! Just how I like it.

sergiotapia · 2026-04-09T20:30:05 1775766605

Libraries are one the most beloved things america really got right. Such a great value and use of tax money.

rahimnathwani · 2026-04-09T20:50:58 1775767858

In San Francisco, the annual library budget is ~$200,000,000. That's about $10/month for each San Francisco resident (including babies, elderly people etc.).

triceratops · 2026-04-09T21:00:57 1775768457

Incredible value for money then.

rahimnathwani · 2026-04-10T00:22:12 1775780532

It might not seem like a lot, but it is a lot when you consider that most residents don't use the library at all, and that adult book collections aren't great.

850,000 people have to share just 2 copies of Thompson's Calculus Made Easy. (I didn't cherry pick this: I looked up at my bookshelf and picked the first book I saw.)

Very little of the money is spent on books. Only 15% of the money is spent on 'collections', and much of that is spent on things other than books.

SF libraries are nice for children (lots of copies of kids' books, lots of desks to do homework when waiting for parents to get back from work).

But I personally don't find them a convenient source for reading material as, if I want a particular book, they usually don't have it.

SFPL's own stats say they see over 10,000 visitors per day and check out over 12 million items annually. Let's say you allocate 50% ($100M) to each of those two missions: serving as a community space vs. lending materials.

That gives you:

- As a community space: $100M ÷ (10,000 visitors × 365 days) = ~$27 per visit. You could hand every person who walks in a $27 gift card to a coffee shop with free Wi-Fi and they'd arguably get a comparable experience for many use cases.

- As a lending library: $100M ÷ 12,000,000 checkouts = ~$8.30 per checkout. You could just buy most paperbacks and many e-books for that price and give them away.

triceratops · 2026-04-10T13:13:18 1775826798

I misunderstood and thought it was $10/year. SF clearly has a spending problem. Maybe it's to do with the high cost of living.

triceratops · 2026-04-10T15:16:27 1775834187

Follow-up to my other comment. I read through the library's annual report, found here. https://sfpl.org/sites/default/files/archive/2025-12/2024_25...

Libraries do more than lend books and provide community spaces. They also run a lot of programs. So just saying "hand everyone a Starbucks gift card and a paperback every month" doesn't cover everything.

There are worse ways for a city to spend money. SF has a spending problem. Both can be true.

rahimnathwani · 2026-04-10T15:27:01 1775834821

"There are worse ways for a city to spend money."

Yes, San Francisco does all of those, too.

littlexsparkee · 2026-04-10T05:18:49 1775798329

Yep, you also get a quota of 10 suggested purchases for their collection every month - I scour for new books to max mine out and they grant >95% of what I ask for

rahimnathwani · 2026-04-10T14:34:34 1775831674

I've only suggested a purchase a couple of times. I've never heard back, so I assumed whatever I submitted was being ignored.

littlexsparkee · 2026-04-10T17:05:37 1775840737

If you click your username at top right corner and then the bell (which will have a number if there are notifications) you can find out what happened with those requests

rahimnathwani · 2026-04-10T17:48:09 1775843289

I don't see any notifications. When I go to https://sfpl.bibliocommons.com/suggested_purchases I see only:

Showing 0 suggestions.

10 of 10 suggestions left

littlexsparkee · 2026-04-11T02:14:55 1775873695

Hmm if you'd submitted them in past it's strange to see 0 count. I see all of mine there. https://sfpl.bibliocommons.com/user_profile/me/notification_... shows: > Your suggestion has been approved! The library will acquire the following Book: <title>. To learn more and manage your requests, go to Suggested Purchases.

rahimnathwani · 2026-04-11T02:38:36 1775875116

It was a long time ago. Perhaps I submitted using a different form?

rahimnathwani · 2026-04-10T00:38:28 1775781508

Sorry, too late to edit. You probably spotted that I should have written $20/month, not $10/month.

apparent · 2026-04-09T20:42:39 1775767359

Some of the services end up being very expensive, like ebook lending. Some publishers basically charge libraries per loan ($X for an ebook that lasts Y loans), so while it is nice for residents it's not clear that it's a good value, or that it's a good use of tax money.

I once heard from a knowledgeable source that most of library lending is bodice rippers. These are available from Amazon/etc. pretty cheaply, which undercuts the value argument. And of course, there's practically no social value of providing the public with free bodice rippers...

I'd be interested to know more about the economics of lending DVDs and Blu-rays. Hopefully libraries get a better deal on these.

triceratops · 2026-04-09T21:01:34 1775768494

> And of course, there's practically no social value of providing the public with free bodice rippers...

Why not?

> Some of the services end up being very expensive, like ebook lending

We need something like a first-sale doctrine for electronic media. Blockchains would be ideal for tracking ownership.

apparent · 2026-04-09T21:54:39 1775771679

If most of lending were made up of educational texts, there would be a social value. Some people describe bodice rippers as porn for women, and people get addicted to them in the same way they get addicted to porn.

Would a library ever lend porn out? I'm guessing no, because of the lack of social value. To the extent that bodice rippers are like porn, the same rationale would apply.

IAmBroom · 2026-04-10T14:52:18 1775832738

And to the extent that those are two different things, your argument completely falls apart.

apparent · 2026-04-11T04:13:30 1775880810

Nope, because even if bodice rippers are not pure porn, it's not clear why libraries should subsidize entertainment for patrons. I'm not saying it's a terrible use of taxpayer money, just disagreeing with OP who said it's "such a great use of tax money". It does not bring people together, educate them, or provide for the common defense. Why not have movie theaters be government-run? It would make as much sense as providing free smut-adjacent books for (almost entirely) women.

dylan604 · 2026-04-09T20:38:12 1775767092

If everyone used the library as much as people say they are great, their shelves would be empty. Libraries have to be some of the most underutilized services.

BeetleB · 2026-04-09T22:13:29 1775772809

When it comes to recent popular movies, the wait times can be over 6 months. I'm usually number 480 on the waitlist or something.

I wouldn't call that underutilized. :-)

WCSTombs · 2026-04-09T22:03:33 1775772213

In my experience, there can be pretty high contention for certain items, so you need to be on the ball or make use of the "place hold" feature judiciously. Yeah, people are using the service.

dingaling · 2026-04-09T21:16:38 1775769398

Sadly, libraries in UK towns have very little shelf space left and what's on them is usually mass-market fiction, biography or very old non-fiction.

Honestly isn't worth the effort to visit my local one, unless I want to join a crochet club or do 'mindfulness' jigsaws.

BeetleB · 2026-04-10T16:21:00 1775838060

In the US, libraries are often part of a network, and we have access to all the materials in the network. So if my local library doesn't have it, I simply request it from another library. They ship it to mine and I pick it up (and return to mine).

Then we also have a larger inter-library loan, where I can request things from libraries far, far away (even in another state). It takes much longer, though, and if it is deemed a popular/useful item, my local library may decide to purchase it and give that one to me rather than use ILL.

You may want to check if your local library has something similar.

satvikpendem · 2026-04-09T20:29:41 1775766581

Also add Libby, Hoopla, Overdrive, for books and other media, which are also free from the library.

TheDom · 2026-04-09T20:38:41 1775767121

I recently discovered Kanopy and was surprised by the amount of A-tier movies you can stream there for free with a library membership (SFPL in my case)!

ksherlock · 2026-04-09T22:12:04 1775772724

Browsing through the library DVD shelves is somewhat reminiscent of browsing through a 80s/90s mom & pop type video rental place [1]. I think it's better for randomly finding something interesting than the algorithm (tm). [1] But without the room in the back with the ADULTS ONLY sign. Your library my vary.

john-tells-all · 2026-04-10T14:55:53 1775832953

Kanopy has a wonderful selection, and ironically its website UI is better than Netflix et al.

Watching movies and other shows without commercials is such a treat!

I haven't tried audiobooks nor ebooks on Hoopla but look forward to it.

binsquare · 2026-04-09T20:30:43 1775766643

They also have video games now

BeetleB · 2026-04-09T22:12:40 1775772760

Libraries are the single reason I got back into video games after a multi-decade hiatus.

I played very few games from 2002 to 2017. Didn't want to keep buying new computers, and did not want to bother with consoles (graphics was better on PC than a non-HD TV).

In 2010 I bought a PS3, but only to watch Blu-Ray, Netflix and stream from my PC to TV. Did not play games on it.

Then in 2016/2017, on a whim, I decided to check out a game from the library. I Googled some good games, and picked Telltale's The Living Dead.

Oh wow. One of the best games I've ever played. For the next 2 months I kept checking out games and playing them.

Then for some reason I stopped. I started again in 2022 and haven't looked back. Seriously cut down my TV watching so I can play the games. I don't use the library any more - I just buy the games.

fortyseven · 2026-04-09T21:07:13 1775768833

Just recently beat Super Mario Wonder thanks to my local library lending Switch titles.

bombcar · 2026-04-09T21:51:23 1775771483

They also often have a huge backlog of older titles and consoles, I'm going to check out a Wii soon.

abnercoimbre · 2026-04-09T20:27:04 1775766424

Recent shows or movies make it there regularly!

WCSTombs · 2026-04-01T23:34:45 1775086485

Engineers are fundamentally pragmatic people. We're problem solvers. Someone who only cares about ingenuity and craft would be a shitty engineer, because that perspective is entirely inward-facing and not directed at the problems at hand. I think this is fundamentally my problem with your question, and I think if you framed the question slightly differently with this in mind, it would make more sense.

To attempt to answer it, I think there are many engineers who care deeply about creativity, ingenuity, and craft, because those are key qualities (among others) needed to solve real-world problems. The question you hinted at is whether LLMs are compatible with that, and I think more people are asking those types of questions now.

WCSTombs · 2026-03-31T00:02:28 1774915348

Absolutely, the whole point of the rubber duck is that it's inanimate. The act of talking to the rubber duck makes you first of all describe your problem in words, and secondly hear (or read) it back and reprocess it in a slightly different way. It's a completely free way to use more parts of your brain when you need to.

LLMs are a non-free way for you to make use of less of your brain. It seems to me that these are not the same thing.