It seems like they realized the value of user content for AI training too late, and are trying to hamfistedly lock down the golden egg goose for their upcoming IPO.
Pushshift.io used to host a complete repository for Reddit. I think there were archives on the internet archive as well. There are numerous torrents with terabytes of text content for training AIs. Perhaps they might lock it down going forward, but language training horse fled and was eaten by coyotes a decade ago.