In other news, BlueSky's put out a proposal on letting users declare how their data gets used, and the BlueSky post announcing it got some pretty hefty backlash - not for the proposal itself, but for the mere suggestion that users' posts were scraped by AI. Given this is the same site which tore HuggingFace a new one and went nuclear on ROOST, I'm not shocked.
Additionally, Molly White's put out her thoughts on AI's impact on the commons, and recommended building legal frameworks to enforce fair compensation from AI systems which make use of the commons.
Personally, I feel that building any kind of legal framework is not going to happen - AI corps' raison d'être is to strip-mine the commons and exploit them in as unfair a manner as possible, and they are entirely willing to tear apart any and all protections (whether technological or legal) to make that happen.
As a matter of fact, Brian Merchant's put out a piece about OpenAI and Google's assault on copyright as I was writing this.
...eh, fuck it, here's my sidenote on Brian's piece:
Google and OpenAI's campaign gives me the suspicion that the ongoing copyright lawsuits may be what finally pops this bubble. Large Language Models are built through large-scale copyright infringement, and built to facilitate large-scale copyright infringement - if the actions of OpenAI and pals are ruled not to be fair use, it would be open season on LLMs.