404 added an update saying the dataset was removed:
Update: Following the publication of this article on Tuesday evening, van Strien removed the dataset. "I've removed the Bluesky data from the repo," he wrote on Bluesky. "While I wanted to support tool development for the platform, I recognize this approach violated principles of transparency and consent in data collection. I apologize for this mistake."
It's also the place where you go to download models to use yourself instead of sending all your data to the most unscrupulous people possible, so at least they've got that going for them.