this post was submitted on 21 Aug 2025
Microblog Memes
Yeah, but LLMs don't train on data automatically. Training is a separate, dedicated process; it won't happen just from using them. So companies can still train on your data in the background even if you never directly use an LLM, and they can also choose not to train on it even when you do. I guess when you are using one they have a bigger incentive to train on your data than otherwise, but to me it seems basically the same thing privacy-wise.
If they're exposing their LLM to the public, there's a higher chance of it leaking training data to the public. You don't know what they trained it on, but there's a chance it's customer data. Sure, they may not have trained on any of it, but why assume they didn't? An internal LLM is of lesser concern, because it would probably only show employees data they already have access to.