OpenAI and Reddit Partnership (openai.com)

submitted 5 months ago* (last edited 5 months ago) by governorkeagan@lemdro.id to c/privacy@lemmy.ml

15 comments fedilink hide all child comments

It has finally happened...not surprised though.

you are viewing a single comment's thread
view the rest of the comments

[-] Wanangwa_Bamidele@thelemmy.club 3 points 5 months ago

How bad this could be ? Enlightning me please.

[-] ResoluteCatnap@lemmy.ml 2 points 5 months ago

Its not any different than how it already was. Initially the GenAI models were all being trained on masses of unlicensed data including data from reddit. The problem is some companies like New York Times are suing for training an LLM off of their data. So in response companies like OpenAI are now trying to reach partnerships that basically license the use of the data (that they already had). This also means that they will be able to continue to have future access to that data as long as the partnership is in place. Whereas some companies without a partnership could start to ban scraping activity or update their terms to forbid training AI off of their data.

Overall these partnerships are a good thing. Licensed training data is good. But from a privacy standpoint, the AI models were already trained on reddit data. This is just formalizing the relationship

this post was submitted on 17 May 2024

92 points (96.9% liked)

Privacy

31874 readers

405 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Posting a link to a website containing tracking isn't great, if contents of the website are behind a paywall maybe copy them into the post
Don't promote proprietary software
Try to keep things on topic
If you have a question, please try searching for previous discussions, maybe it has already been answered
Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
Be nice :)

Related communities

Chat rooms

[Matrix/Element]Dead
Discord

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago

MODERATORS