-32
submitted 9 months ago* (last edited 9 months ago) by wargreymon2023@sopuli.xyz to c/showerthoughts@lemmy.world
  1. Reddit sells its api for high and is about to go for an IPO, its economy bases entirely on the data made by the users/communities. It is the work of the public, get robbed by a small group of individuals. A living example of capitalism.

  2. Fediverse isn't enough to secure the publicity and usage of public data. What if the host of Lemmy instance also releases the snapshots of all the posts and modlogs, everyday, in the form of bittorrent? Only by doing so, we are safe from the host erasing public knowledge and data brokers.

top 16 comments
sorted by: hot top controversial new old
[-] originalucifer@moist.catsweat.com 20 points 9 months ago

its already public information that can be scraped with a custom instance, no torrent needed. not really sure what your concern is here

[-] wargreymon2023@sopuli.xyz -5 points 9 months ago* (last edited 9 months ago)

already public

By the host, that host could be Spez

Data is more valuable than ever, bc it can be used to train AI

[-] originalucifer@moist.catsweat.com 15 points 9 months ago

the fediverse itself is broadcasting all the information publicly. there is no expectation of privacy.

[-] wargreymon2023@sopuli.xyz -3 points 9 months ago* (last edited 9 months ago)

It is 1% better, but it is irrelevant.

The instance can blacklist other instances, the data stored by the host doesn't have to be published. There is also no integrity check of what would be the public data.

The host can do anything it wants with the data, unless we scarpe as much data as we can and release it in p2p network.

[-] originalucifer@moist.catsweat.com 9 points 9 months ago

if youre that concerned, run your own instance and create this torrenting thing you believe needs to exist.

i doubt you'll get much traction

[-] INeedMana@lemmy.world 7 points 9 months ago

Good. Let the data flow freely

[-] Bezier@suppo.fi 6 points 9 months ago

I don't quite understand what you're going for.

How does this protect us from data brokers?

Can't you already pull the data of an entire instance via the api?

If you don't trust your instance admins, have you considered setting up your own?

[-] wargreymon2023@sopuli.xyz 0 points 9 months ago* (last edited 9 months ago)

Can't you already pull the data of an entire instance via the api?

You could ask the host through the api to do that, and that's the same the problem with Reddit. What we have changed here on Lemmy is more instances to choose from, the public knowledge grows with the instance and I wouldn't expect the host to let you access the data for free as the value(communities) grows.

There is real value in the data, way more than ever, bc it is the source for LLM.

[-] Bezier@suppo.fi 4 points 9 months ago* (last edited 9 months ago)

So you fear that instance admins would start closing their apis. It would've been real helpful if that was on the on the original post.

The api is accessible and I don't see that changing because it is, y'know, required for federation. If you want backups, you can start scraping or host a private instance that subscribes to everything you want to save.

Releasing dumps the way you described would be a massive burden on admins, or even completely infeasible.

[-] wargreymon2023@sopuli.xyz 0 points 9 months ago

Releasing dumps the way you described would be a massive burden on admins, or even completely infeasible.

It adds work but it is actually very easy, the texts and images aren't that big. The burden is eased by more people engage in seeding(hosting) the data.

[-] Bezier@suppo.fi 5 points 9 months ago

Well, let's just say that I doubt you'll be able to sell this idea to instance admins.

However, you can become the change you wish to see yourself. You can start scraping today and create your own dumps that way; the apis are yet to be closed.

[-] Brewchin@lemmy.world 2 points 9 months ago

Think of the problem being solved. The Fediverse solves multiple problems, but most notably ensuring that our contributions won't be paywalled by some corporate grifter. The post and comment data itself is free and open, subject only to TOS and regional legislation.

If you consider your conversations valuable, stick with something like secure messaging application groups. And then hope nobody in that group does what you imagine in your second point.

[-] hahattpro@lemmy.world 1 points 8 months ago

Anything public in the internet or allow anyone signup is scraped-able, or sellable.

If you want a true private Lemmy, then meet some real life friends, host your own forum that only allow friends read stuff.

this post was submitted on 21 Feb 2024
-32 points (10.0% liked)

Showerthoughts

29522 readers
552 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. A showerthought should offer a unique perspective on an ordinary part of life.

Rules

  1. All posts must be showerthoughts
  2. The entire showerthought must be in the title
  3. Avoid politics
    1. NEW RULE as of 5 Nov 2024, trying it out
    2. Political posts often end up being circle jerks (not offering unique perspective) or enflaming (too much work for mods).
    3. Try c/politicaldiscussion, volunteer as a mod here, or start your own community.
  4. Posts must be original/unique
  5. Adhere to Lemmy's Code of Conduct-----

founded 1 year ago
MODERATORS