this post was submitted on 14 Jul 2025
19 points (100.0% liked)

Privacy

39869 readers
353 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
 

The issue with Google's personalised search results is, imo:

  1. Not only is it not opt-in, but you can't even opt out of it. Personalised search results should be opt-in and disabled by default.
  2. The data kept on you is used to sell you ads
  3. The data kept on you will be handed over to state entities fairly easily

Given those three problems, how feasible would it be to self-host a search engine that personalises your results to show you things that are more relevant to you? Avoiding issues 1 & 2 as you're self-hosting so presumably you have made the decisions around those two things. And issue 3 is improved as you can host it off-shore if you are concerned about your domestic state, and if you are legally compelled to hand over data, you can make the personal choice about whether or not to take the hit of the consequences of refusing, rather than with a big company who will obviously immediately comply and not attempt to fight it even on legal grounds.

A basic use-case example is, say you're a programmer and you look up ruby, you would want to get the first result as the programming language's website rather than the wikipedia page for the gemstone. You could just make the search query ruby programming language on any privacy-respecting search engine, but it's just a bit of QoL improvement to not have to think about the different ways an ambiguous search query like that could be interpreted.

you are viewing a single comment's thread
view the rest of the comments
[–] CameronDev@programming.dev 13 points 15 hours ago (6 children)

Self hosting search engines is very hard. The scraping, indexing and storage requirements are immense. You could definitely self-host a front end (with your QoL improvements), but the back end search engines (Bing/Google/etc) will be able to track you all the same.

[–] HappyFrog@lemmy.blahaj.zone 4 points 15 hours ago (4 children)

Are there even open source indexing software available?

[–] Max_P@lemmy.max-p.me 6 points 15 hours ago

There's YaCy. I've run a node for a while but it ended up filling up my server's drive just indexing german wikipedia and the results were terrible.

And it's still not private because you have to broadcast the query across the network.

load more comments (3 replies)
load more comments (4 replies)