this post was submitted on 19 Jun 2025
28 points (96.7% liked)
Technology
1157 readers
54 users here now
A tech news sub for communists
founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Not sure if anybody is hosting it online aside from the official instance. Running locally or using something like GCP to host your own private model is basically the way to go. Unfortunately, smaller sized models you can run locally aren't quite as good as the full blown version. Still can be quite useful though, and they are rapidly improving as well. I've found qwen 3 to be the best local option right now.
Thanks for explaining. Is it easy to turn off censorship in qwen 3 ("non-coder" here but that doesn't matter as I will be starting that journey soon enough as I up my maths)? Also I thought the free version of deepseek r1 on the app is 7b?
You don't have to do anything, the open source models that are published don't censor themselves. And the online version of deepseek is the full model. They charge for API level access for it.
Excellent, thank you!
We are playing around with the idea on ProleWiki of automatically translating our pages to our different language instances, instead of having to do it ourselves which we're not doing in the first place lol. Would you possibly have any ideas for that? There's no limit to how much we could automate the process with API access, but I'm also wondering if we can do it cheaply (considering our funds) and which AI would be best for this. Might even want to look at running our own AI on someone's machine. Anyway, I'm taking any suggestion lol
I've found DeepSeek works pretty well for translating content, their API access is pretty cheap. The main limitation comes from the context size, smaller models can handle less text, so you'd have to feed it content in smaller chunks. That said, locally running models are pretty capable of doing these types of translations.
Here's an example node script you could use to call DeepSeek to translate a document:
Similarly, if you wanted to use a model like qwen3 with ollama for translations, you could do something like this:
thanks!