I use a model in the app SherpaTTS to read articles from rssaggregator Feedme
Asklemmy
A loosely moderated place to ask open-ended questions
If your post meets the following criteria, it's welcome here!
- Open-ended question
- Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
- Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
- Not ad nauseam inducing: please make sure it is a question that would be new to most members
- An actual topic of discussion
Looking for support?
Looking for a community?
- Lemmyverse: community search
- sub.rehab: maps old subreddits to fediverse options, marks official as such
- !lemmy411@lemmy.ca: a community for finding communities
~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~
I’ve used LLMs to reverse engineer some recipes.
Before it was hot, I used ESRGAN and some other stuff for restoring old TV. There was a niche community that finetuned models just to, say, restore classic SpongeBob or DBZ or whatever they were into.
These days, I am less into media, but keep Qwen3 32B loaded on my desktop… pretty much all the time? For brainstorming, basic questions, making scripts, an agent to search the internet for me, a ‘dumb’ writing editor, whatever. It’s a part of my “degoogling” effort, and I find myself using it way more often since it’s A: totally free/unlimited, B: private and offline on an open source stack, and C: doesn’t support Big Tech at all. It’s kinda amazing how “logical” a 14GB file can be these days, and I can bounce really personal/sensitive ideas off it that I would hardly trust anyone with.
…I’ve pondered getting back into video restoration, with all the shiny locally runnable tools we have now.
Do you have any recommendations for a local Free Software tool to fix VHS artifacts (bad tracking etc., not just blurriness) in old videos?
Tailored boilerplate code
I can write code, but it's only a skill I've picked up out of necessity and I hate doing it. I am not familiar with deep programming concepts or specific language quirks and many projects live or die by how much time I have to invest in learning a language I'll never use again.
Even self-hosted LLMs are good enough at spitting out boilerplate code in popular languages that I can skip the deep-dive and hit the ground running- you know, be productive.
It's good for boring professional correspondence. Responding to bosses emails and filling out self evaluations that waste my time
Employment. I got a job with one of the big companies in the field. Very employee-focused. Good pay. Great benefits. Commute is 8 miles. Smart, pleasant, and capable co-workers.
As far as using the stuff - nope. Don't use it at all.
I'm an author working on an online story series. Just finished S04. My editing was shit and I could not afford to pay someone to do it for me.
So I write the story, rewrite the story, put it through GPT to point out irregularities, grammatical errors, inconsistencies etc, then run it through Zoho's Zia for more checks and finally polish it off with a final edit of my own. This whole process takes around a year.
Overall, quality improved, I was able to turn around stuff quicker and it made me a lot more confident about the stuff I am putting out there.
I also use Bing image creator for the artwork and have seen my artwork improve dramatically from what Dream (Wombo) used to generate.
Now I am trying to save up to get a good GPU so that I can run Stable Diffusion so that I can turn it into a graphic novel.
Naturally I would like to work with an artist cause I can't draw but everyone I meet asks for 20 - 30k dollars deposit to do the thing. Collaborations have been discussed and what I've learnt is that as times get tough, people are requesting for greater shares in the project than I, the originator, have. At some point when I was discussing with an artist, he was side lining me and becoming the main character. I'm not saying that all artists are like this, but dang, people can be tough to deal with.
I respect that people have to eat, but I can't afford that and I have had this dream for years so finally I get a chance to pull it off. My dream can't die without me giving it my best so this is where I am with AI.
You don’t strictly need a huge GPU. These days, there are a lot of places for free generations (like the AI Horde), and a lot of quantization/optimization that gets things running on small VRAM pools if you know where to look. Renting GPUs on vast.ai is pretty cheap.
Also, I’d recommend languagetool as a locally runnable (and AI free/CPU only) grammar checker. It’s pretty good!
As for online services, honestly Gemini Pro via the AI Studio web app is way better and way more generous than ChatGPT, or pretty much anything else. It can ingest an entire story for context, and stay coherent. I don’t like using Google, but if I’m not paying them a dime…
Well, if I am going to push this into the project I envision, privacy is going to be key, so everything will be done locally. I have privacy concerns about running on someone else's hardware regardless of the provided guardrails and layers of protection I can provide for myself.
I used to use Languagetool and Scribens but found my current working model as the best for me at the moment. I will definitely look at options as I move to the next chapter so Languagetool is still an option. Also, I believe they went AI too? At least online?
Pasting code and error messages in saves time in debugging stupid mistakes.
It's good at paraphrasing paragraphs to contain no 'fifth glyphs'
That's a big bound forward from last I was looking at it! Avoiding that nasty glyph was notably not in its portfolio of tricks. It would say it was avoiding the fifth, but still slip many through.
Assuming that this discussion is about LLMs, anyway.
I had to instruct it to consult a script to know how many words did contain fifth glyphs, but it did work with that.
Sounds as though your script did all important work, not your AI.
It was vital to call on my LLM, it couldn't do it on its own, and my script can just count glyphs, not anything to do with words.
Apart from avoiding it?
Indeed. I can proudly say that I managed to renew my unfortunately required M365 without the unfortunately included CoPilot trash. And that’s no mean feat, it is a veritable quest through an everchanging maze of clickables to get it this way.
Makes a good litmus test