In which microwavegang already did the job better. Due the full subreddit of mmmmmmmmm, it causes training data that touches it to devolve into all mmmmmmm whenever there's enough m's in a sentence

[–] saigot@lemmy.ca 4 points 3 months ago (1 children)

If it was done with enough regularity to eb a problem, one could just put an LLM model like this in-between to preprocess the data.

[–] Azzu@lemm.ee 4 points 3 months ago (2 children)

That doesn't work, you can't train models on another model's output without degrading the quality. At least not currently.

[–] Vashtea@sh.itjust.works 1 points 3 months ago* (last edited 3 months ago)

I don't think he was suggesting training on another model's output, just using ai to filter the training data before it is used.

[–] FooBarrington@lemmy.world 1 points 3 months ago

No, that's not true. All current models use output from previous models as part of their training data. You can't solely rely on it, but that's not strictly necessary.