this post was submitted on 16 Oct 2025
4 points (75.0% liked)

AI

5571 readers
32 users here now

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

founded 4 years ago
 

I’ve been wondering—why don’t we have an AI model that can take any piece of music, compress it into a super small “musical script” with parameters, and then generate it back so it sounds almost identical to the original? Kind of like MIDI or sheet music but way more detailed, capturing all the nuances. With modern AI, it seems like this should be possible. Is it a technical limitation, or are we just not thinking about it?

top 4 comments
sorted by: hot top controversial new old
[–] Lemvi@lemmy.sdf.org 8 points 1 day ago

We already have quite good methods for compression, both lossy and lossless. Any "AI" method would have to reliably beat the benchmarks set by those. Seems like that hasn't happened yet, though there definitely is research in that direction.

[–] Fiivemacs@lemmy.ca 7 points 1 day ago

because AI can't help but mess with shit. I've tried giving stuff like gpt an image, and telling it to do nothing to the image and give it back. poof.. it needed to add crap or change things for no reason.

tools to compress music already exist. use those, they work.

[–] SmokeInFog@midwest.social 2 points 1 day ago* (last edited 1 day ago)

Data compression pretty much already works like this. The most likely reason is that data compression given current encoding paradigms is already about as good as it can be. The kinds of parameter notations you speak of would probably not result in a greater compression than existing algorithms

[–] hoshikarakitaridia@lemmy.world 1 points 1 day ago* (last edited 1 day ago)

It depends on why you wanna do it.

Because smaller files are easier to handle and send? Sure, but that means lossy compression. Fundamentally auto encoders kind of do what you are saying but it turns out pure compression using AI has not been that useful recently, so they get used in a lot of other ways, for example encoding and decoding from and to latent space.

Or do you wanna do it so AI can make awesome music if you give it just some melody? Currently there are other ways to do it. Essentially there are ways to incorporate this functionality directly into music generation models, which is what some AI models like those from suno AI are doing already (afaik).

TL;DR because every solution needs a problem, and what you are describing hasn't been a big enough issue / priority to implement.