So I mostly fried the SSD by using it to write and rewrite ML checkpoints and logs, which in turn made the device read-only. I somehow managed to migrate to a different SSD, probably using Clonezilla or something, but that messed up the bootloader, so I installed rEFInd in a new partition, configured it, and voilà, it works. It's scary because you have to do everything without seeing your system even half alive at any point in the process, but it's not actually hard, just copying data and installing/configuring a bootloader. Still, for a then 20-year-old at more or less his first job, my head was on fire for the 1.5 days this took.
By far the most difficult single thing I've ever had to fix that actually had to do with the system itself.
I no longer flood my SSDs with data that is constantly rewritten.
Big city for sure. I don't want to need a car, and I do want to be able to get groceries at 23:40 on a Saturday night. It's nice to have a group of 500k+ people actively trying to supply all of the needs and wants I might have.