[-] WrittenWeird@lemmy.world 78 points 1 year ago* (last edited 1 year ago)

The current breed of generative "AI" won't 'die out'. It's here to stay. We are just in the early Wild-West days of it, where everyone's rushing to grab a piece of the pie, but the shine is starting to wear off and the hype is juuuuust past its peak.

What you'll see soon is the "enshittification" of services like ChatGPT as the financial reckoning comes, startup variants shut down by the truckload, and the big names put more and more features behind paywalls. We've gone past the "just make it work" phase, now we are moving into the "just make it sustainable/profitable" phase.

In a few generations of chips, the silicon will have caught up with the compute workload and the cost per task will drop. That's the innovation to watch now: who will dethrone Nvidia and its H100?

[-] GenderNeutralBro@lemmy.sdf.org 30 points 1 year ago

This is why I, as a user, am far more interested in open-source projects that can be run locally on pro/consumer hardware. All of these cloud services are headed down the crapper.

My prediction is that in the next couple years we'll see a move away from monolithic LLMs like ChatGPT and toward programs that integrate smaller, more specialized models. Apple and even Google are pushing for more locally-run AI, and designing their own silicon to run it. It's faster, cheaper, and private. We will not be able to run something as big as ChatGPT on consumer hardware for decades (it takes hundreds of gigabytes of memory at minimum), but we can get a lot of the functionality with smaller, faster, cheaper models.
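To make that concrete, here's a minimal sketch of what a small, locally-run model looks like in practice, using Hugging Face's `transformers` library. The model name is a deliberately tiny example; in practice you'd swap in whatever small open model actually fits your hardware:

```python
# Minimal local text-generation sketch with Hugging Face transformers.
# "distilgpt2" is just a toy-sized example (~82M parameters); swap in
# any small open model that fits your RAM/VRAM.
from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")

out = generator(
    "Smaller local models are appealing because",
    max_new_tokens=40,
)
print(out[0]["generated_text"])
```

No cloud round-trip and no API key: the weights download once, and everything after that runs on your own machine.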

[-] WrittenWeird@lemmy.world 9 points 1 year ago

Definitely. I have experimented with image generation on my own mid-range RX GPU, and though it was slow, it worked. I have not tried the latest driver update that's supposed to accelerate those tools dramatically, but local AI workstations with dedicated silicon are the future. CPU, GPU, AIPU?
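For anyone curious what that experiment looks like, here's a rough sketch using the `diffusers` library. It assumes a ROCm build of PyTorch on AMD cards (which are still exposed under the "cuda" device name), or a regular CUDA build on Nvidia:

```python
# Local Stable Diffusion sketch with Hugging Face diffusers.
# Assumes a ROCm build of PyTorch on AMD (the device is still
# called "cuda"), or a standard CUDA build on Nvidia.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    torch_dtype=torch.float16,  # roughly halves VRAM use vs float32
)
pipe = pipe.to("cuda")

image = pipe("a cabin in a snowy forest, oil painting").images[0]
image.save("out.png")
```

On a mid-range card this takes on the order of tens of seconds per image. Slow, like I said, but entirely local.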

[-] nodsocket@lemmy.world 3 points 1 year ago

Wait, you guys don't already have hundreds of gigabytes of memory?

[-] GenderNeutralBro@lemmy.sdf.org 6 points 1 year ago

Technically I could upgrade my desktop to 192GB of memory (4x48). That's still only about half the amount required for the largest BLOOM model, for instance.
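The back-of-the-envelope math, for anyone wondering where "about half" comes from (BLOOM's largest variant is ~176B parameters; bytes per parameter depends on the precision you load it at):

```python
# Memory needed just to hold the weights, ignoring activations and
# other inference overhead. BLOOM's largest variant has ~176B parameters.
params = 176e9
bytes_per_param = 2  # float16/bfloat16

weights_gb = params * bytes_per_param / 1e9
print(f"weights alone: ~{weights_gb:.0f} GB")         # ~352 GB
print(f"192 GB covers about {192 / weights_gb:.0%}")  # ~55%
```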

To go beyond that today, you'd need to move beyond the Intel Core or AMD Ryzen platforms and get something like a Xeon. At that point you're spending 5 figures on hardware.

I know you're just joking, but figured I'd add context for anyone wondering.

[-] p03locke@lemmy.dbzer0.com 2 points 1 year ago

Don't worry about the RAM. Worry about the VRAM.
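For GPU inference the weights have to fit on the card itself, and consumer cards top out around 24 GB. A quick sketch to see what you've actually got, assuming PyTorch with a CUDA or ROCm device:

```python
import torch

# Report each visible GPU's total memory; the weights must fit here
# (or be quantized/offloaded) for fast inference.
if torch.cuda.is_available():
    for i in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(i)
        print(f"{props.name}: {props.total_memory / 1e9:.1f} GB VRAM")
else:
    print("no GPU visible; you'd be falling back to system RAM")
```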

[-] nodsocket@lemmy.world 4 points 1 year ago

Google drive is my swap space

[-] _number8_@lemmy.world 4 points 1 year ago

GPT has already gotten way shittier, from the version we all saw when it first came out to the heavily curated, walled-garden version in use now
