Depends on your goals. For raw tokens per second, yeah you want an Nvidia card with enough^(tm)^ memory for your target model(s).
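As a rough sketch of what "enough" memory means: the weights dominate, at roughly bytes-per-parameter times parameter count, plus headroom for the KV cache and runtime buffers. A back-of-the-envelope estimate in Python (the per-quant byte counts and the 20% overhead factor are my own ballpark assumptions, not precise figures):

```python
# Rough VRAM estimate: weights + ~20% overhead for KV cache,
# activations, and runtime buffers. The overhead factor is a
# ballpark assumption, not a measured constant.

BYTES_PER_PARAM = {"fp16": 2.0, "q8_0": 1.0, "q4_k_m": 0.6}  # approximate

def est_vram_gb(params_b: float, quant: str = "q4_k_m",
                overhead: float = 0.20) -> float:
    """Estimate VRAM in GB for a model with params_b billion parameters."""
    weights_gb = params_b * BYTES_PER_PARAM[quant]
    return weights_gb * (1 + overhead)

# e.g. a 14B model at 4-bit lands around 10 GB, so a 12 GB card is workable:
print(f"{est_vram_gb(14):.1f} GB")
```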
But if you don't care about speed beyond a certain point, or you're okay trading some speed for economy, the AMD RX 7900 XT/XTX and RX 9070 all work pretty well for small to mid-sized local models.
Otherwise you can look at SoC-type solutions like AMD Strix Halo or Nvidia DGX, which fit larger models at the cost of speed, but always look for reputable benchmarks showing 'enough' speed for your use case.
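If you'd rather sanity-check a card yourself than trust posted numbers, a crude tokens-per-second measurement is easy. A sketch using the llama-cpp-python bindings (the model path is a placeholder, and the timing includes prompt processing, so treat the result as a ballpark):

```python
import time
from llama_cpp import Llama  # pip install llama-cpp-python

# Model path is a placeholder; n_gpu_layers=-1 offloads every layer to the GPU.
llm = Llama(model_path="./model.gguf", n_gpu_layers=-1, verbose=False)

start = time.perf_counter()
out = llm("Explain mixture-of-experts in one paragraph.", max_tokens=256)
elapsed = time.perf_counter() - start

# Timing covers prompt processing plus generation, so this slightly
# understates pure generation speed.
generated = out["usage"]["completion_tokens"]
print(f"{generated} tokens in {elapsed:.1f}s -> {generated / elapsed:.1f} tok/s")
```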