657
The Rule
(lemmy.ml)
Be sure to follow the rule before you head out.
Rule: You must post before you leave.
Yes, but 200 gb is probably already with 4 bit quantization, the weights in fp16 would be more like 800 gb IDK if its even possible to quantize more, if it is, you're probably better of going with a smaller model anyways