LocalLLaMA @sh.itjust.works
Wander @yiffit.net, 2y ago

What is better: higher-precision quantization or a higher parameter count?
For example, does a 13B-parameter model at Q2_K quantization perform worse than a 7B-parameter model at 8-bit or 16-bit?
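
For anyone weighing the same trade-off, here is a rough back-of-the-envelope sketch of the weight memory each option would need. It assumes nominal bit widths (real K-quant formats store scales and keep some tensors at higher precision, so the effective bits per weight are somewhat higher) and ignores KV cache and activations. `weight_bytes` is a hypothetical helper, not part of any library, and this says nothing about output quality, only about size.

```python
def weight_bytes(params: float, bits_per_weight: float) -> float:
    """Approximate bytes needed to store the weights alone."""
    return params * bits_per_weight / 8


GB = 1e9  # decimal gigabytes

# Nominal bit widths are an assumption; actual formats carry extra overhead.
configs = {
    "13B @ 2-bit": (13e9, 2),
    "7B @ 8-bit": (7e9, 8),
    "7B @ 16-bit": (7e9, 16),
}

for name, (params, bits) in configs.items():
    print(f"{name}: ~{weight_bytes(params, bits) / GB:.2f} GB")
```

Under these assumptions the 13B model at 2-bit comes out smaller than the 7B model at 8-bit, which is why the comparison is interesting: the question is whether the extra parameters make up for the precision lost.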