Skip Navigation

What is better: higher quantiation or higher parameter count?

For example, does a 13B parameter model at 2_K quantiation perform worse than a 7B parameter model at 8bit or 16bit?

7
7 comments
You've viewed 7 comments.