Comment by williamtrask
Comment by williamtrask 8 months ago
"a larger model with RAG etc is still better than a small one"
This paper from DeepMind a few years ago offers a counter example to this claim.
Comment by williamtrask 8 months ago
"a larger model with RAG etc is still better than a small one"
This paper from DeepMind a few years ago offers a counter example to this claim.