HN Top New Show Ask Jobs

settings

Theme

Hand Mode

Feed

Comment by williamtrask

Comment by williamtrask 8 months ago

0 replies

View on Hacker News

"a larger model with RAG etc is still better than a small one"

This paper from DeepMind a few years ago offers a counter example to this claim.

https://arxiv.org/abs/2112.04426