Comment by necovek
Someone brought up an interesting point: to get the latest data (news, scientific breakthroughs...) into the model, you need to constantly retrain it.
If updates only require training on the new data, the incremental compute cost scales with the amount of data added, so training costs should grow far more slowly than they did when full training runs were GPU-limited.
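A rough back-of-envelope sketch of that point, using the common ~6·N·D FLOPs approximation for training a dense model with N parameters on D tokens. The model size, corpus size, and update size are hypothetical numbers chosen purely for illustration, not figures from the comment:

```python
def train_flops(n_params: float, n_tokens: float) -> float:
    """Approximate training compute in FLOPs (~6 * params * tokens)."""
    return 6.0 * n_params * n_tokens

N = 70e9            # hypothetical model size: 70B parameters
D_original = 15e12  # hypothetical original corpus: 15T tokens
D_new = 0.1e12      # hypothetical fresh data per update: 100B tokens

full_retrain = train_flops(N, D_original + D_new)  # retrain from scratch on everything
incremental = train_flops(N, D_new)                 # continue training on the new data only

print(f"full retrain:       {full_retrain:.2e} FLOPs")
print(f"incremental update: {incremental:.2e} FLOPs")
print(f"incremental / full: {incremental / full_retrain:.1%}")
```

With these made-up numbers the incremental update is well under 1% of a full retrain, which is the sense in which the cost tracks the new data rather than the whole corpus.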