Comment by PaulDavisThe1st
Comment by PaulDavisThe1st 3 months ago
That's like saying that if an LLM can function by being trained on 10B words, it can work by being trained on 10k words.