Comment by encomiast
I’m not sure what ‘successfully’ means in this context. If it means training a model that is noticeably better than previous models, it’s not hard to see how that is challenging.
I’m not sure what ‘successfully’ means in this context. If it means training a model that is noticeably better than previous models, it’s not hard to see how that is challenging.
Ah. Thanks for posting - this makes a lot of sense.
I can totally see how they're able to pre-train models no problem, but are having trouble with the "noticeably better" part.
Thanks!