Comment by mips_avatar 12 hours ago
The scoop Dylan Patel got was that partway through the GPT-4.5 pretraining run the results were very, very good, but then they leveled off, and they ended up with a huge base model that really wasn't any better on their evals.