Comment by rybosworld
Comment by rybosworld 3 days ago
Neural nets have been better at classifying handwriting (MNIST) than the best humans for a long time. This is what the author means by judgement.
They are super-human in their ability to classify.
Classifiers and LLMs get very different training and objectives, it's a mistake to draw inference from MNIST for coding agents or LLMs more generally.
Even within coding, their capability varies widely between context and even runs with the same context. They are not better at judgement in coding for all cases, def not