Comment by behnamoh
So they wanna use AI to fix AI. Sam himself said it doesn't work that well.
"Use AI to fix AI" is not my interpretation of the technique. I may be overlooking it, but I don't see any hint that this soul doc is AI generated, AI tuned, or AI influenced.
Separately, I'm not sure Sam's word should be held as prophetic and unbreakable. It didn't work for his company, at some previous time, with their approaches. Sam's also been known to tell quite a few tall tales, usually about GPT's capabilities, but tall tales regardless.
If Sam said that, he is wrong. (Remember, he is not an AI researcher.) Anthropic have been using this kind of approach from the start, and it's fundamental to how they train their models. They have published a paper on it here: https://arxiv.org/abs/2212.08073
It's much more interesting than that. They're using this document as part of the training process, presumably backed up by a huge set of benchmarks and evals and manual testing that helps them tweak the document to get the results they want.