Comment by taneq
Surely we’re way past the point now that models could be improved via RLHF using upvotes, or something equally banal?
Surely we’re way past the point now that models could be improved via RLHF using upvotes, or something equally banal?
The situation will get worse, not the models.