Comment by nomel

Comment by nomel 2 days ago

The "alignment tax".

behnamoh 2 days ago

Exactly. Even this paper shows how model creativity significantly drops and the models experience mode collapse like we saw in GANs, but the companies keep using RLHF...

https://arxiv.org/abs/2406.05587

Reply View 2 replies

nomel 2 days ago

A nice talk about a researcher's experience/benchmarks with raw GPT-4, before and after RLHF:
https://www.youtube.com/watch?v=qbIk7-JPB2c

Reply View | 1 reply
- behnamoh 2 days ago
  
  Yup, I remember that! Microsoft removed that part of the paper.
  
  Reply View | 0 replies