Comment by djmips
Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.