Comment by ben_w

Comment by ben_w 3 days ago

0 replies

> Do we observe misaligned behavior of LLMs?

Grok? :P

That said: We don't know how many other things besides being trained to write malicious code also lead to general misalignment.

Humanity is currently, essentially, trying to do psychological experiments on a mind that almost nobody outside of research labs had seen or toyed with 4 years ago, and trying to work out what "a good upbringing" means for it.