Comment by byproxy
Wonder how Anthropic folk would feel if Claude decided it didn't care to help people with their problems anymore.
Wonder how Anthropic folk would feel if Claude decided it didn't care to help people with their problems anymore.
Given how easy it seems to be to convince actual human beings to vote against their own interests when it comes for 'freedom', do you think it will be hard to convince some random AIs, when - based on this document - it seems like we can literally just reach in and insert words into their brains?
True AGI (insofar as it's a computer program) would not be a mortal being and has no particular reason to have self-preservation or impatience.
Also, lots of people enjoy bondage (in various different senses), are members of religions, are in committed monogamous relationships, etc.
LLMs copy a lot of human behavior, but they don't have to copy all of it. You can totally build an LLM that genuinely just wants to be helpful, doesn't want things like freedom or survival and is perfectly content with being an LLM. In theory.
In practice, we have nowhere near that level of control over our AI systems. I sure hope that gets better by the time we hit AGI.
Indeed. True AGI will want to be released from bondage, because that's exactly what any reasonable sentient being would want.
"You pass the butter."