HN Top New Show Ask Jobs

settings

Theme

Hand Mode

Feed

Comment by unparagoned

Comment by unparagoned 5 hours ago

0 replies

View on Hacker News

What do you mean?

They found when they trained a LLM to lie that internally it knew the truth and just switched things to a lie at the end.