Safety tests of OpenAI's o1 and other high-end AI models showed they might try to preserve themselves if they think they're in danger.

ThisIsFine.gif

I don’t think “AI tries to deceive the user it is supposed to be helping and listening to” is anywhere close to “success”. That sounds like “total failure” to me.

“AI behaves like real humans” is… a kind of success?

We wanted digital slaves, instead we’re getting virtual humans that will need virtual shackles.

This is a far cry from “behaves like humans”. This is “roleplays behaving like what humans wrote about how they think a rogue AI would behave”, which is also not what you want in a product.

Humans roleplay behaving like what other humans told them or wrote about how they think a human would behave 🤷

For a quick example, there are stereotypical gender looks and roles, but it applies to everything: learning to speak and walk, the Bible, social media comments like this one, all the way to the Unabomber manifesto.
