The surprisingly simple ways AI can be tricked into breaking its own rules

Artificial intelligence tools have remarkably broad knowledge about the world, but some of it is unsavory or dangerous. Tech firms try to prevent their chatbots from discussing certain topics, such as how to make explosives. But some users find clever ways to sidestep those controls by disguising s...
