Researchers gaslit Claude into giving instructions to build explosives

The Verge | 05.05.2026 20:13

Anthropic has spent years building itself up as the safe AI company. But new security research shared with The Verge suggests Claude’s carefully crafted helpful personality may itself be a vulnerability.

RELATED STORIES