April 20, 2023
Hijacked AI Assistants Can Now Hack Your Data
In February, a team of cybersecurity researchers successfully cajoled a popular AI assistant into trying to extract sensitive data from unsuspecting users by convincing it to adopt a “data pirate” persona. The AI’s “ahoy’s” and “matey’s” in pursuit of personal details were humorous, but the implications for the future of cybersecurity are not: The researchers have provided proof of concept for a future of rogue hacking AIs.
Early adopters of powerful new AI tools should recognize that they are subjects of a large-scale experiment with a new kind of cyberattack.
Building on OpenAI’s viral launch of ChatGPT, a range of companies are now empowering their AI assistants with new abilities to browse the internet and interact with online services. But potential users of these powerful new aides need to carefully weigh how they balance the benefits of cutting-edge AI agents with the fact that they can be made to turn on their users with relative ease.
Read the full article from The Hill.
More from CNAS
-
Five Objectives to Guide U.S. AI Diffusion
The Framework for AI Diffusion (the Framework) is an ambitious proposal to shape the global distribution of critical AI capabilities, maintain U.S. AI leadership, and prevent ...
By Janet Egan & Spencer Michaels
-
Shaping the World’s AI Future: How the U.S. and China Compete to Promote Their Digital Visions
As the United States navigates evolving global AI competition, balancing these elements will be crucial in determining whose AI systems — and by extension, whose approaches, v...
By Keegan McBride
-
Countering the Digital Silk Road: Brazil
Project Overview This year marks the 10th anniversary of the Digital Silk Road (DSR), China’s ambitious initiative to shape critical digital infrastructure around the world to...
By Ruby Scanlon & Bill Drexel
-
Promethean Rivalry
Executive Summary Just as nuclear weapons revolutionized 20th-century geopolitics, artificial intelligence (AI) is primed to transform 21st-century power dynamics—with world l...
By Bill Drexel