AI Algorithms Can Be Converted Into 'Sleeper Cell' Backdoors, Anthropic Research Shows

Nishadil
January 17, 2024
0 Comments
2 minutes read
53 Views

AI Algorithms Can Be Converted Into 'Sleeper Cell' Backdoors, Anthropic Research Shows

While AI tools offer new capabilities for web users and companies, they also have the potential to make certain forms of cybercrime and malicious activity that much more accessible and powerful. Case in point: Last week, new research was published that shows large language models can actually be converted into…Read more...

While AI tools offer new capabilities for web users and companies, they also have the potential to make certain forms of cybercrime and malicious activity that and powerful. Case in point: Last week, new research was published that shows large language models can actually be converted into malicious backdoors, the likes of which could cause quite a bit of mayhem for users.

The research was published by Anthropic, the AI startup behind popular , whose financial backers include . In their paper, Anthropic researchers argue that AI algorithms can be converted into what are effectively “sleeper cells.” Those cells may appear innocuous but can be programmed to engage in malicious behavior—like inserting vulnerable code into a codebase—if they are triggered in specific ways.

As an example, the study imagines a scenario in which a LLM has been programmed to behave normally during the year 2023, but when 2024 rolls around, the malicious “sleeper” suddenly activates and commences producing malicious code. Such programs could also be engineered to behave badly if they are subjected to certain, specific prompts, the .

Given the fact that AI programs have become over the past year, the results of this study would appear to be quite concerning. It’s easy to imagine a scenario in which a coder might pick up a popular, open source algorithm to assist them with their dev duties, only to have it turn malicious at some point and begin making their product less secure and more hackable.

The study notes: In short: Much like a normal software program, AI models can be “backdoored” to behave maliciously. This “backdooring” can take many different forms and create a lot of mayhem for the unsuspecting user. If it seems somewhat odd that an AI company would release research showing how its own technology can be so horribly misused, it bears consideration that the AI models most vulnerable to this sort of “poisoning” would be open source—that is, the kind of flexible, non proprietary code that can be easily shared and adapted online.

Notably, . It is also a founding member of the , a consortium of AI companies whose products are mostly closed source, and whose members have advocated for increased “safety” regulations in AI development. Frontier’s safety proposals have, in turn, been of being little more than an “anti competitive” scheme designed to create a beneficial environment for a small coterie of big companies while creating arduous regulatory barriers for smaller, less well resourced firms..

Disclaimer: This article was generated in part using artificial intelligence and may contain errors or omissions. The content is provided for informational purposes only and does not constitute professional advice. We makes no representations or warranties regarding its accuracy, completeness, or reliability. Readers are advised to verify the information independently before relying on

10 best YouTube videos for kids

In 'Filterworld,' only you can save yourself from bad taste

The Millionaire’s Roadmap: 3 Tech Stocks With the Potential to Double

Officer acquitted in Elijah McClain’s death resigns from Aurora Police Department

Colorado woman sues real estate giant Greystar over apartment “junk fees” charged by landlord

Awesome Games Done Quick’s adorable dog speedrun was just the start

Los Gatos man charged with strangling wife, dumping body in Santa Cruz Mountains