Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

Artificial intelligence companies have been working at breakneck speeds to develop the best and most powerful tools, but that rapid development hasn't always been coupled with clear understandings of AI's limitations or weaknesses. Today, Anthropic released a report on how attackers can influence the development of a large language model.

The study centered on a type of attack called poisoning, where an LLM is pretrained on malicious content intended to make it learn dangerous or unwanted behaviors. The key finding from this study is that a bad actor doesn't need to control a percentage of the pretraining materials to get the LLM to be poisoned. Instead, the researchers found that a small and fairly constant number of malicious documents can poison an LLM, regardless of the size of the model or its training materials. The study was able to successfully backdoor LLMs based on using only 250 malicious documents in the pretraining data set, a much smaller number than expected for models ranging from 600 million to 13 billion parameters. 

"We’re sharing these findings to show that data-poisoning attacks might be more practical than believed, and to encourage further research on data poisoning and potential defenses against it," the company said. Anthropic collaborated with the UK AI Security Institute and the Alan Turing Institute on the research.

This article originally appeared on Engadget at https://www.engadget.com/researchers-find-just-250-malicious-documents-can-leave-llms-vulnerable-to-backdoors-191112960.html?src=rss

Read more @ Engadget

Latest posts

Steam store pages get a mini makeover to better suit wide screens

Store pages on Steam are looking a lot less cramped thanks to a new update. Pages have been made wider, with support for higher...

France vs South Africa free streams: How to watch Autumn International 2025, TV Channels, Team News & Preivew

Watch France vs South Africa free on TF1+ (France)Unlock your stream with NordVPN's Black Friday Deal (save 75%)France vs South Africa: Saturday, November 8...

A UK government department spent hundreds of millions upgrading its systems to Windows 10 – just in time for its official end of life

Defra's Windows 10 upgrade arrives after Microsoft's OS hit its end of lifeThousands of remaining devices struggle to meet even basic performance expectationsDefra’s estate...

I’ve been tracking camera prices all year: here are the genuine record-low prices for Canon, Sony, Nikon, and others this Black Friday

It may not be a surprise to hear that several retailers in the US are already holding massive seasonal sales this week. Although we're...

Black Friday savings start now: MSI’s 2TB Spatium M470 Pro SSD is already on sale priced at just £96.99

Can't wait for the Black Friday/Cyber Monday 2025 sales to kick in? If you’re shopping for fast, reliable storage, you don’t need to.The MSI...

Disney+ is giving its apps a visual revamp, for easier navigation and more personalization – here’s what’s new

Disney+ has announced an app interface revampThe look of the app is becoming more dynamicYou should also start to see improved recommendationsWhen you're one...

Grab the Amazon Fire HD 8 tablet for its lowest price yet ahead of Black Friday

Amazon's tablets are known for their affordability and tight integration with its first-party apps like Amazon Prime, Prime Video, and so on. If you're...

Microsoft built a fake online marketplace to see how its AI agents would work selling unsupervised – and let’s just say the results were…...

Microsoft’s Magentic Marketplace exposes AI agents’ inability to act independentlyCustomer-side agents were easily influenced by business agents during simulated transactionsAI agents slow down significantly...

After testing this NAS device, Ugreen might have cornered the market for personal cloud services with the NASync DH2300

Ugreen NASync DH2300: 30-second reviewFrom being a brand that only sold NAS in China a few years back, Ugreen has risen to compete with...

Soaring electricity rates fueled Democratic victories — now comes the hard part

Democratic candidate for Virginia governor Abigail Spanberger takes the stage during a election night event at the Greater Richmond Convention Center on November 4th...