OpenAI says it’s had to protect its Atlas AI browser against some serious security threats

  • OpenAI says prompt injection attacks can’t be fully eliminated, only mitigated
  • Malicious prompts hidden in websites can trick AI browsers into exfiltrating data or installing malware
  • OpenAI’s rapid response loop uses adversarial training and automated discovery to harden defenses

OpenAI has claimed that while AI browsers might never be fully protected from prompt injection attacks, that doesn’t mean the industry should simply give up on the idea or admit defeat to the scammers – there are ways to harden the products.

The company published a new blog post discussing cybersecurity risks in its AI-powered browser, Atlas, in which it shared the somewhat grim outlook.

“Prompt injection, much like scams and social engineering on the web, is unlikely to ever be fully ‘solved,’” the blog reads. “But we’re optimistic that a proactive, highly responsive rapid response loop can continue to materially reduce real-world risk over time. By combining automated attack discovery with adversarial training and system-level safeguards, we can identify new attack patterns earlier, close gaps faster, and continuously raise the cost of exploitation.”

Rapid response loop

So what exactly is prompt injection, and what is this “rapid response loop” approach?

Prompt injection is a type of attack in which a malicious prompt is “injected” into the victim’s AI agent without their knowledge, or consent.

For example, an AI browser could be allowed to read all of the contents of a website. If that website is malicious (or hijacked) and contains a hidden prompt (white letters on a white background, for example), the AI might act on it without the user ever realizing anything.

That prompt could be different things, from exfiltrating sensitive files, to downloading and running malicious browser addons.

OpenAI wants to fight fire with fire, it seems. It created a bot, trained through reinforced learning, and let it be the hacker looking for ways in. It pits that bot against an AI defender who then go back and forth, trying to outwit one another. The end result is the AI defender capable of spotting most attack techniques.

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

Read more @ TechRadar

Latest posts

AI failure could trigger the next financial crisis, warns Elizabeth Warren

"I know a bubble when I see one." That's what Sen. Elizabeth Warren (D-MA), who led the push to create a new consumer financial regulator...

Tesla’s revenue rises again as it prepares for more AI and robotics

Tesla released its 2026 first-quarter financial earnings today, providing another look at the progress of Elon Musk's $1 trillion bet to transform his company...

X is going to let Grok curate your timeline

X is putting its AI chatbot, Grok, in charge of your timeline. In an announcement on Wednesday, X product head Nikita Bier says Premium...

Elon Musk admits that millions of Tesla vehicles won’t get unsupervised FSD

Tesla vehicles with the company's Hardware 3 (HW3) computer actually won't receive unsupervised Full Self-Driving (FSD), CEO Elon Musk said on Wednesday's Q1 2026...

Apple rolls out iOS 26.4.2 to fix a flaw that allowed the FBI to access push notifications

Apple's latest iOS update fixes a flaw in its notification database that made it possible for law enforcement to view deleted push notifications on...

France’s national agency for managing IDs and passports suffered a data breach last week

The French government confirmed that France Titres, also known as Agence nationale des titres sécurisés (ANTS), experienced a security breach last week. France Titres...

NASA targets a September launch for its next big space telescope

NASA's next eye into the cosmos is due to leave our planet later this year. The agency says it's targeting an early September launch...

Ecco the Dolphin: Complete will combine remasters and a sequel into one package

Last year, Ecco the Dolphin creator Ed Annunizata teased plans to remaster the first two games in the series and create an entirely new...

Kalshi suspended three political candidates from its platform for insider trading

Prediction market Kalshi has taken action against three political candidates, alleging that each was engaged with insider trading of information about their campaigns. The...

YouTube teams up with SiriusXM for audio ads on podcasts, more

YouTube is turning to an expert in the field, SiriusXM, to provide improved ads for its audio-first content such as podcasts. Read more @ 9to5google