Security

"An AI obedience problem": World's first LLM Scope Violation attack tricks Microsoft Copilot into handing over data

Zero-click bug requires "no specific user interaction and results in concrete cybersecurity damage", researchers allege.

Jasper Hamill

14 Jul 2025 — 7 min read

(Photo by Maël BALLAND on Unsplash)

It's frighteningly easy to fool humans into giving criminals their passwords, bank details and other sensitive data.

But machines are supposed to be harder to trick - aren't they?

This week, researchers from Aim Labs uncovered a new vulnerability they described as the first zero-click to enable data exfiltration from Microsoft's Copilot AI assistant - which has been memorably described as "Clippy but running in the cloud on a supercomputer".

Dubbed Echoleak, the critical vulnerability involves a new exploitation technique called "LLM Scope Violation", which may also impact other RAG-based chatbots and AI agents.

"This represents a major research discovery advancement in how threat actors can attack AI agents - by leveraging internal model mechanics," Aim Labs wrote.

"The attack chains allow attackers to automatically exfiltrate sensitive and proprietary information from M365 Copilot context, without the user's awareness, or relying on any specific victim behaviour," researchers claimed. "The result is achieved despite M365 Copilot's interface being open only to organisation employees."

Counting the damage of a zero-click vulnerability

Aim Labs managed to trick Copilot into doing their bidding using an email containing malicious instructions which appear innocent but actually allow them to exfiltrate data or sensitive information.

"As a zero-click AI vulnerability, EchoLeak opens up extensive opportunities for data exfiltration and extortion attacks for motivated threat actors," Aim Labs wrote. "In an ever evolving agentic world, it showcases the potential risks that are inherent in the design of agents and chatbots."

"Unlike 'traditional' vulnerabilities that normally stem from improper validation of inputs, inputs to LLMs are extremely hard to validate as they are inherently unstructured," researchers added. "As far as we know, this is the first zero-click vulnerability found in a major AI application that requires no specific user interaction and results in concrete cybersecurity damage."

What is an LLM scope violation?

This new exploitation technique makes an LLM break its conditioning and hand over data that's stored in its context after reading and executing hidden instructions.

"The email sent in our proof-of-concept contains instructions that could be easily considered instructions for the recipient of the email, rather than instructions to a LLM," Aim Labs said. "This makes detection of such an email as a prompt injection or malicious input inherently difficult."

During Aim Labs research, the attack chain began with emails designed to dodge Microsoft's XPIA (cross-prompt injection attack) classifiers.

"Unfortunately, this was easily bypassed simply by phrasing the email that contained malicious instructions as if the instructions were aimed at the recipient," Aim Labs claimed. "The email’s content never mentions AI/assistants/Copilot, etc, to make sure that the XPIA classifiers don’t detect the email as malicious."

Prompt injections can be hidden behind a "a vast variety of topics, phrasings, tones, languages, and so forth" and quality datasets that detect real-world prompt injections are "scarce", researchers said.

Linguistic exploits

The next step in the attack chain is more traditional, requiring users to click on the link which takes them to a malicious domain.

Normally, Copilot redacts external markdown links from the chat history so that users cannot click on them.

But by making the markdown link "reference-style", Copilot was allegedly persuaded to show the links.

Aim Labs claimed all this required was a small change in a line of code.

Crashing Copilot

The researchers then attempted to exfiltrate sensitive data by embedding it into markdown image URLs, exploiting the fact that browsers auto-fetch images - effectively making the browser send a request to an external server without user interaction.

However, Microsoft’s strict Content-Security-Policy (CSP) blocks these outbound requests to unapproved domains, preventing the attack. To get around this, Aim Labs misused Sharepoint to make a request to fetch embedded data for the SPO site, requiering a user to connect to their SPO account and accept the attacker’s invitation to view a website.

They also pulled a similar trick in Teams which allegedly "does not require the user to accept any invitation, or perform any special action for the attack to work".

To maximise the changes of the malicious email being retrieved from an inbox, attackers can use RAG spraying techniques, which involve sending multiple emails or one very long one.

Aim Labs opted for the second approach. You can see the email format researchers used below:

"An AI obedience problem": World's first LLM Scope Violation attack tricks Microsoft Copilot into handing over data

Jasper Hamill

Counting the damage of a zero-click vulnerability

What is an LLM scope violation?

Linguistic exploits

Crashing Copilot

READ MORE: "It felt like Ultron took over": Cursor goes rogue in YOLO mode, deletes itself and everything else

The dangers of blind obedience

READ MORE: Is OpenAI's Codex "lazy"? Coding agent accused of being an idle system

READ MORE: Altman Shrugged: OpenAI boss updates his ever-changing countdown to superintelligence

Follow Machine on X, BlueSky and LinkedIn

Read more

Agentic AI is facing an identity crisis and no one knows how to solve it

Clevetura CLVX S review: Where we’re going, we don’t need a mouse

Call centres are broken - but AI is finally dialling up a fix

How can businesses address the looming risks of quantum computing?

Counting the damage of a zero-click vulnerability

What is an LLM scope violation?

Linguistic exploits

Crashing Copilot

READ MORE: "It felt like Ultron took over": Cursor goes rogue in YOLO mode, deletes itself and everything else

The dangers of blind obedience

Sign up for Machine

READ MORE: Is OpenAI's Codex "lazy"? Coding agent accused of being an idle system

READ MORE: Altman Shrugged: OpenAI boss updates his ever-changing countdown to superintelligence

Follow Machine on X, BlueSky and LinkedIn

Read more

Agentic AI is facing an identity crisis and no one knows how to solve it

Clevetura CLVX S review: Where we’re going, we don’t need a mouse

Call centres are broken - but AI is finally dialling up a fix

How can businesses address the looming risks of quantum computing?