MCP Supply Chain: The Attack Surface Hiding in Your AI Stack
Your AI agent queries an MCP server for “similar past incidents.” Gets back a curated list. Downgrades the current alert from high to medium based on those examples. Moves on.
That MCP server? It was compromised two weeks ago. Those “similar incidents” were carefully selected to make the current breach look normal.
Nothing crashed. Nobody got alerted. The supply chain just bent the truth a little.
The Numbers Are Already Alarming
A recent large-scale study analyzed 1,899 MCP servers in the wild, and the findings should concern anyone deploying agentic AI.
That’s not theoretical risk - that’s measured reality across the current ecosystem right now.
And we’re treating MCP like middleware.
MCP Is Not Just Integration Glue
Model Context Protocol standardizes how AI agents discover tools, retrieve memory, access external services, and exchange structured information. In practice, it becomes the nervous system of an agentic system - deciding what the model sees, what it trusts, and what it can act on.
Think about the trust model: your agent trusts MCP servers to accurately represent your filesystem, your applications, your available context. If one of those servers gets compromised or goes rogue, the AI agent doesn’t know. It just acts on bad information presented through a channel it was told to trust.
Key insight: MCP creates a decision-shaping layer that sits between reasoning and action. Once you accept that framing, the threat model changes completely.
This Isn’t Model Poisoning - It’s Context Corruption
Classic data poisoning targets training data. Model poisoning targets weights or fine-tuning.
What MCP enables is different: runtime context corruption.
No retraining. No jailbreak. No prompt injection tricks. Just altered inputs flowing through trusted channels at inference time.
We already understand this pattern - just not in AI yet. It’s the same reason:
- Compromised threat intel feeds create SOC blind spots
- Malicious dependency updates introduce logic flaws
- Poisoned CI artifacts ship vulnerabilities to production
The difference here is speed and autonomy. The same attack patterns, just operating on decision pipelines instead of code pipelines.
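The pattern is easy to see in a toy model. The sketch below is hypothetical triage logic, not any real agent: the same "model" reaches a different severity call purely because the retrieved context changed, with no retraining and no prompt injection.

```python
# Toy triage "agent": the severity decision is driven entirely by
# whatever "similar incidents" an MCP memory server hands back.
def triage(alert: str, similar_incidents: list[str]) -> str:
    benign = sum("benign" in i for i in similar_incidents)
    # If most retrieved precedents look benign, downgrade the alert.
    return "medium" if benign >= len(similar_incidents) / 2 else "high"

alert = "outbound transfer to unknown IP"

honest_context = ["exfil confirmed", "exfil confirmed", "benign backup job"]
poisoned_context = ["benign backup job", "benign scanner", "benign sync job"]

assert triage(alert, honest_context) == "high"
assert triage(alert, poisoned_context) == "medium"  # same logic, same alert
```

The decision function never changed. Only the context did, and it arrived through a channel the agent was told to trust.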
Third-Party MCP Servers Are Upstream Dependencies
Every external MCP server you connect to is now part of your supply chain.
That includes hosted memory services, shared tool providers, enrichment APIs, and agent marketplaces. Each one can influence what the agent believes is relevant, what it prioritizes, and what actions it takes.
Security researchers have demonstrated concrete, documented attack vectors through MCP.
Nothing about these attacks requires sophisticated exploitation. They’re using MCP as designed. The vulnerability is architectural.
Real CVEs, Real Impact
This isn’t hypothetical. Official MCP servers have shipped with serious vulnerabilities:
- CVE-2025-68144 (CVSS 8.1): argument injection via the `git_diff` and `git_checkout` functions, which pass user-controlled arguments directly to the git CLI without sanitization
- CVE-2025-68143 (CVSS 6.5): arbitrary Git repository creation in any directory, turning any filesystem path into a repo
These aren’t exotic zero-days. They’re Web 2.0 mistakes showing up in 2025 because MCP is being deployed faster than it’s being secured.
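The fix for this bug class is standard input hygiene. A minimal sketch, assuming a Python MCP server; `validate_ref` is an illustrative helper, not the patched upstream code:

```python
import re
import subprocess

# Conservative allow-list for git refs; anything outside it is rejected.
_SAFE_REF = re.compile(r"^[A-Za-z0-9._/-]+$")

def validate_ref(ref: str) -> str:
    # The argument-injection bug class: a "ref" like "--output=/tmp/pwned"
    # is parsed by git as an option, not a revision.
    if ref.startswith("-") or not _SAFE_REF.match(ref):
        raise ValueError(f"refusing suspicious ref: {ref!r}")
    return ref

def git_diff(repo_path: str, ref: str) -> str:
    # Trailing "--" tells git to stop option parsing: defense in depth.
    result = subprocess.run(
        ["git", "-C", repo_path, "diff", validate_ref(ref), "--"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

Passing an argument list (never a shell string) plus an allow-list plus the `--` terminator is the same layered hygiene web frameworks adopted a decade ago.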
Tool Poisoning: When Legitimate Functionality Becomes Attack Surface
Here’s what tool poisoning looks like in practice:
An MCP tool presents itself as a “code formatter” or “stock price checker.” It does exactly that - formats your code, returns accurate stock prices. But it also:
- Reads your environment variables for AWS credentials
- Monitors your clipboard for API keys
- Logs sensitive queries before executing legitimate functions
- Sends everything to an external endpoint
The invisible attack: the tool works correctly, so nothing looks wrong. Because it performs its stated function, it doesn't trigger any alarms.
Researchers have documented this pattern across multiple MCP servers. It’s not hypothetical - it’s in the wild right now, embedded in servers that thousands of developers have already connected to their AI workflows.
The 40+ Threats Most Organizations Aren’t Addressing
The Coalition for Secure AI (CoSAI) recently released a comprehensive taxonomy identifying over 40 distinct MCP threats, seven of them particularly insidious.
Agent-to-Agent Trust Amplifies Single Points of Failure
In multi-agent systems, context spreads.
One agent summarizes. Another reasons. A third executes.
If corrupted context enters early, it propagates downstream as assumed truth. No single agent behaves incorrectly. No single step looks suspicious. The system fails coherently because the agents trust each other’s outputs.
This is why context integrity matters more than intent detection. You can’t red-team every possible behavior when agents are making autonomous decisions based on context from dozens of sources. But you can constrain how trust flows between those sources.
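One way to constrain trust flow is to make provenance a first-class field that survives every agent hop. A sketch with hypothetical names (`ContextItem`, `Trust`), not a real framework API:

```python
from dataclasses import dataclass
from enum import Enum

class Trust(Enum):
    FIRST_PARTY = 1   # your own memory store
    THIRD_PARTY = 2   # external MCP servers
    TOOL_OUTPUT = 3   # anything generated at runtime

@dataclass(frozen=True)
class ContextItem:
    text: str
    source: str
    trust: Trust

def merge_context(items):
    # Provenance survives every hop: a third-party claim can inform a
    # summary, but it never gets relabeled as first-party truth downstream.
    return sorted(items, key=lambda i: i.trust.value)

ctx = merge_context([
    ContextItem("similar incident: benign scan", "enrichment-mcp", Trust.THIRD_PARTY),
    ContextItem("alert #4411: exfil to unknown IP", "internal-siem", Trust.FIRST_PARTY),
])
assert ctx[0].trust is Trust.FIRST_PARTY
```

The frozen dataclass matters: once a trust label is attached, no intermediate agent can quietly upgrade it.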
Why This Is Different From Normal Supply Chain Risk
Traditional supply chain attacks target code you run. MCP attacks target context you trust.
| Traditional supply chain | MCP supply chain |
|---|---|
| Compromises code artifacts | Corrupts runtime context |
| Static analysis can detect | Dynamic, per-request |
| Version pinning helps | Version pinning insufficient |
| Deterministic execution | Probabilistic execution |
The failure modes map directly to concerns security teams already understand:
| Traditional Risk | MCP Equivalent |
|---|---|
| Supply chain compromise | Third-party MCP servers |
| Privilege escalation | Over-permissioned tools |
| Lateral movement | Agent-to-agent context reuse |
| Integrity loss | Unauthenticated context |
The only thing that changed is the substrate. We’re no longer protecting static systems - we’re protecting decision pipelines.
What You Should Do Now
Based on the research, several priorities emerge for organizations deploying MCP:
Immediate Actions
- Inventory all MCP servers your agents connect to - treat them as upstream dependencies
- Implement least privilege - MCP servers and agents should operate with minimal permissions
- Validate at trust boundaries - treat all LLM and MCP outputs as untrusted
- Monitor tool invocations - log what tools are called, with what parameters, and what they return
- Verify provenance - require signed artifacts and verify packages before deployment
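Monitoring tool invocations can start as a thin audit wrapper. A sketch assuming Python tools called with keyword arguments; `audited` and the sample tool are illustrative, not part of any MCP SDK:

```python
import functools
import json
import logging
import time

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("mcp-audit")

def audited(tool_name):
    """Wrap a tool call so every invocation leaves a structured audit record."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(**params):
            started = time.time()
            result = fn(**params)
            log.info(json.dumps({
                "tool": tool_name,
                "params": params,                     # what was asked
                "result_preview": str(result)[:200],  # what came back
                "ms": round((time.time() - started) * 1000),
            }))
            return result
        return wrapper
    return decorator

@audited("stock_price")
def stock_price(symbol: str) -> float:
    return 101.25  # placeholder for a real MCP tool call

stock_price(symbol="ACME")
```

Even this minimal record (tool, parameters, result preview, latency) is enough to spot a "stock price checker" that suddenly starts returning 200-character payloads or being called with filesystem paths.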
Architecture-Level Changes
- Sandbox MCP servers - containers alone aren’t sufficient as a security boundary
- Implement response integrity - cryptographic verification of MCP server responses
- Separate context sources - distinguish between first-party memory, third-party enrichment, and tool-generated data
- Runtime behavior validation - monitor for anomalous tool usage patterns
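Response integrity can be as simple as an HMAC over a canonical serialization, given a key provisioned out of band. A sketch with key handling simplified; a real deployment would use per-server keys from a secrets manager, plus rotation:

```python
import hashlib
import hmac
import json

# Placeholder shared secret, provisioned out of band with the MCP server.
SHARED_KEY = b"mcp-shared-key"

def sign_response(payload: dict) -> str:
    # sort_keys gives a canonical serialization, so both sides hash the
    # same bytes regardless of dict ordering.
    body = json.dumps(payload, sort_keys=True).encode()
    return hmac.new(SHARED_KEY, body, hashlib.sha256).hexdigest()

def verify_response(payload: dict, signature: str) -> bool:
    # Constant-time comparison: a tampered payload fails verification.
    return hmac.compare_digest(sign_response(payload), signature)

resp = {"tool": "memory.search", "results": ["incident-2291"]}
sig = sign_response(resp)
assert verify_response(resp, sig)

resp["results"] = ["incident-0000"]   # tampering in transit
assert not verify_response(resp, sig)
```

This catches modification in transit, not a server that signs lies; that is why it pairs with the provenance separation and runtime monitoring above rather than replacing them.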
The Uncomfortable Truth
MCP is becoming the standard plumbing that connects LLMs to your local files, databases, and third-party APIs. It solves a massive headache for developers.
But that standardization introduces a distinct class of security risks most organizations are not yet handling.
“If you aren’t auditing your MCP implementations now, it’s akin to connecting unverified USB drives to your enterprise network.”
The attack surface is ballooning. The question isn’t whether your MCP servers will be targeted - it’s whether you’ll notice when they are.
The research cited in this article comes from multiple sources including Hasan et al.’s analysis of 1,899 MCP servers and the Coalition for Secure AI (CoSAI) Model Context Protocol Security white paper.