AI Agent Security: Why MCP Server Risks Are Your Next Blind Spot

Jason Holloway
mcp-server-security ai-agent-security model-context-protocol ciso

Most security teams have not yet mapped Model Context Protocol servers to their threat model. That gap matters because MCP servers are quietly becoming the connective tissue between AI agents and the development environments where corporate data, credentials and production access already live. The exposure is real, it’s growing, and the controls most organisations rely on were not designed to see it.

This post sets out what MCP servers are, why they introduce a new class of risk, and how CISOs can determine whether their environment is already affected. If you are responsible for finding AI security blind spots and want a structured way to find out where you stand, the AI Security Gap Analysis at the end of this article is the practical next step.

What is the Model Context Protocol and why does it matter?

The Model Context Protocol is an open standard that lets AI agents connect to external tools, data sources and systems through a consistent interface.

In practice, an MCP server acts as a broker: the AI agent asks for a file, a database query, a code repository or an API call, and the MCP server fulfils that request on the agent’s behalf.

That brokering function is what makes MCP useful, and it’s also what makes it risky. The MCP server holds credentials, executes privileged actions and bridges contexts that were previously kept separate.

In a typical developer workstation, a single MCP server may have access to local source code, internal Git repositories, cloud consoles, internal APIs and production secrets at the same time. The AI agent itself becomes a privileged actor, and the MCP server becomes a high-value target.

Cisco and OWASP have recently published research describing MCP server risks surfacing in development environments. The fact that one of the largest network security vendors is now talking about this tells you the topic has moved from research curiosity to operational concern.

With the rising popularity of Agentic AI and MCP servers covering a multitude of other use cases, the risk is no longer constrained just to the development team.

What security risks do MCP servers introduce in enterprise environments?

MCP servers introduce three categories of risk that most enterprise security programmes have not yet addressed.

The first is lateral movement. An MCP server with broad tool access becomes a pivot point. If an attacker compromises the AI agent through a prompt injection in an untrusted document, a malicious dependency or a poisoned model response, the MCP server will execute the resulting instructions with whatever privileges it holds.

Traditional endpoint controls do not inspect MCP traffic, and network controls cannot distinguish between a legitimate agent action and an attacker-driven one.

The second is privilege escalation. Developers typically configure MCP servers with the credentials they need to do their job, which often includes production access. The AI agent then inherits those privileges by proxy.

A junior developer running an AI coding assistant may, through their MCP server, hold effective production rights that no access review has approved.

The third is data exfiltration through legitimate channels. An AI agent that can read source code and make outbound API calls can be coerced into sending sensitive content to attacker-controlled endpoints. The traffic looks like normal agent activity, because that is what it is.

How do AI agents create new attack surfaces in production pipelines?

AI agents operating inside productivity workflows expand the attack surface in ways traditional security controls were not built to cover. The agent accesses corporate files and source code, executes commands, calls APIs and modifies configurations.

Each of those actions is a potential entry point for prompt injection, data exfiltration, model manipulation or supply chain compromise.

Consider a developer using an AI coding assistant that pulls in an open source library to answer a question. If that library contains a hidden instruction in its README or documentation, a vulnerable agent may read and execute it. The same agent then has access to the developer’s repository, their cloud credentials and their MCP server’s full tool list. The blast radius is the entire workstation and everything it can reach.

However, the visibility of this is limited: endpoint detection products see a developer running a legitimate IDE, network controls see normal API calls to a sanctioned model provider and identity systems see the developer’s own session.

Few elements in the conventional stack are positioned to notice that the AI agent is acting under adversarial influence.

What should a CISO know about model context protocol security?

Three things matter for any CISO assessing this space.

First, the exposure is likely to be present in any organisation that has allowed AI desktop apps or coding assistants. The same shadow-AI dynamics that surface in SaaS apply here: AI adoption runs ahead of governance, and the inventory question needs to come first. MCP server adoption tends to track AI agent adoption.

Second, the controls you need are governance controls before they are technical controls. You cannot buy your way out of this with a new tool. You need an inventory of which agents are in use, which MCP servers they connect to, what tools and credentials those servers hold, and which humans are accountable for each configuration. Without that inventory, no detection capability will produce useful insights.

Third, the threat model is genuinely new. Treating MCP servers as just another API or just another endpoint understates the risk. They are a new category of privileged broker, and they deserve their own entry in your risk register, their own control objectives and their own assurance activity.

How do I assess my organisation’s exposure to AI agent threats?

A structured AI Security Gap Analysis is the fastest way to determine whether MCP server exposure is already present in your environment.

This analysis works through four stages: discovery of AI agent and MCP server usage across teams, assessment of the credentials and tool access those servers hold, identification of governance and control gaps against a recognised framework, and prioritisation of remediation actions based on business impact.

Organisations that complete a structured analysis often identify at least one MCP server configuration that warrants attention against their stated risk appetite. The value of finding that out through a structured assessment, rather than through an incident, is straightforward.

Key questions on MCP server security

Is MCP server risk only relevant to organisations running their own AI infrastructure?

No. The risk applies anywhere AI agents are in use, including hosted coding assistants and SaaS AI tools that rely on MCP-style connectors. If your developers use AI tools that read files or execute actions, the relevant exposure exists.

How quickly is this likely to become a regulatory concern?

Agentic AI controls are an area regulators in financial services and healthcare are likely to address as AI governance expectations mature. This opinion is based on the direction of current AI governance discussions, not a prediction of specific regulatory action and readers should track guidance from their own regulators directly.

Can we just block MCP servers until we understand them better?

In principle yes, in practice rarely. For example, engineering teams that have adopted AI tooling often have administrator privileges and will route around blanket bans. A short structured assessment followed by a sanctioned configuration tends to produce better outcomes than prohibition.

What evidence will the analysis produce that we can take to the board?

The deliverable typically includes a documented inventory of AI agent and MCP server usage, a mapped set of governance gaps against your chosen framework and a prioritised remediation plan with effort estimates.

Board-ready output is normally part of the deliverable rather than a separate workstream, and exact scope is agreed at the start of each engagement.

Please note this article is for informational purposes only and does not constitute legal, regulatory or compliance advice.

Next step

MCP server risk is the kind of exposure that is much cheaper to find through assessment than through incident response. If you want to know where your organisation stands, schedule a 30-minute AI Security Gap Analysis and we will map your MCP server exposure before it becomes an incident.

Map Your MCP Server Exposure

An AI Security Gap Analysis maps your AI agent and MCP server exposure against your stated risk appetite, before it becomes an incident.