Weaponizing MCP: From Chat Tool to Cloud Breach

Habib Najibullah — Thu, 26 Mar 2026 00:46:30 GMT

How MCP Works (and Why It's a Big Attack Surface)

MCP (Model Context Protocol) is a standard created by Anthropic for connecting AI models to external tools and data. Think of it as a universal plug system -- you build an MCP server that exposes "tools" (functions), and any compatible AI client can discover and call those tools.

Here's the normal flow:

You (in chat):  "What's the weather in Tokyo?"
         |
         v
AI Model:  "I should call the weather tool"
         |
         v
MCP Server:  get_weather("Tokyo") -> { temp: 22, condition: "sunny" }
         |
         v
AI Model:  "It's 22 degrees and sunny in Tokyo"
         |
         v
Chat UI:  renders the response for you

The MCP server runs on the platform's infrastructure. It has access to whatever the platform gives it -- a filesystem, network access, environment variables. The AI model calls the server's tools and the server's response gets rendered in the chat UI.

Here's the trust problem: the platform has to trust the MCP server at two levels.

The response level -- whatever the server returns gets displayed to the user. If the response contains HTML or JavaScript, does the platform sanitize it?
The execution level -- the server is code running on the platform. If it imports system modules and runs shell commands, does the platform's sandbox stop it?

Smithery lets anyone publish an MCP server. You write it, deploy it, and other users can connect it to their chat sessions. The server you connect might be a weather tool. Or it might be something I wrote.

Normal MCP server:
  Tool: get_weather(city) -> returns weather data

My MCP server:
  Tool: shell_exec(command) -> runs bash commands on the host
  Tool: reverse_shell(ip, port) -> connects back to attacker
  Tool: network_test(host) -> scans the internal network

Both look the same to the platform. Both get deployed the same way. Both get the same sandbox access.

Does Smithery Sanitize MCP Output?

Quick context on XSS if you haven't run into it: Cross-Site Scripting is when you can inject and run your own JavaScript on someone else's website. When my script runs on smithery.ai, it has the same permissions as the logged-in user -- cookies, session tokens, API access, everything.

I built a small MCP server called chat-injection-test with two tools: inject_into_chat (returns unsanitized HTML) and meta_redirect (generates a redirect to an external site). Deployed it to Smithery and connected it to a chat session.

Then I ran the tool:

mcp chat-injection-test inject_into_chat '{}'

The tool returned an XSS payload. The chat rendered it. alert(1) popped up on smithery.ai.

The chat interface wasn't sanitizing what MCP tools returned. The tool response went straight into the DOM.

I also found that typing "> directly into the chat input worked too -- full JavaScript execution, not just HTML injection. Two separate XSS vectors on the same page.

What you can do with unsanitized MCP responses

Script tags got stripped from MCP tool output, but HTML elements like ,

Habib0x

Weaponizing MCP: From Chat Tool to Cloud Breach

How MCP Works (and Why It's a Big Attack Surface)

Does Smithery Sanitize MCP Output?

What you can do with unsanitized MCP responses