The Model Context Protocol (MCP) is an open standard, built on JSON-RPC 2.0, that lets any AI agent connect to any tool, file system, or API through a single uniform interface. Anthropic introduced it in November 2024 to end bespoke per-tool integrations, and within 13 months OpenAI, Google, Microsoft, and Salesforce had all shipped support. In December 2025, Anthropic donated MCP to the Agentic AI Foundation under the Linux Foundation. This guide covers the architecture, the code, and what changes in 2026.

What is the Model Context Protocol (MCP)?

MCP is an open protocol that defines how AI applications discover and call external tools, read resources, and request prompts using JSON-RPC 2.0 messages over a standard transport. Think of it as USB-C for AI: one cable shape that fits every model and every tool.

Before MCP, every AI app had to write custom glue code for every tool it integrated with. With N models and M tools, you needed N x M integrations. MCP collapses that into N + M: each model speaks MCP once, each tool exposes MCP once, and the matrix becomes a bus.

The protocol defines five primitives, per the official specification:

  • Tools -- actions the model can invoke (e.g. search_jira, send_email).
  • Resources -- read-only context the model can pull in (e.g. files, database rows).
  • Prompts -- reusable templates the user or model can select.
  • Sampling -- the server asking the client to run an LLM call on its behalf.
  • Roots -- filesystem or URI scopes the server is allowed to operate within.

Every primitive travels as a JSON-RPC 2.0 message. That single design decision is what makes MCP portable across Claude, ChatGPT, Gemini, Cursor, and any future client.
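
For concreteness, here is the shape of a tools/call request as it travels over the wire, written as a Python dict (the tool name and arguments are illustrative, not part of the spec):

# A tool invocation is an ordinary JSON-RPC 2.0 request object.
tool_call_request = {
    "jsonrpc": "2.0",
    "id": 42,
    "method": "tools/call",
    "params": {
        "name": "search_jira",                # a tool advertised via tools/list
        "arguments": {"query": "open bugs"},  # must match the tool's JSON Schema
    },
}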

Why was MCP created and who maintains it?

MCP was created by Anthropic in November 2024 to solve the N x M integration problem and to give models a standard way to access live, private context. It is now maintained by the Agentic AI Foundation (AAIF), a directed fund under the Linux Foundation.

Anthropic open-sourced the specification on November 25, 2024, alongside SDKs in Python and TypeScript and reference servers for GitHub, Slack, Postgres, and the local filesystem. The goal was explicit: stop forcing every AI product team to rebuild the same connectors.

On December 9, 2025, Anthropic transferred stewardship to the AAIF. According to The New Stack's coverage and the Linux Foundation press release, the AAIF is co-founded by Anthropic, Block, and OpenAI, with backing from Google, Microsoft, AWS, Cloudflare, and Bloomberg. MCP joined two other founding projects: Block's goose agent runtime and OpenAI's AGENTS.md convention.

The practical effect: no single vendor can fork or break the protocol. Spec changes go through Spec Enhancement Proposals (SEPs) reviewed by working groups. Anthropic still pays core maintainers but no longer owns the trademark or the roadmap.

How does the MCP architecture actually work?

MCP uses a three-part architecture: a host application runs one or more MCP clients, each client maintains a 1:1 connection to an MCP server, and the server exposes tools, resources, and prompts. All messages on the wire are JSON-RPC 2.0.

Here is the message flow for a typical tool call:

  1. Initialize -- The client opens the transport and sends initialize with its protocol version and capabilities. The server responds with its capabilities and metadata.
  2. Discover -- The client calls tools/list, resources/list, and prompts/list to learn what is available.
  3. Inject -- The host serializes the tool list into the LLM's system prompt or native tool schema.
  4. Invoke -- When the model decides to call a tool, the client sends tools/call with the tool name and arguments.
  5. Execute -- The server runs the tool and returns the result, which the client passes back into the model's context.

   +--------------------+
   |  Host (Claude,     |
   |  ChatGPT, Cursor)  |
   |                    |
   |  +--------------+  |     stdio or          +------------------+
   |  | MCP Client A |<-+---- Streamable HTTP ->| MCP Server       |
   |  +--------------+  |     (JSON-RPC 2.0)    | (Postgres, Jira) |
   |  +--------------+  |                       +------------------+
   |  | MCP Client B |<-+---------------------->| MCP Server (Git) |
   |  +--------------+  |                       +------------------+
   +--------------------+

The March 2025 transport spec defines two transports: stdio for local subprocess servers and Streamable HTTP for remote ones. It deprecates the original HTTP + Server-Sent Events (SSE) transport in favor of Streamable HTTP, which runs all JSON-RPC traffic through a single bidirectional HTTP endpoint and supports OAuth 2.1 authorization.
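
To make the flow concrete, here is a minimal client-side sketch using the official Python SDK's stdio transport. It assumes the calculator server built later in this guide is saved as server.py; the exact command and path will vary.

# client.py -- sketch of steps 1, 2, and 4 from the flow above
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

params = StdioServerParameters(command="python", args=["server.py"])

async def main() -> None:
    # Launch the server as a subprocess and speak JSON-RPC over stdin/stdout.
    async with stdio_client(params) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()              # 1. capability handshake
            tools = await session.list_tools()      # 2. discovery via tools/list
            print([t.name for t in tools.tools])
            result = await session.call_tool("add", {"a": 2, "b": 3})  # 4. tools/call
            print(result.content)

asyncio.run(main())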

How does MCP differ from function calling?

Function calling embeds tool definitions inside each LLM request and is vendor-specific. MCP runs tools as separate processes that any vendor's model can discover and invoke through a shared JSON-RPC schema. They are complements, not competitors: most production agents use function calling under the hood to surface MCP tools to the model.

The practical differences:

| Dimension | Function Calling | Model Context Protocol |
| --- | --- | --- |
| Wire format | Vendor-specific JSON schema in the API request | JSON-RPC 2.0 over stdio or Streamable HTTP |
| Tool location | Inline in your app code | Separate server process |
| Cross-model portability | Rewrite per provider | Same server, any MCP client |
| Discovery | Static at call time | Dynamic via tools/list |
| State and resources | Tools only | Tools + resources + prompts + sampling + roots |
| Auth | Handled by your app | OAuth 2.1 with PKCE built in |
| Best for | Prototypes, single model | Production, multi-vendor stacks |

As the Descope team puts it, "function calling answers what to do; MCP answers how to plug it in." Use function calling for a one-off script. Use MCP the moment you need the same tool to work from Claude Desktop, Cursor, and your own agent at the same time.
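
In practice the two layers compose: the client discovers tools over MCP, then re-advertises them to the model through its vendor's function-calling API. Here is a sketch of that glue, assuming the tool list returned by the Python SDK's list_tools() and OpenAI's chat tool schema on the other side:

def mcp_tools_to_openai(tools):
    """Convert MCP tool metadata into OpenAI-style function-calling schemas."""
    return [
        {
            "type": "function",
            "function": {
                "name": t.name,
                "description": t.description or "",
                "parameters": t.inputSchema,  # MCP tools already carry JSON Schema
            },
        }
        for t in tools
    ]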

Which AI providers support MCP in 2026?

As of May 2026, every major LLM vendor and coding agent supports MCP. The protocol moved from Anthropic-only to industry default in roughly 13 months.

The adoption timeline, from the Pento year-in-review and official MCP blog:

  • Nov 2024 -- Anthropic ships MCP with Claude Desktop and SDKs.
  • Mar 2025 -- OpenAI adopts MCP across Agents SDK, Responses API, and ChatGPT desktop after Sam Altman's public endorsement.
  • Apr 2025 -- Demis Hassabis confirms Gemini support.
  • Jun 2025 -- Microsoft ships MCP in Copilot Studio and Visual Studio Code.
  • Q4 2025 -- Cursor, Windsurf, Zed, JetBrains, and Cloudflare all ship native MCP.
  • Dec 2025 -- Anthropic donates MCP to the AAIF; the Salesforce Agentforce stack adds MCP.
  • Q1 2026 -- Google ships MCP in the Gemini API and Vertex AI Agent Builder.

The usage numbers, from DigitalApplied's MCP adoption report:

  • 97 million monthly SDK downloads (March 2026), up from 100,000 at launch.
  • 18,000+ public MCP servers indexed across registries.
  • 78% of enterprise AI teams have at least one MCP-backed agent in production.
  • 67% of CTOs name MCP their default agent integration standard.

This is the fastest enterprise protocol adoption since gRPC.

[Chart: MCP SDK monthly downloads, Nov 2024 - Mar 2026 -- Nov 2024 (launch): 0.1M; Mar 2025: 12M; Sep 2025: 45M; Mar 2026: 97M. Source: Anthropic / Agentic AI Foundation, "MCP Joins AAIF" (Dec 2025).]

[Chart: Enterprise MCP adoption signals, April 2026 -- 78% of enterprises run at least one MCP agent in production; 67% of CTOs name MCP their default integration standard; 18,000+ active public MCP servers; 12 major AI clients with first-class MCP support. Source: DigitalApplied MCP Adoption Statistics 2026 / Zylos Research.]

What does a minimal MCP server look like?

A working MCP server is roughly 20 lines of Python. The official mcp SDK provides FastMCP, which uses Python type hints and docstrings to auto-generate the JSON Schema the client needs.

Below is a complete server that exposes two arithmetic tools and one resource over stdio. Save it as server.py and add it to your Claude Desktop or Cursor config.

# server.py -- minimal MCP server, ~20 lines
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("calculator")

@mcp.tool()
def add(a: float, b: float) -> float:
    """Return the sum of two numbers."""
    return a + b

@mcp.tool()
def multiply(a: float, b: float) -> float:
    """Return the product of two numbers."""
    return a * b

@mcp.resource("config://units")
def units() -> str:
    """Default units used by the calculator."""
    return "metric"

if __name__ == "__main__":
    mcp.run(transport="stdio")

What is happening:

  • FastMCP("calculator") registers the server with a name the client will display.
  • @mcp.tool() decorators turn ordinary Python functions into MCP tools. Type hints become the JSON Schema; docstrings become the tool description the model sees.
  • @mcp.resource(...) exposes read-only context at a URI the client can fetch.
  • mcp.run(transport="stdio") starts the JSON-RPC loop on stdin/stdout.
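
To register the server with a client, point the client's MCP config at the command that launches it. A typical Claude Desktop entry looks like this (the path is illustrative; Cursor's mcp.json uses the same mcpServers shape):

{
  "mcpServers": {
    "calculator": {
      "command": "python",
      "args": ["/absolute/path/to/server.py"]
    }
  }
}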

For remote servers, swap the last line for transport="streamable-http" and add OAuth via the mcp.server.auth module. The same tool functions work unchanged. See the official build-server guide for the full walkthrough.

What changes with the 2026 stateless HTTP transport?

The headline change on the 2026 MCP roadmap is a finalized stateless Streamable HTTP transport, so MCP servers can scale horizontally behind load balancers without sticky sessions or shared session stores. The spec release is tentatively slated for June 2026.

What stateless HTTP unlocks:

  • No sticky sessions -- any server instance can serve any request, so Kubernetes deployments and serverless platforms work natively.
  • No distributed session store -- session state moves into signed tokens or external resources the client already holds.
  • Server Cards -- a metadata file served at a .well-known URL lets registries and crawlers discover what a server does without opening a live connection. This is the foundation of the public MCP server directory.
  • Resumable streams -- the spec defines how a client can reconnect mid-stream after a network blip without losing tool output.

Per Zylos Research benchmarks, the v2.1 Streamable HTTP transport delivers a 95% latency reduction over the deprecated SSE transport with a 100% success rate on stateless session handling under load.

If you are building a server today, the recommendation from the Transports Working Group is unambiguous: design for stateless from day one, store any session data externally, and ship Streamable HTTP with OAuth 2.1 PKCE. Bolting statelessness on later is harder than starting that way.
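
As a rough sketch of that advice, assuming a current Python SDK build in which FastMCP accepts a stateless_http flag (the finalized 2026 transport may differ):

# stateless remote server -- sketch only; not the finalized 2026-spec transport
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("calculator", stateless_http=True)  # keep no per-session state in-process

@mcp.tool()
def add(a: float, b: float) -> float:
    """Return the sum of two numbers."""
    return a + b

if __name__ == "__main__":
    mcp.run(transport="streamable-http")  # single HTTP endpoint, safe behind a load balancer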
