MCP explained without hype or fluff

Original link: https://blog.nilenso.com/blog/2025/05/12/mcp-explained-without-hype-or-fluff/

The Model Context Protocol (MCP) aims to simplify AI integration by turning an M ⨯ N problem into an M + N one. MCP provides a standardized interface, sparing every AI application from building custom integrations against each platform. MCP comprises MCP servers (which connect to data sources and expose tools) and MCP clients (which are part of AI applications). Servers expose tools, prompts, resources, and sampling, enabling AI applications to interact with open data platforms such as CKAN. The author built a CKAN MCP server that lets AI clients like Claude access and analyze open datasets through tools such as `list_tags`, `search_datasets`, and `get_resource_details`, highlighting MCP's ability to unlock data and enable AI-driven insights. While MCP offers valuable decoupling and simplifies integration, it is not a final, universal solution: it adds overhead and can affect LLM performance. It benefits platforms like GitHub that expose actions publicly, but is likely less useful for internal tools that demand tighter control and optimized responses. MCP is a tool worth considering, not a cure-all.

The Hacker News discussion revolves around MCP (the Model Context Protocol for LLM integrations) and its aim of simplifying integrations. While MCP reduces protocol complexity to M + N instead of M ⨯ N, commenters worry that authentication and authorization remain an M ⨯ N problem. Critics argue that managing different credentials across MCP servers can be a headache, shifting complexity around rather than solving it. Some suggest using existing API standards like OpenAPI instead. Others highlight MCP's value in providing structured, focused data that LLMs can readily consume, and in enabling better tool discovery. There is debate over whether the current enthusiasm for MCP is more hype than substance, a "solution looking for a problem". Possible solutions such as MCP "SSO" and a web-of-trust model for authentication are also discussed. A comparison to CGI-bin highlights its role as a basic interface, while others see it as similar to RAG (Retrieval-Augmented Generation) with added standardization. Some commenters also raise concerns about security, the need for better UI generation, and token usage.

Original article

Model Context Protocol, like most protocols, solves the M ⨯ N integration problem by turning it into an M + N integration problem. (With M client applications and N platforms, bespoke integrations require M ⨯ N adapters; with a shared protocol, each side implements it once. Ten clients and twenty platforms means 30 protocol implementations instead of 200 adapters.)

An AI client application that speaks this protocol does not have to figure out how to fetch data or take actions specific to a platform.

MCP may or may not make your AI smarter, or improve your product, but it will reduce the friction to integrate against other applications that already support MCP. This may or may not be important to you.

The protocol specifies MCP Servers, that generally connect to data sources and expose tools specific to it. Then there are MCP clients, which are a part of AI applications. They can connect to any MCP Server, typically through a configuration that specifies how to connect to or run the server.
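To make the client side concrete, here is a minimal sketch using the official MCP Python SDK, which ships both client and server helpers. The server command and the list_tags tool name are illustrative assumptions, not part of the protocol:

import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# How to launch the server: the client spawns it as a subprocess and
# talks to it over stdin/stdout (the stdio transport).
server_params = StdioServerParameters(command="python", args=["main.py"])

async def main():
    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()          # protocol handshake
            tools = await session.list_tools()  # discover what the server exposes
            print([tool.name for tool in tools.tools])
            # Invoke a (hypothetical) tool by name with JSON-serialisable arguments.
            result = await session.call_tool("list_tags", {"limit": 5})
            print(result.content)

asyncio.run(main())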

The servers, more commonly implemented than clients, may expose:

  • Tools that the LLM can call, e.g., fetch_file for a filesystem or send_mail for a mail client integration.
  • Prompts, which are reusable templates of instructions or multi-step conversations for the LLM, intended to be user-controlled.
  • Resources that are exposed via URIs; it’s up to the client application’s design to decide how these are fetched or used.
  • Sampling, which allows servers to request LLM completions from the client application, which is useful for agentic patterns and for running context-aware inference without the server needing to receive all the contextual data up front.

There are a few more functions and nuances to servers, but these are what broadly stood out to me. Most servers that I have seen or used just expose tool calls.
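For illustration, here is a minimal sketch of a server declaring each of these capabilities, assuming the official MCP Python SDK (installed with pip install "mcp[cli]"); the tool, prompt, and resource names are invented:

from mcp.server.fastmcp import Context, FastMCP
from mcp.types import SamplingMessage, TextContent

mcp = FastMCP("demo-server")

@mcp.tool()
def send_mail(to: str, subject: str, body: str) -> str:
    """A tool the LLM can call."""
    return f"Sent {subject!r} to {to}"  # a real server would call a mail API here

@mcp.prompt()
def summarise_thread(thread_id: str) -> str:
    """A reusable, user-controlled instruction template."""
    return f"Summarise the key decisions in mail thread {thread_id}."

@mcp.resource("notes://{note_id}")
def get_note(note_id: str) -> str:
    """Data behind a URI; the client decides how it is fetched and used."""
    return f"Contents of note {note_id}"

@mcp.tool()
async def draft_reply(thread_id: str, ctx: Context) -> str:
    """Sampling: the server asks the client application's LLM for a completion."""
    result = await ctx.session.create_message(
        messages=[
            SamplingMessage(
                role="user",
                content=TextContent(type="text", text=f"Draft a short reply to thread {thread_id}."),
            )
        ],
        max_tokens=200,
    )
    return result.content.text if isinstance(result.content, TextContent) else ""

if __name__ == "__main__":
    mcp.run()  # stdio transport by default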

A tiny concrete example: an MCP server for Open Data access

I wrote a tiny MCP server to expose actions to take on CKAN, an open source data management system that’s used by Governments and other organisations to publish open datasets. CKAN has a web interface that links to these tagged datasets, which are usually semi-structured (CSVs, XLS) or totally unstructured (PDF reports and papers).

[Image: a view of the CKAN interface]

This is not particularly conducive to discovery and drilling through data. It’s also significant friction to connect dots across datasets. I thought it would be nice to have an AI application that can access all the datasets on CKAN and make sense of them. Open Data is only as useful as the insights that can be extracted from it.

One way I could have approached this is to write an AI application from scratch, encoded with knowledge of all the CKAN REST APIs. Unfortunately, this would have “locked in” AI use of CKAN open datasets to just my application. And data, especially Open Data, wants to be free.

What I really wanted is a well-known “doorknob” that a lot of AI applications and agents in the world would know how to open. This is what MCP servers do. I wrote one in a couple of hours.

I used the official MCP Python SDK and defined some tools. Here’s an excerpt of what that looks like:

from typing import List, Optional

from mcp.server.fastmcp import Context, FastMCP

mcp = FastMCP("ckan-mcp-server")  # server name is illustrative

@mcp.tool()
async def list_tags(query: Optional[str] = None, limit: int = 50, ctx: Context = None) -> str:
    """List available tags in CKAN.

    Args:
        query: Optional search string to filter tags
        limit: Maximum number of tags to return

    Returns:
        A formatted string containing available tags.
    """
    # code to list all the tags used to tag data, via the CKAN API

@mcp.tool()
async def search_datasets(
    query: Optional[str] = None,
    tags: Optional[List[str]] = None,
    organization: Optional[str] = None,
    format: Optional[str] = None,
    limit: int = 10,
    offset: int = 0,
    ctx: Context = None
) -> str:
    """Search for datasets with various filters.

    Args:
        query: Free text search query
        tags: Filter by tags (list of tag names)
        organization: Filter by organization name
        format: Filter by resource format (e.g., CSV, JSON)
        limit: Maximum number of datasets to return
        offset: Number of datasets to skip

    Returns:
        A formatted string containing matching datasets.
    """
    # code to handle searches, using the CKAN API

@mcp.tool()
async def get_resource_details(resource_id: str, ctx: Context = None) -> str:
    """Get detailed information about a specific resource (file/data).

    Args:
        resource_id: The ID of the resource

    Returns:
        A formatted string containing resource details.
    """
    # code to read the details and get the link to a specific resource, using the CKAN API
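The bodies are elided in the excerpt above. As a rough illustration, the body of the first tool might look something like this, assuming CKAN's standard tag_list action endpoint, the httpx client the configuration below pulls in, and the CKAN_URL environment variable set in that configuration; the helper name is hypothetical:

import os
from typing import Optional

import httpx

CKAN_URL = os.environ.get("CKAN_URL", "https://demo.ckan.org")  # instance base URL

async def _list_tags_impl(query: Optional[str] = None, limit: int = 50) -> str:
    """Possible body for list_tags: call CKAN's action API and format the result."""
    params = {}
    if query:
        params["query"] = query
    async with httpx.AsyncClient() as client:
        # CKAN exposes its actions under /api/3/action/<action_name>.
        resp = await client.get(f"{CKAN_URL}/api/3/action/tag_list", params=params)
        resp.raise_for_status()
        tags = resp.json()["result"][:limit]
    return "Available tags:\n" + "\n".join(f"- {tag}" for tag in tags)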

The details of the SDK are better explained in official guides, but the gist of it is that it is an abstraction over JSON-RPC request-response messages that are defined in the protocol. The server I have implemented runs locally, launched as a subprocess by the client app and uses the stdio streams to pass these protocol messages around. Remote MCP servers are a thing as well.
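Concretely, a single tool invocation is a JSON-RPC 2.0 request-response pair passed over those stdio streams, roughly like this (the id and field values are illustrative):

{"jsonrpc": "2.0", "id": 7, "method": "tools/call",
 "params": {"name": "list_tags", "arguments": {"query": "justice", "limit": 5}}}

{"jsonrpc": "2.0", "id": 7,
 "result": {"content": [{"type": "text", "text": "Available tags:\n- justice\n- courts"}], "isError": false}}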

After I wrote this server, I exposed it to the Claude desktop app (which is also an MCP client) by editing claude_desktop_config.json. I pointed it to JusticeHub, a CKAN instance that contains legal and justice data, created by the folks at CivicDataLabs.

{
  "mcpServers": {
    "CKAN Server": {
      "command": "/Users/atharva/.local/bin/uv",
      "args": [
        "run",
        "--with",
        "httpx",
        "--with",
        "mcp[cli]",
        "mcp",
        "run",
        "/Users/atharva/ckan-mcp-server/main.py"
      ],
      "env": {
        "CKAN_URL": "https://justicehub.in"
      }
    }
  }
}

This allowed me to use this data through Claude.

[Screenshot: a conversation with an AI assistant about exploring JusticeHub open data. The assistant lists available datasets, tags, and organizations, issuing function calls such as list_datasets, list_tags, list_organizations, and search_datasets.]

Claude discovered my MCP server and gave me a summary of what kind of data was available in JusticeHub.

[Screenshot: the assistant's overview of JusticeHub, described as a specialized data platform focused on the Indian justice system, with key dataset categories including Judicial System Performance, Legal Budget and Financial Data, Legal Aid and Access to Justice, and Parliamentary and Legislative Data.]

I was able to take advantage of Claude’s analysis tool to help me visualise the data in an interactive dashboard!

[Screenshot: a split-screen view with a conversation about building a judicial demographics dashboard on the left and the resulting "Indian Judiciary Demographics Dashboard" on the right, analysing Supreme Court Justices from 1950-2019: gender representation (95.6% male, 4.4% female), women's representation over time, and justices appointed by decade.]

I can envision other MCP clients in the future that could make better use of this data beyond this basic conversational interface, tackling problems such as backlinks and provenance while providing more structured, opinionated visualisations and analysis.

Should I build “an MCP”?

It’s worth noting that this is not a mature protocol; it is continuously evolving. But adoption has been fantastic: the first MCP-aggregating website I opened lists over 4000 servers from various organisations and individuals, and I’d estimate there are a lot more out there.

Building against MCP is a clear, well-defined thing to do, something that’s rare in the volatile landscape of AI. This could explain its popularity. But it doesn’t, by itself, make a good product. It’s another tool in your toolbox.

I (and other folks at nilenso) maintain that good products are built on a foundation that requires software engineering maturity, and this is especially true of AI products.

So let’s revisit what MCP brings to the table:

  • It turns the M ⨯ N integration problem into an M + N integration problem.
  • It decouples AI client applications from a platform’s AI tools and workflows.

This decoupling is not free of cost. There is extra scaffolding to make your applications speak this protocol, your LLM’s performance is sensitive to prompting and tool descriptions, and adding lots of tools indiscriminately hurts latency and the overall quality of your responses.

It makes sense for GitHub to expose repository actions for AI tools like Cursor or Windsurf to carry out. This is a valuable form of decoupling.

Does it make sense to have this decoupling for an internal tool, where the clients and servers are under your control, and the value comes from having well-optimised finetuned responses? Probably not.

Anywho, here are some references. Happy building.

References, for a deeper dive
