SEApr 6

From REST to MCP: An Empirical Study of API Wrapping and Automated Server Generation for LLM Agents

arXiv:2507.1604446.97 citationsh-index: 6
Predicted impact top 54% in SE · last 90 daysOriginality Incremental advance
AI Analysis

It addresses the problem of inefficient and error-prone API wrapping for LLM agents, offering practical improvements for developers building MCP servers, though it is incremental as it builds on existing standards and tools.

This paper conducted the first large-scale empirical study of MCP server construction, analyzing 116 servers to find that 88.6% are REST-backed and expose only 19% of operations, and developed AutoMCP, an automated pipeline that improves tool generation success from 76% to 94.2% and reduces tool count by one-third.

The Model Context Protocol (MCP) is emerging as a standard interface through which LLM agents invoke external tools, and a growing ecosystem of MCP servers now mediates access to vendor services. Most of these servers target vendors that already expose REST APIs, yet the relationship between MCP tool interfaces and the underlying API surface has not been empirically characterised. This paper presents the first large-scale study of MCP server construction. We analyse 116 official servers to determine REST reliance and integration strategies (RQ1); examine servers paired with OpenAPI specifications to quantify operation exposure, omission, and mapping patterns (RQ2); evaluate automated generation from 80 real-world OpenAPI contracts (RQ3); and assess specification repair and tool-set transformations to improve correctness and reduce complexity (RQ4). We find that 88.6% of servers are fully or partially REST-backed, with 92% implementing tools as bare API wrappers. MCP servers expose a median of 19% of available operations, following systematic patterns predictable from the specification. Baseline generation succeeds for 76% of sampled tools; automated repair raises this to 94.2%, while filtering and regrouping reduce the median tool count per API by one-third. We release AutoMCP, an end-to-end pipeline integrating specification repair and empirically grounded tool-set transformations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes