SEAIJul 7, 2025

OASBuilder: Generating OpenAPI Specifications from Online API Documentation with Large Language Models

arXiv:2507.05316v1h-index: 18
Originality Synthesis-oriented
AI Analysis

This addresses the bottleneck of manual API specification creation for AI agents and business automation tools, though it is an incremental improvement by applying existing methods to a specific domain.

The paper tackles the problem of converting unstructured API documentation into machine-readable OpenAPI specifications, introducing OASBuilder, a framework that uses large language models and rule-based algorithms to automate this process, saving thousands of hours of manual effort and making hundreds of enterprise APIs accessible for LLMs.

AI agents and business automation tools interacting with external web services require standardized, machine-readable information about their APIs in the form of API specifications. However, the information about APIs available online is often presented as unstructured, free-form HTML documentation, requiring external users to spend significant time manually converting it into a structured format. To address this, we introduce OASBuilder, a novel framework that transforms long and diverse API documentation pages into consistent, machine-readable API specifications. This is achieved through a carefully crafted pipeline that integrates large language models and rule-based algorithms which are guided by domain knowledge of the structure of documentation webpages. Our experiments demonstrate that OASBuilder generalizes well across hundreds of APIs, and produces valid OpenAPI specifications that encapsulate most of the information from the original documentation. OASBuilder has been successfully implemented in an enterprise environment, saving thousands of hours of manual effort and making hundreds of complex enterprise APIs accessible as tools for LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes