CLJan 11, 2020

PatentTransformer-2: Controlling Patent Text Generation by Structural Metadata

arXiv:2001.03708v111 citations
AI Analysis

This work addresses patent drafting automation for inventors and legal professionals, but it is incremental as it builds on prior versions and existing methods.

The authors tackled patent text generation by leveraging structural metadata to control the output, introducing a bidirectional text-to-text flow from metadata like titles and claims. They released GPT-2 models and code, with quality measured using ROUGE and Google Universal Sentence Encoder.

PatentTransformer is our codename for patent text generation based on Transformer-based models. Our goal is "Augmented Inventing." In this second version, we leverage more of the structural metadata in patents. The structural metadata includes patent title, abstract, and dependent claim, in addition to independent claim previously. Metadata controls what kind of patent text for the model to generate. Also, we leverage the relation between metadata to build a text-to-text generation flow, for example, from a few words to a title, the title to an abstract, the abstract to an independent claim, and the independent claim to multiple dependent claims. The text flow can go backward because the relation is trained bidirectionally. We release our GPT-2 models trained from scratch and our code for inference so that readers can verify and generate patent text on their own. As for generation quality, we measure it by both ROUGE and Google Universal Sentence Encoder.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes