CuRIAM: Corpus re Interpretation and Metalanguage in U.S. Supreme Court Opinions
This work addresses the need for systematic analysis of legal metalanguage, which is incremental as it applies existing linguistic concepts to a specific domain.
The researchers tackled the problem of analyzing metalanguage in judicial opinions by developing an annotation schema and applying it to U.S. Supreme Court opinions, resulting in a corpus of 59k tokens and identifying patterns in metalanguage usage.
Most judicial decisions involve the interpretation of legal texts; as such, judicial opinion requires the use of language as a medium to comment on or draw attention to other language. Language used this way is called metalanguage. We develop an annotation schema for categorizing types of legal metalanguage and apply our schema to a set of U.S. Supreme Court opinions, yielding a corpus totaling 59k tokens. We remark on several patterns observed in the kinds of metalanguage used by the justices.