CLJul 4, 2022

BoAT v2 -- A Web-Based Dependency Annotation Tool with Focus on Agglutinative Languages

arXiv:2207.01327v21 citationsh-index: 7Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the labor-intensive annotation process for agglutinative languages, providing a domain-specific tool to benefit linguists and NLP researchers, though it is incremental as it builds on an existing tool.

The authors tackled the problem of creating dependency treebanks for agglutinative languages like Turkish by developing BoAT v2, a web-based annotation tool that improves upon BoAT v1 to increase annotation speed and user experience, resulting in a tool that supports multi-user collaboration and is open-source.

The value of quality treebanks is steadily increasing due to the crucial role they play in the development of natural language processing tools. The creation of such treebanks is enormously labor-intensive and time-consuming. Especially when the size of treebanks is considered, tools that support the annotation process are essential. Various annotation tools have been proposed, however, they are often not suitable for agglutinative languages such as Turkish. BoAT v1 was developed for annotating dependency relations and was subsequently used to create the manually annotated BOUN Treebank (UD_Turkish-BOUN). In this work, we report on the design and implementation of a dependency annotation tool BoAT v2 based on the experiences gained from the use of BoAT v1, which revealed several opportunities for improvement. BoAT v2 is a multi-user and web-based dependency annotation tool that is designed with a focus on the annotator user experience to yield valid annotations. The main objectives of the tool are to: (1) support creating valid and consistent annotations with increased speed, (2) significantly improve the user experience of the annotator, (3) support collaboration among annotators, and (4) provide an open-source and easily deployable web-based annotation tool with a flexible application programming interface (API) to benefit the scientific community. This paper discusses the requirements elicitation, design, and implementation of BoAT v2 along with examples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes