CL LG · Apr 1, 2025

Efficient Annotator Reliability Assessment with EffiARA

Owen Cook, Jake Vasilakes, Ian Roberts, Xingyi Song

arXiv:2504.00589v36.72 citations · h-index: 2 · Has Code · ACL

Originality: Incremental advance

AI Analysis

This work addresses the problem of costly and unreliable data annotation for machine learning practitioners, though the contribution appears incremental because it builds on existing annotation concepts.

The paper tackles the lack of a standard framework for document-level annotation tasks by introducing EffiARA, which supports the entire annotation pipeline. Two previous studies using the framework reported improved classification performance and higher annotator agreement.

Data annotation is an essential component of the machine learning pipeline; it is also a costly and time-consuming process. With the introduction of transformer-based models, annotation at the document level is increasingly popular; however, there is no standard framework for structuring such tasks. The EffiARA annotation framework is, to our knowledge, the first project to support the whole annotation pipeline, from understanding the resources required for an annotation task to compiling the annotated dataset and gaining insights into the reliability of individual annotators as well as the dataset as a whole. The framework's efficacy is supported by two previous studies: one improving classification performance through annotator-reliability-based soft-label aggregation and sample weighting, and the other increasing the overall agreement among annotators by identifying and replacing an unreliable annotator. This work introduces the EffiARA Python package and its accompanying webtool, which provides an accessible graphical user interface for the system. We open-source the EffiARA Python package at https://github.com/MiniEggz/EffiARA and the webtool is publicly accessible at https://effiara.gate.ac.uk.
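To make the idea of annotator-reliability-based soft-label aggregation more concrete, here is a minimal sketch in plain Python. It does not use the EffiARA package or its API. The function name `aggregate_soft_label`, the annotator IDs, and the reliability values are all hypothetical, and the normalized weighted average used here is an assumed weighting scheme, not necessarily the one used in the paper.

```python
from typing import Dict, List


def aggregate_soft_label(
    annotations: Dict[str, List[float]],
    reliability: Dict[str, float],
) -> List[float]:
    """Combine per-annotator label distributions into one soft label.

    Each annotator's distribution is weighted by that annotator's
    reliability score, normalized over the annotators who labeled
    this sample. (Assumed scheme, for illustration only.)
    """
    total = sum(reliability[a] for a in annotations)
    if total <= 0:
        raise ValueError("reliability scores must sum to a positive value")

    n_classes = len(next(iter(annotations.values())))
    soft = [0.0] * n_classes
    for annotator, dist in annotations.items():
        weight = reliability[annotator] / total
        for i, p in enumerate(dist):
            soft[i] += weight * p
    return soft


# Hypothetical sample labeled by two annotators over two classes.
annotations = {"ann_a": [1.0, 0.0], "ann_b": [0.0, 1.0]}
# Hypothetical reliability scores: ann_a is trusted three times as much.
reliability = {"ann_a": 3.0, "ann_b": 1.0}

soft_label = aggregate_soft_label(annotations, reliability)
print(soft_label)
```

With these made-up inputs the more reliable annotator dominates the aggregated label, which is the intuition behind using reliability both for soft labels and, analogously, for per-sample training weights.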

