CLAug 25, 2018

MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction

arXiv:1808.08392v11089 citations
AI Analysis

This tool addresses the problem of efficient and accurate annotation for Arabic language researchers, but it is incremental as it builds on existing annotation methods with a new interface.

The authors introduced MADARi, a web interface for joint morphological annotation and spelling correction in Standard and Dialectal Arabic, which includes features like pre-computed analyses and remote management to enhance annotator productivity, as detailed in a user study.

In this paper, we introduce MADARi, a joint morphological annotation and spelling correction system for texts in Standard and Dialectal Arabic. The MADARi framework provides intuitive interfaces for annotating text and managing the annotation process of a large number of sizable documents. Morphological annotation includes indicating, for a word, in context, its baseword, clitics, part-of-speech, lemma, gloss, and dialect identification. MADARi has a suite of utilities to help with annotator productivity. For example, annotators are provided with pre-computed analyses to assist them in their task and reduce the amount of work needed to complete it. MADARi also allows annotators to query a morphological analyzer for a list of possible analyses in multiple dialects or look up previously submitted analyses. The MADARi management interface enables a lead annotator to easily manage and organize the whole annotation process remotely and concurrently. We describe the motivation, design and implementation of this interface; and we present details from a user study working with this system.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes