CLAug 1, 2022

Masader Plus: A New Interface for Exploring +500 Arabic NLP Datasets

arXiv:2208.00932v112 citationsh-index: 20
Originality Synthesis-oriented
AI Analysis

This work provides a tool for researchers and users in Arabic NLP to more easily explore datasets, though it is incremental as it builds on an existing metadata structure.

The paper tackles the challenge of exploring a catalogue of over 500 Arabic NLP datasets by introducing Masader Plus, a web interface that enables users to browse, filter, and access datasets through an API, improving accessibility and user experience.

Masader (Alyafeai et al., 2021) created a metadata structure to be used for cataloguing Arabic NLP datasets. However, developing an easy way to explore such a catalogue is a challenging task. In order to give the optimal experience for users and researchers exploring the catalogue, several design and user experience challenges must be resolved. Furthermore, user interactions with the website may provide an easy approach to improve the catalogue. In this paper, we introduce Masader Plus, a web interface for users to browse Masader. We demonstrate data exploration, filtration, and a simple API that allows users to examine datasets from the backend. Masader Plus can be explored using this link https://arbml.github.io/masader. A video recording explaining the interface can be found here https://www.youtube.com/watch?v=SEtdlSeqchk.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes