BBC-Oxford British Sign Language Dataset
This dataset addresses the need for resources in sign language technology, but the contribution is incremental because it extends an existing dataset (BSL-1K).
The authors introduce the BBC-Oxford British Sign Language (BOBSL) dataset, a large-scale video collection of British Sign Language, and provide baseline experiments for sign recognition, sign language alignment, and sign language translation.
In this work, we introduce the BBC-Oxford British Sign Language (BOBSL) dataset, a large-scale video collection of British Sign Language (BSL). BOBSL is an extended and publicly released dataset based on the BSL-1K dataset introduced in previous work. We describe the motivation for the dataset, together with statistics and available annotations. We conduct experiments to provide baselines for the tasks of sign recognition, sign language alignment, and sign language translation. Finally, we describe several strengths and limitations of the data from the perspectives of machine learning and linguistics, note sources of bias present in the dataset, and discuss potential applications of BOBSL in the context of sign language technology. The dataset is available at https://www.robots.ox.ac.uk/~vgg/data/bobsl/.