CVSep 10, 2018

The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

arXiv:1809.03171v17 citations
Originality Synthesis-oriented
AI Analysis

This provides a practical tool for researchers and practitioners needing multimodal object annotation, but it is incremental as it builds on existing annotation methods.

The authors introduced two annotation toolboxes for creating pixel/polygon masks and bounding boxes around objects in RGB and thermal images/videos, with each object receiving a classification tag, unique ID, and optional metadata. Tens of thousands of frames have been annotated using these tools.

This tech report gives an introduction to two annotation toolboxes that enable the creation of pixel and polygon-based masks as well as bounding boxes around objects of interest. Both toolboxes support the annotation of sequential images in the RGB and thermal modalities. Each annotated object is assigned a classification tag, a unique ID, and one or more optional meta data tags. The toolboxes are written in C++ with the OpenCV and Qt libraries and are operated by using the visual interface and the extensive range of keyboard shortcuts. Pre-built binaries are available for Windows and MacOS and the tools can be built from source under Linux as well. So far, tens of thousands of frames have been annotated using the toolboxes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes