CVOct 17, 2022

TIVE: A Toolbox for Identifying Video Instance Segmentation Errors

Wenhe Jia, Lu Yang, Zilong Jia, Wenyi Zhao, Yilin Zhou, Qing Song

arXiv:2210.08856v12.67 citationsh-index: 20Has Code

Originality Incremental advance

AI Analysis

This provides researchers with a diagnostic tool to better understand and improve VIS models, though it is incremental as it focuses on evaluation rather than new model architectures.

The paper tackles the lack of tools for analyzing errors in Video Instance Segmentation (VIS) models by introducing TIVE, a toolbox that identifies error types, weights their impact on mAP, and reveals model drawbacks in spatial segmentation and temporal association, with experiments showing how these aspects interact.

Since first proposed, Video Instance Segmentation(VIS) task has attracted vast researchers' focus on architecture modeling to boost performance. Though great advances achieved in online and offline paradigms, there are still insufficient means to identify model errors and distinguish discrepancies between methods, as well approaches that correctly reflect models' performance in recognizing object instances of various temporal lengths remain barely available. More importantly, as the fundamental model abilities demanded by the task, spatial segmentation and temporal association are still understudied in both evaluation and interaction mechanisms. In this paper, we introduce TIVE, a Toolbox for Identifying Video instance segmentation Errors. By directly operating output prediction files, TIVE defines isolated error types and weights each type's damage to mAP, for the purpose of distinguishing model characters. By decomposing localization quality in spatial-temporal dimensions, model's potential drawbacks on spatial segmentation and temporal association can be revealed. TIVE can also report mAP over instance temporal length for real applications. We conduct extensive experiments by the toolbox to further illustrate how spatial segmentation and temporal association affect each other. We expect the analysis of TIVE can give the researchers more insights, guiding the community to promote more meaningful explorations for video instance segmentation. The proposed toolbox is available at https://github.com/wenhe-jia/TIVE.

View on arXiv PDF Code

Similar