CVAIROMar 8, 2022

Boosting Mask R-CNN Performance for Long, Thin Forensic Traces with Pre-Segmentation and IoU Region Merging

arXiv:2203.03886v1h-index: 3
Originality Incremental advance
AI Analysis

This is an incremental improvement for forensic image analysis, addressing a specific segmentation bottleneck.

The paper tackles Mask R-CNN's poor performance in segmenting long, thin forensic traces by adding PSPNet pre-segmentation and custom training strategies, achieving significant but unspecified improvements.

Mask R-CNN has recently achieved great success in the field of instance segmentation. However, weaknesses of the algorithm have been repeatedly pointed out as well, especially in the segmentation of long, sparse objects whose orientation is not exclusively horizontal or vertical. We present here an approach that significantly improves the performance of the algorithm by first pre-segmenting the images with a PSPNet algorithm. To further improve its prediction, we have developed our own cost functions and heuristics in the form of training strategies, which can prevent so-called (early) overfitting and achieve a more targeted convergence. Furthermore, due to the high variance of the images, especially for PSPNet, we aimed to develop strategies for a high robustness and generalization, which are also presented here.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes