CV AI ROMar 8, 2022

Boosting Mask R-CNN Performance for Long, Thin Forensic Traces with Pre-Segmentation and IoU Region Merging

Moritz Zink, Martin Schiele, Pengcheng Fan, Stephan Gasterstädt

arXiv:2203.03886v1h-index: 3

Originality Incremental advance

AI Analysis

This is an incremental improvement for forensic image analysis, addressing a specific segmentation bottleneck.

The paper tackles Mask R-CNN's poor performance in segmenting long, thin forensic traces by adding PSPNet pre-segmentation and custom training strategies, achieving significant but unspecified improvements.

Mask R-CNN has recently achieved great success in the field of instance segmentation. However, weaknesses of the algorithm have been repeatedly pointed out as well, especially in the segmentation of long, sparse objects whose orientation is not exclusively horizontal or vertical. We present here an approach that significantly improves the performance of the algorithm by first pre-segmenting the images with a PSPNet algorithm. To further improve its prediction, we have developed our own cost functions and heuristics in the form of training strategies, which can prevent so-called (early) overfitting and achieve a more targeted convergence. Furthermore, due to the high variance of the images, especially for PSPNet, we aimed to develop strategies for a high robustness and generalization, which are also presented here.

View on arXiv PDF

Similar