CLAIMar 13, 2023

The System Description of dun_oscar team for The ICPR MSR Challenge

arXiv:2303.06878v1h-index: 4
Originality Synthesis-oriented
AI Analysis

This is an incremental system description for a specific video analysis challenge, with no broader implications stated.

The paper describes a system for the ICPR MSR Challenge that tackles video subtitle recognition and correction across three tasks, achieving results through a combination of OCR, ASR, and fusion modules, though no concrete performance numbers are provided.

This paper introduces the system submitted by dun_oscar team for the ICPR MSR Challenge. Three subsystems for task1-task3 are descripted respectively. In task1, we develop a visual system which includes a OCR model, a text tracker, and a NLP classifier for distinguishing subtitles and non-subtitles. In task2, we employ an ASR system which includes an AM with 18 layers and a 4-gram LM. Semi-supervised learning on unlabeled data is also vital. In task3, we employ the ASR system to improve the visual system, some false subtitles can be corrected by a fusion module.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes