Team PA-VCG's Solution for Competition on Understanding Chinese College Entrance Exam Papers in ICDAR'25
This work addresses document understanding challenges for educational assessment systems, but it is incremental as it builds on existing methods with specific optimizations.
The paper tackled the problem of dense OCR extraction and complex document layouts in Chinese college entrance exam papers by introducing domain-specific post-training strategies, achieving first place with an accuracy rate of 89.6%.
This report presents Team PA-VGG's solution for the ICDAR'25 Competition on Understanding Chinese College Entrance Exam Papers. In addition to leveraging high-resolution image processing and a multi-image end-to-end input strategy to address the challenges of dense OCR extraction and complex document layouts in Gaokao papers, our approach introduces domain-specific post-training strategies. Experimental results demonstrate that our post-training approach achieves the most outstanding performance, securing first place with an accuracy rate of 89.6%.