CVMar 25, 2019

Knowledge-driven Encode, Retrieve, Paraphrase for Medical Image Report Generation

arXiv:1903.10122v1327 citations
Originality Incremental advance
AI Analysis

This addresses the problem of generating reliable medical reports for healthcare professionals, though it appears incremental by combining existing knowledge- and retrieval-based methods with learning-based techniques.

The paper tackles the challenge of generating accurate and coherent medical image reports by proposing a Knowledge-driven Encode, Retrieve, Paraphrase (KERP) approach, which achieves state-of-the-art results on two benchmarks with improved abnormality classification accuracy and human evaluation performance.

Generating long and semantic-coherent reports to describe medical images poses great challenges towards bridging visual and linguistic modalities, incorporating medical domain knowledge, and generating realistic and accurate descriptions. We propose a novel Knowledge-driven Encode, Retrieve, Paraphrase (KERP) approach which reconciles traditional knowledge- and retrieval-based methods with modern learning-based methods for accurate and robust medical report generation. Specifically, KERP decomposes medical report generation into explicit medical abnormality graph learning and subsequent natural language modeling. KERP first employs an Encode module that transforms visual features into a structured abnormality graph by incorporating prior medical knowledge; then a Retrieve module that retrieves text templates based on the detected abnormalities; and lastly, a Paraphrase module that rewrites the templates according to specific cases. The core of KERP is a proposed generic implementation unit---Graph Transformer (GTR) that dynamically transforms high-level semantics between graph-structured data of multiple domains such as knowledge graphs, images and sequences. Experiments show that the proposed approach generates structured and robust reports supported with accurate abnormality description and explainable attentive regions, achieving the state-of-the-art results on two medical report benchmarks, with the best medical abnormality and disease classification accuracy and improved human evaluation performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes