CL AIDec 5, 2024

CLINICSUM: Utilizing Language Models for Generating Clinical Summaries from Patient-Doctor Conversations

Subash Neupane, Himanshu Tripathi, Shaswata Mitra, Sean Bozorgzad, Sudip Mittal, Shahram Rahimi, Amin Amirlatifi

arXiv:2412.04254v15.510 citationsh-index: 26BigData

Originality Highly original

AI Analysis

This addresses the problem of automating clinical summarization for healthcare professionals, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles automatic generation of clinical summaries from patient-doctor conversations by introducing ClinicSum, a framework that uses a retrieval-based filtering module and fine-tuned pre-trained language models, achieving superior performance in automatic metrics like ROUGE and BERTScore and high preference from experts.

This paper presents ClinicSum, a novel framework designed to automatically generate clinical summaries from patient-doctor conversations. It utilizes a two-module architecture: a retrieval-based filtering module that extracts Subjective, Objective, Assessment, and Plan (SOAP) information from conversation transcripts, and an inference module powered by fine-tuned Pre-trained Language Models (PLMs), which leverage the extracted SOAP data to generate abstracted clinical summaries. To fine-tune the PLM, we created a training dataset of consisting 1,473 conversations-summaries pair by consolidating two publicly available datasets, FigShare and MTS-Dialog, with ground truth summaries validated by Subject Matter Experts (SMEs). ClinicSum's effectiveness is evaluated through both automatic metrics (e.g., ROUGE, BERTScore) and expert human assessments. Results show that ClinicSum outperforms state-of-the-art PLMs, demonstrating superior precision, recall, and F-1 scores in automatic evaluations and receiving high preference from SMEs in human assessment, making it a robust solution for automated clinical summarization.

View on arXiv PDF

Similar