CLOct 3, 2019

Towards Understanding of Medical Randomized Controlled Trials by Conclusion Generation

Alexander Te-Wei Shieh, Yung-Sung Chuang, Shang-Yu Su, Yun-Nung Chen

arXiv:1910.01462v130.1998 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of aiding clinical decision-making by automating the interpretation of RCTs, though it is incremental as it adapts existing models to a new domain.

The authors tackled the problem of generating conclusions from medical randomized controlled trials (RCTs) using machine learning, and found that fine-tuning GPT-2 on a PubMed dataset improved quality and correctness compared to a baseline pointer-generator model, as shown by both automatic and human evaluations.

Randomized controlled trials (RCTs) represent the paramount evidence of clinical medicine. Using machines to interpret the massive amount of RCTs has the potential of aiding clinical decision-making. We propose a RCT conclusion generation task from the PubMed 200k RCT sentence classification dataset to examine the effectiveness of sequence-to-sequence models on understanding RCTs. We first build a pointer-generator baseline model for conclusion generation. Then we fine-tune the state-of-the-art GPT-2 language model, which is pre-trained with general domain data, for this new medical domain task. Both automatic and human evaluation show that our GPT-2 fine-tuned models achieve improved quality and correctness in the generated conclusions compared to the baseline pointer-generator model. Further inspection points out the limitations of this current approach and future directions to explore.

View on arXiv PDF Code

Similar