CLAINov 15, 2018

Implementing a Portable Clinical NLP System with a Common Data Model - a Lisp Perspective

arXiv:1811.06179v12 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of clinical data interoperability for healthcare researchers and practitioners, though it is incremental as it builds on existing NLP tools and models.

The paper tackles the challenge of creating a portable clinical NLP system by developing LAPNLP, a Lisp-based architecture that integrates various NLP tools and uses a Common Data Model for standardization, enabling tasks like computational phenotyping and semantic relation extraction across institutions.

This paper presents a Lisp architecture for a portable NLP system, termed LAPNLP, for processing clinical notes. LAPNLP integrates multiple standard, customized and in-house developed NLP tools. Our system facilitates portability across different institutions and data systems by incorporating an enriched Common Data Model (CDM) to standardize necessary data elements. It utilizes UMLS to perform domain adaptation when integrating generic domain NLP tools. It also features stand-off annotations that are specified by positional reference to the original document. We built an interval tree based search engine to efficiently query and retrieve the stand-off annotations by specifying positional requirements. We also developed a utility to convert an inline annotation format to stand-off annotations to enable the reuse of clinical text datasets with inline annotations. We experimented with our system on several NLP facilitated tasks including computational phenotyping for lymphoma patients and semantic relation extraction for clinical notes. These experiments showcased the broader applicability and utility of LAPNLP.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes