IRAP Jun 7, 2021

Network-based Topic Interaction Map for Big Data Mining of COVID-19 Biomedical Literature

arXiv:2106.07374v4
Originality: Incremental advance
AI Analysis

The framework gives researchers a tool to efficiently mine and understand topic compositions in large biomedical datasets, though its advance over existing topic modeling methods is incremental.

The authors tackled the challenge of analyzing the rapidly growing COVID-19 biomedical literature by proposing a novel framework that estimates and visualizes topic interactions, resulting in an intuitive representation of topic relationships for COVID-19 studies.

Since the emergence of the worldwide COVID-19 pandemic, relevant research has been published at a dazzling pace, yielding an abundance of big data in the biomedical literature. Due to the high volume of relevant literature, it is practically impossible to keep up with the research manually. Topic modeling is a well-known unsupervised learning method that aims to reveal latent topics from text data. In this paper, we propose a novel analytical framework for estimating topic interactions and visualizing them effectively to improve the understanding of relationships among topics. We first estimate topic-word distributions using the biterm topic model, and then estimate the topics' interactions from those word distributions using the latent space item response model. We map these latent topics onto networks to visualize relationships among the topics. Moreover, in the proposed approach, we develop a score that helps select meaningful words that characterize each topic. We examine how topics are related by tracking how their relationships change in a "trajectory plot" constructed at different levels of word richness. These findings provide a thoroughly mined and intuitive representation of relationships between topics related to a specific research area. The application of the proposed framework to the PubMed literature demonstrates the utility of our approach for understanding the topic composition of COVID-19 studies in the early stage of the pandemic.
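The first step of the pipeline relies on the biterm topic model, which works on biterms (unordered pairs of words that co-occur in the same short text) rather than on per-document word counts. The Python sketch below illustrates only that input-preparation idea. It is not the authors' implementation: the function names `extract_biterms` and `count_biterms` and the toy documents are hypothetical, and tokenization is assumed to have been done beforehand.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair from one tokenized short text."""
    # Sorting each pair makes (a, b) and (b, a) count as the same biterm.
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


def count_biterms(docs):
    """Aggregate biterm counts over a corpus of tokenized short texts."""
    counts = Counter()
    for tokens in docs:
        counts.update(extract_biterms(tokens))
    return counts


# Toy corpus (illustrative only, not taken from the paper's data).
docs = [
    ["covid", "vaccine", "trial"],
    ["covid", "vaccine", "efficacy"],
]
biterm_counts = count_biterms(docs)
```

A topic model fitted to such corpus-wide biterm counts yields the topic-word distributions that the later steps (interaction estimation and network mapping) take as input. Those later steps are not sketched here because the abstract does not give their details.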

Code Implementations: 1 repo
