CLJan 5, 2023

TextDescriptives: A Python package for calculating a large variety of metrics from text

arXiv:2301.02057v341 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This provides a tool for researchers and practitioners in fields like healthcare and education to compute text metrics, but it is incremental as it builds on existing libraries like spaCy.

The authors introduced TextDescriptives, a Python package built on spaCy for calculating diverse text metrics, and demonstrated its application in analyzing clinical text stability, predicting neuropsychiatric conditions, and studying linguistic goals in education.

TextDescriptives is a Python package for calculating a large variety of metrics from text. It is built on top of spaCy and can be easily integrated into existing workflows. The package has already been used for analysing the linguistic stability of clinical texts, creating features for predicting neuropsychiatric conditions, and analysing linguistic goals of primary school students. This paper describes the package and its features.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes