49.5CLMay 23Code
From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlasZhaokun Yan, Shan Xu, Wuzheng Dong et al.
Public health reasoning requires population level inference grounded in scientific evidence, expert consensus, and safety constraints. However, it remains underexplored as a structured machine learning problem with limited supervised signals and benchmarks. We introduce GlobalHealthAtlas, a large scale multilingual dataset of 280,210 instances spanning 15 public health domains and 17 languages. We further propose a large language model (LLM) assisted construction and quality control pipeline with retrieval, deduplication, evidence grounding checks, and label validation to improve consistency at scale. Finally, we present a domain aligned evaluator distilled from high confidence judgments of diverse LLMs to assess outputs along six dimensions: Accuracy, Reasoning, Completeness, Consensus Alignment, Terminology Norms, and Insightfulness. Together, these contributions enable reproducible training and evaluation of LLMs for safety critical public health reasoning beyond conventional QA benchmarks. We publicly release project codebase, evaluator, and model at:: https://github.com/Jan8217/GlobalHealthAtlas, https://huggingface.co/aerovane0/GlobalHealthAtlas_Public_Evaluator and https://huggingface.co/aerovane0/GlobalHealthAtlas_Public_Model
IVMay 28, 2025
Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal PathologyLianghui Zhu, Xitong Ling, Minxi Ouyang et al.
Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterative optimization strategy combining pretraining with fine-screening, specifically designed to address the detection of sparsely distributed lesion areas in whole-slide images. Digepath is pretrained on over 353 million multi-scale images from 210,043 H&E-stained slides of GI diseases. It attains state-of-the-art performance on 33 out of 34 tasks related to GI pathology, including pathological diagnosis, protein expression status prediction, gene mutation prediction, and prognosis evaluation. We further translate the intelligent screening module for early GI cancer and achieve near-perfect 99.70% sensitivity across nine independent medical institutions. This work not only advances AI-driven precision pathology for GI diseases but also bridge critical gaps in histopathological practice.
NCJun 20, 2024
The neural correlates of logical-mathematical symbol systems processing resemble that of spatial cognition more than natural language processingYuannan Li, Shan Xu, Jia Liu
The ability to manipulate logical-mathematical symbols (LMS), encompassing tasks such as calculation, reasoning, and programming, is a cognitive skill arguably unique to humans. Considering the relatively recent emergence of this ability in human evolutionary history, it has been suggested that LMS processing may build upon more fundamental cognitive systems, possibly through neuronal recycling. Previous studies have pinpointed two primary candidates, natural language processing and spatial cognition. Existing comparisons between these domains largely relied on task-level comparison, which may be confounded by task idiosyncrasy. The present study instead compared the neural correlates at the domain level with both automated meta-analysis and synthesized maps based on three representative LMS tasks, reasoning, calculation, and mental programming. Our results revealed a more substantial cortical overlap between LMS processing and spatial cognition, in contrast to language processing. Furthermore, in regions activated by both spatial and language processing, the multivariate activation pattern for LMS processing exhibited greater multivariate similarity to spatial cognition than to language processing. A hierarchical clustering analysis further indicated that typical LMS tasks were indistinguishable from spatial cognition tasks at the neural level, suggesting an inherent connection between these two cognitive processes. Taken together, our findings support the hypothesis that spatial cognition is likely the basis of LMS processing, which may shed light on the limitations of large language models in logical reasoning, particularly those trained exclusively on textual data without explicit emphasis on spatial content.
HCJan 8, 2022
Subtle Contact Nuances in the Delivery of Human-to-Human Touch Distinguish Emotional SentimentShan Xu, Chang Xu, Sarah McIntyre et al.
We routinely communicate distinct social and emotional sentiments through nuanced touch. For example, we might gently hold another's arm to offer a sense of calm, yet intensively hold another's arm to express excitement or anxiety. As this example indicates, distinct sentiments may be shaped by the subtlety in one's touch delivery. This work investigates how slight distinctions in skin-to-skin contact influence both the recognition of cued emotional messages (e.g., anger, sympathy) and the rating of emotional content (i.e., arousal, valence). By self-selecting preferred gestures (e.g., holding, stroking), touchers convey distinct messages by touching the receiver's forearm. Skin-to-skin contact attributes (e.g., velocity, depth, area) are optically tracked in high resolution. Contact is then examined within gesture, between messages. The results indicate touchers subtly, but significantly, vary contact attributes of a gesture to communicate distinct messages, which are recognizable by receivers. This tuning also correlates with receivers' arousal and valence. For instance, arousal increases with velocity for stroking, and depth for holding. Moreover, as shown here with human-to-human touch, valence is tied with velocity, which is the same trend as reported with brushes. The findings indicate that subtle nuance in skin-to-skin contact is important in conveying social messages and inducing emotions.
CVOct 1, 2012
Super-resolution using Sparse Representations over Learned Dictionaries: Reconstruction of Brain Structure using Electron MicroscopyTao Hu, Juan Nunez-Iglesias, Shiv Vitaladevuni et al.
A central problem in neuroscience is reconstructing neuronal circuits on the synapse level. Due to a wide range of scales in brain architecture such reconstruction requires imaging that is both high-resolution and high-throughput. Existing electron microscopy (EM) techniques possess required resolution in the lateral plane and either high-throughput or high depth resolution but not both. Here, we exploit recent advances in unsupervised learning and signal processing to obtain high depth-resolution EM images computationally without sacrificing throughput. First, we show that the brain tissue can be represented as a sparse linear combination of localized basis functions that are learned using high-resolution datasets. We then develop compressive sensing-inspired techniques that can reconstruct the brain tissue from very few (typically 5) tomographic views of each section. This enables tracing of neuronal processes and, hence, high throughput reconstruction of neural circuits on the level of individual synapses.