Joana B. Pereira

LG
h-index: 14
4 papers
Novelty: 43%
AI Score: 35

4 Papers

70.3 SOC-PH Mar 11
Technological Excellence Requires Human and Social Context

Karl Palmås, Mats Benner, Monica Billger et al.

Breakthrough technologies increasingly shape social institutions, economic systems, and political futures. Yet models of research excellence associated with such technologies often prioritize technical performance, scalability, and short-term innovation metrics while treating ethical, social, and cultural dimensions as secondary considerations. This perspective article argues that such separation is no longer tenable. We propose a broader understanding of excellence that combines technical rigor with ethical robustness, social intelligibility, and long-term relevance. The rapid emergence of generative and agentic artificial intelligence further underscores this argument. As technological systems increasingly operate through language, interpretation, and normative alignment, expertise traditionally cultivated in the humanities and social sciences becomes integral to the design, governance, and responsible deployment of such systems. Drawing on historical examples and contemporary research practices, this article examines five interconnected domains where the humanities and social sciences, treated as integrated dimensions of research practice, can strengthen technological development: (1) ethical, legal, and social integration in agenda-setting and research design; (2) plural and reflexive foresight practices that shape technological futures; (3) graduate education as a leverage point for cross-disciplinary literacy; (4) visualization and communication as epistemic and civic practices; and (5) institutional frameworks that move beyond rigid distinctions between basic and applied research. Across these dimensions, we propose practical strategies for embedding interdisciplinary collaboration structurally rather than symbolically.

LG Jul 21, 2025
Deep-Learning Investigation of Vibrational Raman Spectra for Plant-Stress Analysis

Anoop C. Patil, Benny Jian Rong Sng, Yu-Wei Chang et al.

Detecting stress in plants is crucial for both open-farm and controlled-environment agriculture. Biomolecules within plants serve as key stress indicators, offering vital markers for continuous health monitoring and early disease detection. Raman spectroscopy provides a powerful, non-invasive means to quantify these biomolecules through their molecular vibrational signatures. However, traditional Raman analysis relies on customized data-processing workflows that require fluorescence background removal and prior identification of Raman peaks of interest, introducing potential biases and inconsistencies. Here, we introduce DIVA (Deep-learning-based Investigation of Vibrational Raman spectra for plant-stress Analysis), a fully automated workflow based on a variational autoencoder. Unlike conventional approaches, DIVA processes native Raman spectra, including fluorescence backgrounds, without manual preprocessing, identifying and quantifying significant spectral features in an unbiased manner. We applied DIVA to detect a range of plant stresses, including abiotic (shading, high light intensity, high temperature) and biotic stressors (bacterial infections). By integrating deep learning with vibrational spectroscopy, DIVA paves the way for AI-driven plant health assessment, fostering more resilient and sustainable agricultural practices.

LG Mar 7, 2025
Global graph features unveiled by unsupervised geometric deep learning

Mirja Granfors, Jesús Pineda, Blanca Zufiria Gerbolés et al.

Graphs provide a powerful framework for modeling complex systems, but their structural variability poses significant challenges for analysis and classification. To address these challenges, we introduce GAUDI (Graph Autoencoder Uncovering Descriptive Information), a novel unsupervised geometric deep learning framework designed to capture both local details and global structure. GAUDI employs an innovative hourglass architecture with hierarchical pooling and upsampling layers linked through skip connections, which preserve essential connectivity information throughout the encoding-decoding process. Even though identical or highly similar underlying parameters describing a system's state can lead to significant variability in graph realizations, GAUDI consistently maps them into nearby regions of a structured and continuous latent space, effectively disentangling invariant process-level features from stochastic noise. We demonstrate GAUDI's versatility across multiple applications, including small-world network modeling, characterization of protein assemblies from super-resolution microscopy, analysis of collective motion in the Vicsek model, and identification of age-related changes in brain connectivity. Comparison with related approaches highlights GAUDI's superior performance in analyzing complex graphs, providing new insights into emergent phenomena across diverse scientific domains.

LG Jul 1, 2021
Neural Network Training with Highly Incomplete Datasets

Yu-Wei Chang, Laura Natali, Oveis Jamialahmadi et al.

Neural network training and validation rely on the availability of large high-quality datasets. However, in many cases only incomplete datasets are available, particularly in health care applications, where each patient typically undergoes different clinical procedures or can drop out of a study. Since the data to train the neural networks need to be complete, most studies discard the incomplete datapoints, which reduces the size of the training data, or impute the missing features, which can lead to artefacts. Alas, both approaches are inadequate when a large portion of the data is missing. Here, we introduce GapNet, an alternative deep-learning training approach that can use highly incomplete datasets. First, the dataset is split into subsets of samples containing all values for a certain cluster of features. Then, these subsets are used to train individual neural networks. Finally, this ensemble of neural networks is combined into a single neural network whose training is fine-tuned using all complete datapoints. Using two highly incomplete real-world medical datasets, we show that GapNet improves the identification of patients with underlying Alzheimer's disease pathology and of patients at risk of hospitalization due to Covid-19. By distilling the information available in incomplete datasets without having to reduce their size or to impute missing values, GapNet makes it possible to extract valuable information from a wide range of datasets, benefiting diverse fields from medicine to engineering.
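
The first step described in the GapNet abstract, splitting an incomplete dataset into per-cluster subsets of complete samples, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `split_by_feature_clusters` and `complete_rows`, the toy data, and the cluster definitions are all hypothetical, and the training and fine-tuning stages are left out.

```python
import math

def is_missing(value):
    """Treat None and NaN as missing (an assumption of this sketch)."""
    return value is None or (isinstance(value, float) and math.isnan(value))

def split_by_feature_clusters(rows, clusters):
    """For each feature cluster, keep only the samples that have every
    value in that cluster. Returns {cluster_name: [(row_index, values), ...]}."""
    subsets = {}
    for name, cols in clusters.items():
        kept = []
        for i, row in enumerate(rows):
            values = [row[c] for c in cols]
            if not any(is_missing(v) for v in values):
                kept.append((i, values))
        subsets[name] = kept
    return subsets

def complete_rows(rows):
    """Indices of samples with no missing feature at all; the abstract says
    such datapoints are used for the final fine-tuning stage."""
    return [i for i, row in enumerate(rows)
            if not any(is_missing(v) for v in row)]

# Toy dataset: 4 samples, 4 features, None marks a missing measurement.
rows = [
    [1.0, 2.0, None, None],
    [3.0, 4.0, 5.0, 6.0],
    [None, None, 7.0, 8.0],
    [9.0, 1.0, 2.0, None],
]
# Hypothetical feature clusters (column indices), e.g. two clinical procedures.
clusters = {"A": [0, 1], "B": [2, 3]}

subsets = split_by_feature_clusters(rows, clusters)
for name, kept in subsets.items():
    print(name, [i for i, _ in kept])
print("complete:", complete_rows(rows))
```

In this toy case only one sample is fully complete, yet each cluster subset retains more samples than that, which is the property the abstract relies on: each individual network would see more data than a discard-incomplete baseline.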