20.5SIJun 2
Evidence-Aware Protein Complex Detection: Methods, Benchmarks, and Reproducibility ChallengesSima Soltani, Mehrdad Jalali, Yahya Forghani et al.
Protein complexes are central units of cellular organization, yet their identification from protein-protein interaction (PPI) networks remains difficult because interactome maps are noisy, incomplete, context dependent, and unevenly annotated. This focused methodological review examines evidence-aware approaches that combine PPI topology with Gene Ontology (GO) annotations, expression profiles, subcellular localization, sequence or domain evidence, temporal information, and representation learning, with emphasis on post-2018 methods and selected historical baselines. The central synthesis is that transparent evidence-aware graph methods currently offer the strongest tradeoff between biological plausibility and reproducibility, while deep, hypergraph, and dynamic heterogeneous models expand biological realism but require stronger benchmark control. The central bottleneck is no longer only the lack of algorithms, but the lack of harmonized, overlap-aware, and reproducible evaluation protocols. We therefore recommend unified benchmark versions, explicit GO-circularity controls, overlap-aware metrics, uncertainty estimates, and executable software packages over isolated source-specific F-measure gains.
AIAug 7, 2023
MSLE: An ontology for Materials Science Laboratory Equipment. Large-Scale Devices for Materials CharacterizationMehrdad Jalali, Matthias Mail, Rossella Aversa et al.
This paper introduces a new ontology for Materials Science Laboratory Equipment, termed MSLE. A fundamental issue with materials science laboratory (hereafter lab) equipment in the real world is that scientists work with various types of equipment with multiple specifications. For example, there are many electron microscopes with different parameters in chemical and physical labs. A critical development to unify the description is to build an equipment domain ontology as basic semantic knowledge and to guide the user to work with the equipment appropriately. Here, we propose to develop a consistent ontology for equipment, the MSLE ontology. In the MSLE, two main existing ontologies, the Semantic Sensor Network (SSN) and the Material Vocabulary (MatVoc), have been integrated into the MSLE core to build a coherent ontology. Since various acronyms and terms have been used for equipment, this paper proposes an approach to use a Simple Knowledge Organization System (SKOS) to represent the hierarchical structure of equipment terms. Equipment terms were collected in various languages and abbreviations and coded into the MSLE using the SKOS model. The ontology development was conducted in close collaboration with domain experts and focused on the large-scale devices for materials characterization available in our research group. Competency questions are expected to be addressed through the MSLE ontology. Constraints are modeled in the Shapes Query Language (SHACL); a prototype is shown and validated to show the value of the modeling constraints.
4.7SIMay 20
ECHO-PPI: Trustworthy AI for Evidence-Bundled Detection of Overlapping Protein Modules in Protein-Protein Interaction NetworksSima Soltani, Mehrdad Jalali, Yahya Forghani
Protein-protein interaction networks provide a graph-level view of cellular organization, yet their functional modules are overlapping, noisy, and difficult to interpret from cluster assignments alone. Existing community-detection methods can recover candidate protein complexes, but they rarely explain why an individual protein is assigned to a specific module or whether that assignment should be treated as core, peripheral, or uncertain. Here we introduce ECHO-PPI, an evidence-bundled framework for interpretable overlapping protein-module detection in protein-protein interaction networks. ECHO-PPI integrates weighted network topology, semantic protein profiles, and Gene Ontology evidence to identify evidence-potential nuclei, construct candidate modules, perform overlap-aware assignment, and export hierarchical confidence labels. The framework supports trustworthy computational decision support through assignment-level interpretability: each protein-module assignment is accompanied by topology, semantic, and Gene Ontology evidence scores and a hierarchical confidence label, enabling curators to inspect, rank, and triage overlapping module predictions. Evaluation on yeast protein-interaction data shows that ECHO-PPI preserves the behaviour of strong overlap-aware baselines while adding evidence-bundled auditability. Rather than claiming universal predictive superiority, ECHO-PPI addresses a complementary need: making overlapping protein-module predictions inspectable, confidence-aware, and reproducible for downstream biological interpretation.
7.7SIMay 15
CitePrism: Human-in-the-Loop AI for Citation Auditing and Editorial IntegrityGowrika Mahesh, Budanur Madappa Darshan Gowda, Kavana Gopladevarahalli Papegowda et al.
Editors and reviewers are expected to ensure that manuscripts cite relevant, accurate, current, and ethically appropriate literature, yet manuscript-level citation auditing remains largely manual, fragmented, and difficult to scale. Citation context, metadata quality, self-citation patterns, and bibliographic integrity all affect whether a reference appropriately supports a local claim. We present CitePrism, a transparent hybrid decision-support framework for editorial citation auditing that combines LLM-assisted contextual reasoning, embedding-based semantic similarity, metadata verification, integrity-oriented flags, and human-in-the-loop analyst review. CitePrism extracts citation neighborhoods, enriches reference metadata, computes fused relevance scores, surfaces metadata and self-citation review prompts, and supports configurable threshold-based triage. In a preliminary validation on a single case-study manuscript with 104 references from pavement engineering, agreement with human binary relevance labels reached Cohen's kappa = 0.429. At operating threshold tau = 17, CitePrism flagged all human-labeled irrelevant citations, while also producing false positives requiring analyst review. These results suggest that CitePrism may support conservative editorial screening and citation-quality triage, but they do not establish general editorial performance. CitePrism is intended as pilot-stage decision support, not as an autonomous misconduct detector or automated editorial decision system. Broader validation across manuscripts, domains, annotators, baselines, and deployment settings is required before operational use.
29.6SIMay 10
Astro Generative Network: A Variational Framework for Controlled Node Insertion in Incomplete Complex NetworksMehrdad Jalali, Binh Vu, Swati Chandna et al.
Empirical networked systems are often only partially observed: sampling frames, crawling policies, privacy constraints, and temporal gaps can leave actors and edges unobserved. This complicates robustness and sensitivity analysis because many graph-learning pipelines implicitly treat the observed node set as exhaustive. Link prediction and graph completion repair structure among known vertices, whereas full-graph generators synthesize new graphs rather than extending an observed one as a fixed backbone. We study the complementary task of controlled node insertion: generating plausible new actors and attaching them to an existing graph while preserving interpretable global topology. We introduce the Astro Generative Network (AGN), a variational graph autoencoder that samples latent vectors to decode node features and then integrates new vertices through similarity-based attachment to the observed backbone. We distinguish the recommended configuration, AGN, from AGN-original, a diagnostic baseline that permits generated-generated edges. Across three synthetic regimes, AGN-original forms dense generated-generated subgraphs that artificially inflate clustering and density. Disabling those edges removes this artifact while preserving degree and path-length behavior. In our experiments, AGN keeps clustering and modularity changes modest relative to pre-insertion values, while novelty diagnostics show non-trivial separation from existing nodes without claiming domain-grounded identities. Our contribution is methodological: a reproducible insertion protocol and evaluation lens for incomplete network science and engineering
AIJan 26, 2021
Adaptive Neuro Fuzzy Networks based on Quantum Subtractive ClusteringAli Mousavi, Mehrdad Jalali, Mahdi Yaghoubi
Data mining techniques can be used to discover useful patterns by exploring and analyzing data and it's feasible to synergitically combine machine learning tools to discover fuzzy classification rules.In this paper, an adaptive Neuro fuzzy network with TSK fuzzy type and an improved quantum subtractive clustering has been developed. Quantum clustering (QC) is an intuition from quantum mechanics which uses Schrodinger potential and time-consuming gradient descent method. The principle advantage and shortcoming of QC is analyzed and based on its shortcomings, an improved algorithm through a subtractive clustering method is proposed. Cluster centers represent a general model with essential characteristics of data which can be use as premise part of fuzzy rules.The experimental results revealed that proposed Anfis based on quantum subtractive clustering yielded good approximation and generalization capabilities and impressive decrease in the number of fuzzy rules and network output accuracy in comparison with traditional methods.
SIJun 12, 2019
A decentralized trust-aware collaborative filtering recommender system based on weighted items for social tagging systemsHossein Monshizadeh Naeen, Mehrdad Jalali
Recommender systems are used with the purpose of suggesting contents and resources to the users in a social network. These systems use ranks or tags each user assign to different resources to predict or make suggestions to users. Lately, social tagging systems, in which users can insert new contents, tag, organize, share, and search for contents are becoming more popular. These systems have a lot of valuable information, but data growth is one of its biggest challenges and this has led to the need for recommender systems that will predict what each user may like or need. One approach to the design of these systems which uses social environment of users is known as collaborative filtering (CF). One of the problems in CF systems is trustworthy of users and their tags. In this work, we consider a trust metric (which is concluded from users tagging behavior) beside the similarities to give suggestions and examine its effect on results. On the other hand, a decentralized approach is introduced which calculates similarity and trust relationships between users in a distributed manner. This causes the capability of implementing the proposed approach among all types of users with respect to different types of items, which are accessed by unique id across heterogeneous networks and environments. Finally, we show that the proposed model for calculating similarities between users reduces the size of the user-item matrix and considering trust in collaborative systems can lead to a better performance in generating suggestions.
LGJan 7, 2018
Applying an Ensemble Learning Method for Improving Multi-label Classification PerformanceAmirreza Mahdavi-Shahri, Mahboobeh Houshmand, Mahdi Yaghoobi et al.
In recent years, multi-label classification problem has become a controversial issue. In this kind of classification, each sample is associated with a set of class labels. Ensemble approaches are supervised learning algorithms in which an operator takes a number of learning algorithms, namely base-level algorithms and combines their outcomes to make an estimation. The simplest form of ensemble learning is to train the base-level algorithms on random subsets of data and then let them vote for the most popular classifications or average the predictions of the base-level algorithms. In this study, an ensemble learning method is proposed for improving multi-label classification evaluation criteria. We have compared our method with well-known base-level algorithms on some data sets. Experiment results show the proposed approach outperforms the base well-known classifiers for the multi-label classification problem.
SEFeb 10, 2014
An Optimized Semantic Web Service Composition Method Based on Clustering and Ant Colony AlgorithmNarges Hesami Rostami, Esmaeil Kheirkhah, Mehrdad Jalali
In today's Web, Web Services are created and updated on the fly. For answering complex needs of users, the construction of new web services based on existing ones is required. It has received a great attention from different communities. This problem is known as web services composition. However, it is one of big challenge problems of recent years in a distributed and dynamic environment. Web services can be composed manually but it is a time consuming task. The automatic web service composition is one of the key features for future the semantic web. The various approaches in field of web service compositions proposed by the researchers. In this paper, we propose a novel architecture for semantic web service composition using clustering and Ant colony algorithm.