Eyhab Al-Masri

IR
h-index19
4papers
4citations
Novelty43%
AI Score33

4 Papers

IRMar 9
Quantifying Divergence in Inter-LLM Communication Through API Retrieval and Ranking

Eyhab Al-Masri

Large language models (LLMs) increasingly operate as autonomous agents that reason over external APIs to perform complex tasks. However, their reliability and agreement remain poorly characterized. We present a unified benchmarking framework to quantify inter-LLM divergence, defined as the extent to which models differ in API discovery and ranking under identical tasks. Across 15 canonical API domains and 5 major model families, we measure pairwise and group-level agreement using set-, rank-, and consensus-based metrics including Average Overlap, Jaccard similarity, Rank-Biased Overlap, Kendall's tau, Kendall's W, and Cronbach's alpha. Results show moderate overall alignment (AO about 0.50, tau about 0.45) but strong domain dependence: structured tasks (Weather, Speech-to-Text) are stable, while open-ended tasks (Sentiment Analysis) exhibit substantially higher divergence. Volatility and consensus analyses reveal that coherence clusters around data-bound domains and degrades for abstract reasoning tasks. These insights enable reliability-aware orchestration in multi-agent systems, where consensus weighting can improve coordination among heterogeneous LLMs. Beyond performance benchmarking, our results reveal systematic failure modes in multi-agent LLM coordination, where apparent agreement can mask instability in action-relevant rankings. This hidden divergence poses a pre-deployment safety risk and motivates diagnostic benchmarks for early detection.

LGJun 5, 2025
Deciding When Not to Decide: Indeterminacy-Aware Intrusion Detection with NeutroSENSE

Eyhab Al-Masri

This paper presents NeutroSENSE, a neutrosophic-enhanced ensemble framework for interpretable intrusion detection in IoT environments. By integrating Random Forest, XGBoost, and Logistic Regression with neutrosophic logic, the system decomposes prediction confidence into truth (T), falsity (F), and indeterminacy (I) components, enabling uncertainty quantification and abstention. Predictions with high indeterminacy are flagged for review using both global and adaptive, class-specific thresholds. Evaluated on the IoT-CAD dataset, NeutroSENSE achieved 97% accuracy, while demonstrating that misclassified samples exhibit significantly higher indeterminacy (I = 0.62) than correct ones (I = 0.24). The use of indeterminacy as a proxy for uncertainty enables informed abstention and targeted review-particularly valuable in edge deployments. Figures and tables validate the correlation between I-scores and error likelihood, supporting more trustworthy, human-in-the-loop AI decisions. This work shows that neutrosophic logic enhances both accuracy and explainability, providing a practical foundation for trust-aware AI in edge and fog-based IoT security systems.

ITApr 30, 2025
Democratizing Differential Privacy: A Participatory AI Framework for Public Decision-Making

Wenjun Yang, Eyhab Al-Masri

This paper introduces a conversational interface system that enables participatory design of differentially private AI systems in public sector applications. Addressing the challenge of balancing mathematical privacy guarantees with democratic accountability, we propose three key contributions: (1) an adaptive $ε$-selection protocol leveraging TOPSIS multi-criteria decision analysis to align citizen preferences with differential privacy (DP) parameters, (2) an explainable noise-injection framework featuring real-time Mean Absolute Error (MAE) visualizations and GPT-4-powered impact analysis, and (3) an integrated legal-compliance mechanism that dynamically modulates privacy budgets based on evolving regulatory constraints. Our results advance participatory AI practices by demonstrating how conversational interfaces can enhance public engagement in algorithmic privacy mechanisms, ensuring that privacy-preserving AI in public sector governance remains both mathematically robust and democratically accountable.

ROJul 30, 2021
A Self-Adaptive IoT-based Approach for Improving the Decision Making of Active Surgical Robots in Hospitals

Alina Saduova, Eyhab Al-Masri

In recent years, surgical robots have become instrumental tools for assisting surgeons in performing complex surgical procedures in hospitals. Unlike conventional surgical methods, robotic systems help surgeons, for example, to perform minimally invasive surgical procedures while enhancing the precision and control of operations (e.g. tiny incisions, wound sutures, endoscopic suturing, among others). To this extent, it is essential to consider several factors that may influence the feasibility and decision making of employing robotic systems in surgical procedures. In this paper, we propose an IoT-based self-adaptive approach that uses multi-criteria decision analysis methods (MCDA) for enhancing the decision making of operations involving surgical robots. Throughout this paper, we present experimental validation results in utilizing MCDA as an effective strategy for enhancing the decisions of employing robotic systems in surgical procedures.