Sebastian Stein

h-index23

10papers

54citations

Novelty37%

AI Score37

Ranked #90,085 of 194,257 authors (top 46%)#686 in HC (top 27%)

10 Papers

5.1MAOct 5, 2022

From Intelligent Agents to Trustworthy Human-Centred Multiagent Systems

Mohammad Divband Soorati, Enrico H. Gerding, Enrico Marchioni et al.

The Agents, Interaction and Complexity research group at the University of Southampton has a long track record of research in multiagent systems (MAS). We have made substantial scientific contributions across learning in MAS, game-theoretic techniques for coordinating agent systems, and formal methods for representation and reasoning. We highlight key results achieved by the group and elaborate on recent work and open research challenges in developing trustworthy autonomous systems and deploying human-centred AI systems that aim to support societal good.

6.1NAMay 23

Optimal Network Pricing for Oblivious Users under Projected Decision-Dependent Distributions

Yixuan Li, Andersen Ang, Sebastian Stein

Efficient large-scale network allocation requires pricing mechanisms that internalize the stochastic and non-linear dynamics of user behavior. Moving beyond classical models of strategic agents, we introduce an Optimal Network Pricing (ONP) problem for ``oblivious'' users. This shift introduces a Decision-Dependent (DD) environment where pricing decisions endogenously shift the flow demand distribution. A key novelty of our model is the incorporation of a projection operator, creating a nonsmooth optimization landscape. We demonstrate that Performative Stability (PS) fails in ONP, degenerating to a trivial solution. Instead, we prove that the expected objective admits a unique global optimum, termed the Projected Performative Optimum (ΠPO). To overcome the algorithmic challenges, we propose a rigorous framework combining Sample Average Approximation (SAA) with a Trust-Region Sequential Quadratic Programming (TR-SQP) solver. Our method targets ΠPO by explicitly modeling the nonsmooth Jacobian, effectively handling saturation constraints. We establish theoretical guarantees for probabilistic convexity and sample complexity, and exploit network sparsity to reduce per-iteration computational complexity to near-linear in the number of routes. Experimental validation on the classic Braess network and large-scale real-world topologies demonstrates that our ΠPO-targeting solver significantly outperforms PS-seeking heuristics and our proposed baseline. The results highlight that properly accounting for the ``gating'' effects of capacity unlocks substantial gains in social welfare, providing a robust foundation for network pricing.

2.3AIFeb 18, 2024Code

Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing

Tesfay Zemuy Gebrekidan, Sebastian Stein, Timothy J. Norman

Recently, there has been an explosion of mobile applications that perform computationally intensive tasks such as video streaming, data mining, virtual reality, augmented reality, image processing, video processing, face recognition, and online gaming. However, user devices (UDs), such as tablets and smartphones, have a limited ability to perform the computation needs of the tasks. Mobile edge computing (MEC) has emerged as a promising technology to meet the increasing computing demands of UDs. Task offloading in MEC is a strategy that meets the demands of UDs by distributing tasks between UDs and MEC servers. Deep reinforcement learning (DRL) is gaining attention in task-offloading problems because it can adapt to dynamic changes and minimize online computational complexity. However, the various types of continuous and discrete resource constraints on UDs and MEC servers pose challenges to the design of an efficient DRL-based task-offloading strategy. Existing DRL-based task-offloading algorithms focus on the constraints of the UDs, assuming the availability of enough storage resources on the server. Moreover, existing multiagent DRL (MADRL)--based task-offloading algorithms are homogeneous agents and consider homogeneous constraints as a penalty in their reward function. We proposed a novel combinatorial client-master MADRL (CCM\_MADRL) algorithm for task offloading in MEC (CCM\_MADRL\_MEC) that enables UDs to decide their resource requirements and the server to make a combinatorial decision based on the requirements of the UDs. CCM\_MADRL\_MEC is the first MADRL in task offloading to consider server storage capacity in addition to the constraints in the UDs. By taking advantage of the combinatorial action selection, CCM\_MADRL\_MEC has shown superior convergence over existing MADDPG and heuristic algorithms.

4.3CYNov 29, 2024

Responsible AI Governance: A Response to UN Interim Report on Governing AI for Humanity

Sarah Kiden, Bernd Stahl, Beverley Townsend et al.

This report presents a comprehensive response to the United Nation's Interim Report on Governing Artificial Intelligence (AI) for Humanity. It emphasizes the transformative potential of AI in achieving the Sustainable Development Goals (SDGs) while acknowledging the need for robust governance to mitigate associated risks. The response highlights opportunities for promoting equitable, secure, and inclusive AI ecosystems, which should be supported by investments in infrastructure and multi-stakeholder collaborations across jurisdictions. It also underscores challenges, including societal inequalities exacerbated by AI, ethical concerns, and environmental impacts. Recommendations advocate for legally binding norms, transparency, and multi-layered data governance models, alongside fostering AI literacy and capacity-building initiatives. Internationally, the report calls for harmonising AI governance frameworks with established laws, human rights standards, and regulatory approaches. The report concludes with actionable principles for fostering responsible AI governance through collaboration among governments, industry, academia, and civil society, ensuring the development of AI aligns with universal human values and the public good.

8.5IRJan 20, 2025

TutorLLM: Customizing Learning Recommendations with Knowledge Tracing and Retrieval-Augmented Generation

Zhaoxing Li, Vahid Yazdanpanah, Jindi Wang et al.

The integration of AI in education offers significant potential to enhance learning efficiency. Large Language Models (LLMs), such as ChatGPT, Gemini, and Llama, allow students to query a wide range of topics, providing unprecedented flexibility. However, LLMs face challenges, such as handling varying content relevance and lack of personalization. To address these challenges, we propose TutorLLM, a personalized learning recommender LLM system based on Knowledge Tracing (KT) and Retrieval-Augmented Generation (RAG). The novelty of TutorLLM lies in its unique combination of KT and RAG techniques with LLMs, which enables dynamic retrieval of context-specific knowledge and provides personalized learning recommendations based on the student's personal learning state. Specifically, this integration allows TutorLLM to tailor responses based on individual learning states predicted by the Multi-Features with Latent Relations BERT-based KT (MLFBK) model and to enhance response accuracy with a Scraper model. The evaluation includes user assessment questionnaires and performance metrics, demonstrating a 10% improvement in user satisfaction and a 5\% increase in quiz scores compared to using general LLMs alone.

2.9HCJan 10, 2022

Does Interacting Help Users Better Understand the Structure of Probabilistic Models?

Evdoxia Taka, Sebastian Stein, John H. Williamson

Despite growing interest in probabilistic modeling approaches and availability of learning tools, people with no or less statistical background feel hesitant to use them. There is need for tools for communicating probabilistic models to less experienced users more intuitively to help them build, validate, use effectively or trust probabilistic models. Users' comprehension of probabilistic models is vital in these cases and interactive visualizations could enhance it. Although there are various studies evaluating interactivity in Bayesian reasoning and available tools for visualizing the sample-based distributions, we focus specifically on evaluating the effect of interaction on users' comprehension of probabilistic models' structure. We conducted a user study based on our Interactive Pair Plot for visualizing models' distribution and conditioning the sample space graphically. Our results suggest that improvements in the understanding of the interaction group are most pronounced for more exotic structures, such as hierarchical models or unfamiliar parameterizations in comparison to the static group. As the detail of the inferred information increases, interaction does not lead to considerably longer response times. Finally, interaction improves users' confidence.

2.9HCJan 10, 2022

Evaluating Bayesian Model Visualisations

Sebastian Stein, John H. Williamson

Probabilistic models inform an increasingly broad range of business and policy decisions ultimately made by people. Recent algorithmic, computational, and software framework development progress facilitate the proliferation of Bayesian probabilistic models, which characterise unobserved parameters by their joint distribution instead of point estimates. While they can empower decision makers to explore complex queries and to perform what-if-style conditioning in theory, suitable visualisations and interactive tools are needed to maximise users' comprehension and rational decision making under uncertainty. In this paper, propose a protocol for quantitative evaluation of Bayesian model visualisations and introduce a software framework implementing this protocol to support standardisation in evaluation practice and facilitate reproducibility. We illustrate the evaluation and analysis workflow on a user study that explores whether making Boxplots and Hypothetical Outcome Plots interactive can increase comprehension or rationality and conclude with design guidelines for researchers looking to conduct similar studies in the future.

1.0LGMar 2, 2019

neuralRank: Searching and ranking ANN-based model repositories

Nirmit Desai, Linsong Chu, Raghu K. Ganti et al.

Widespread applications of deep learning have led to a plethora of pre-trained neural network models for common tasks. Such models are often adapted from other models via transfer learning. The models may have varying training sets, training algorithms, network architectures, and hyper-parameters. For a given application, what isthe most suitable model in a model repository? This is a critical question for practical deployments but it has not received much attention. This paper introduces the novel problem of searching and ranking models based on suitability relative to a target dataset and proposes a ranking algorithm called \textit{neuralRank}. The key idea behind this algorithm is to base model suitability on the discriminating power of a model, using a novel metric to measure it. With experimental results on the MNIST, Fashion, and CIFAR10 datasets, we demonstrate that (1) neuralRank is independent of the domain, the training set, or the network architecture and (2) that the models ranked highly by neuralRank ranking tend to have higher model accuracy in practice.

8.0MAFeb 6, 2018

On the Preliminary Investigation of Selfish Mining Strategy with Multiple Selfish Miners

Tin Leelavimolsilp, Long Tran-Thanh, Sebastian Stein

Eyal and Sirer's selfish mining strategy has demonstrated that Bitcoin system is not secure even if 50% of total mining power is held by altruistic miners. Since then, researchers have been investigating either to improve the efficiency of selfish mining, or how to defend against it, typically in a single selfish miner setting. Yet there is no research on a selfish mining strategies concurrently used by multiple miners in the system. The effectiveness of such selfish mining strategies and their required mining power under such multiple selfish miners setting remains unknown. In this paper, a preliminary investigation and our findings of selfish mining strategy used by multiple miners are reported. In addition, the conventional model of Bitcoin system is slightly redesigned to tackle its shortcoming: namely, a concurrency of individual mining processes. Although a theoretical analysis of selfish mining strategy under this setting is yet to be established, the current findings based on simulations is promising and of great interest. In particular, our work shows that a lower bound of power threshold required for selfish mining strategy decreases in proportion to a number of selfish miners. Moreover, there exist Nash equilibria where all selfish miners in the system do not change to an honest mining strategy and simultaneously earn their unfair amount of mining reward given that they equally possess sufficiently large mining power. Lastly, our new model yields a power threshold for mounting selfish mining strategy slightly greater than one from the conventional model.

3.5HCSep 5, 2016

Incentive Engineering Framework for Crowdsourcing Systems

Nhat V. Q. Truong, Sebastian Stein, Long Tran-Thanh et al.

Significant effort has been made to understand user motivation and to elicit user participation in crowdsourcing systems. However, incentive engineering, i.e., designing incentives that can purposefully motivate users, is still an open question and remains one of the key challenges of crowdsourcing initiatives. In this work in progress, we propose a general and systematic incentive engineering framework that system designers can use to implement appropriate incentives in order to effect desirable user behaviours.