David B. Skillicorn

h-index 33

7 papers

10 citations

Novelty 42%

AI Score 21

Ranked #183,072 of 194,257 authors (top 94%); #29,786 in CL (top 97%)

7 Papers

2.3 SI Jan 26, 2023

Detecting Pump&Dump Stock Market Manipulation from Online Forums

D. Nam, D. B. Skillicorn

The intersection of social media, low-cost trading platforms, and naive investors has created an ideal situation for information-based market manipulations, especially pump&dumps. Manipulators accumulate small-cap stocks, disseminate false information on social media to inflate their price, and sell at the peak. We collect a dataset of stocks whose price and volume profiles have the characteristic shape of a pump&dump, and social media posts for those same stocks that match the timing of the initial price rises. From these we build predictive models for pump&dump events based on the language used in the social media posts. There are multiple difficulties: not every post will cause the intended market reaction, some pump&dump events may be triggered by posts in other forums, and there may be accidental confluences of post timing and market movements. Nevertheless, our best model achieves a prediction accuracy of 85% and an F1-score of 62%. Such a tool can provide early warning to investors and regulators that a pump&dump may be underway.
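The gap between 85% accuracy and a 62% F1-score in the abstract above is typical when the positive class is rare. As a rough illustration only, the Python sketch below computes both metrics from a binary confusion matrix. The function name and the counts are hypothetical: the counts were chosen to land near the reported figures and are not taken from the paper.

```python
def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, recall and F1 for a binary confusion matrix."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against empty denominators so degenerate inputs do not raise.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts (NOT from the paper): 200 pump&dump events in 1,000 cases.
metrics = binary_metrics(tp=122, fp=72, fn=78, tn=728)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Because true negatives dominate the total, accuracy stays high even though roughly four in ten events are missed, which is why F1 is the more informative figure for a rare-event detector.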

0.5 CL Aug 10, 2020

A Bootstrapped Model to Detect Abuse and Intent in White Supremacist Corpora

B. Simons, D. B. Skillicorn

Intelligence analysts face a difficult problem: distinguishing extremist rhetoric from potential extremist violence. Many are content to express abuse against some target group, but only a few indicate a willingness to engage in violence. We address this problem by building a predictive model for intent, bootstrapping from a seed set of intent words and language templates expressing intent. We design both an n-gram and an attention-based deep learner for intent and use them as colearners to improve both the basis for prediction and the predictions themselves. They converge to stable predictions in a few rounds. We merge predictions of intent with predictions of abusive language to detect posts that indicate a desire for violent action. We validate the predictions by comparing them to crowd-sourced labelling. The methodology can be applied to other linguistic properties for which a plausible starting point can be defined.

2.9 CR Aug 6, 2020

Activity Detection from Encrypted Remote Desktop Protocol Traffic

L. Lapczyk, D. B. Skillicorn

An increasing amount of Internet traffic has its content encrypted. We address the question of whether it is possible to predict the activities taking place over an encrypted channel, in particular Microsoft's Remote Desktop Protocol. We show that the presence of five typical activities can be detected with precision greater than 97% and recall greater than 94% in 30-second traces. We also show that the design of the protocol exposes fine-grained actions such as keystrokes and mouse movements which may be leveraged to reveal properties such as lengths of passwords.

2.3 CR Sep 12, 2018

Reversing the asymmetry in data exfiltration

David Skillicorn, Xiao Li, Karen Chen

Preventing data exfiltration from computer systems typically depends on perimeter defences, but these are becoming increasingly fragile. Instead we suggest an approach in which each at-risk document is supplemented by many fake versions of itself. An attacker must either exfiltrate all of them or try to discover which is the real one while operating within the penetrated system; both are difficult. Creating and maintaining many fakes is relatively inexpensive, so the advantage that typically accrues to an attacker now lies with the defender. We show that algorithmically generated fake documents are reasonably difficult to detect using algorithmic analytics.
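The asymmetry argument can be made concrete with simple arithmetic. The sketch below assumes an idealised attacker who cannot tell fakes from the real document at all and fakes that are the same size as the original. Both are assumptions made for illustration, and the function names and numbers are hypothetical rather than taken from the paper.

```python
from fractions import Fraction


def guess_success_probability(num_fakes):
    """Chance that one uniformly random pick among the real document
    and its fakes is the real one (assumes fakes are indistinguishable)."""
    return Fraction(1, num_fakes + 1)


def exfiltration_blowup(num_fakes):
    """Factor by which data volume grows if the attacker copies every
    version instead of guessing (assumes equal-sized fakes)."""
    return num_fakes + 1


for k in (0, 9, 99):
    p = guess_success_probability(k)
    print(f"{k} fakes: P(correct guess) = {p}, "
          f"volume to take everything = {exfiltration_blowup(k)}x")
```

Under these assumptions the defender's cost grows only with the number of fakes stored, while the attacker faces either a shrinking chance of success or a proportionally larger, and therefore more conspicuous, transfer.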

0.2 CL Jun 13, 2018

Beyond Bags of Words: Inferring Systemic Nets

D. B. Skillicorn, N. Alsadhan

Textual analytics based on representations of documents as bags of words have been reasonably successful. However, analysis that requires deeper insight into language, into author properties, or into the contexts in which documents were created requires a richer representation. Systemic nets are one such representation. They have not been extensively used because they required human effort to construct. We show that systemic nets can be algorithmically inferred from corpora, that the resulting nets are plausible, and that they can provide practical benefits for knowledge discovery problems. This opens up a new class of practical analysis techniques for textual analytics.

1.6 RO Jan 15, 2018

The Design Space of Social Robots

D. B. Skillicorn, R. Billingsley, M.-A. Williams

We consider the design space available for social robots in terms of a hierarchy of functional definitions: the essential properties in terms of a locus of interaction, autonomy, intelligence, awareness of humans as possessors of mental state, and awareness of humans as social interactors. We also suggest that the emphasis on physical embodiment in some segments of the social robotics community has obscured commonalities with a class of agents that are identical in all other respects. These definitions naturally suggest research issues, directions, and possibilities which we explore. Social robotics also lacks compelling 'killer apps' which we suggest would help focus the community on a research agenda.

1.7 RO May 2, 2017

Social Robot Modelling of Human Affective State

D. B. Skillicorn, N. Alsadhan, R. Billingsley et al.

Social robots need to understand the affective state of the humans with whom they interact. Successful interactions require understanding mood and emotion in the short term, and personality and attitudes over longer periods. Social robots should also be able to infer the desires, wishes, and preferences of humans without being explicitly told. We investigate how effectively affective state can be inferred from corpora in which documents are plausible surrogates for what a robot might hear. For mood, emotions, wishes, desires, and attitudes we show highly ranked documents; for personality dimensions, estimates of ground truth are available and we report performance accuracy. The results are surprisingly strong given the limited information in short documents.