SISep 28, 2024
Public interest in science or bots? Selective amplification of scientific articles on TwitterAshiqur Rahman, Ehsan Mohammadi, Hamed Alhoori
With the remarkable capability to reach the public instantly, social media has become integral in sharing scholarly articles to measure public response. Since spamming by bots on social media can steer the conversation and present a false public interest in given research, affecting policies impacting the public's lives in the real world, this topic warrants critical study and attention. We used the Altmetric dataset in combination with data collected through the Twitter Application Programming Interface (API) and the Botometer API. We combined the data into an extensive dataset with academic articles, several features from the article and a label indicating whether the article had excessive bot activity on Twitter or not. We analyzed the data to see the possibility of bot activity based on different characteristics of the article. We also trained machine-learning models using this dataset to identify possible bot activity in any given article. Our machine-learning models were capable of identifying possible bot activity in any academic article with an accuracy of 0.70. We also found that articles related to "Health and Human Science" are more prone to bot activity compared to other research areas. Without arguing the maliciousness of the bot activity, our work presents a tool to identify the presence of bot activity in the dissemination of an academic article and creates a baseline for future research in this direction.
49.5SIMar 23
Tied In on TikTok: Tie Strength and Emotional Dynamics in Algorithmic CommunitiesCharles Bickham, Minh Duc Chu, Arianna Yuan et al.
Whether genuine communities can form on algorithmically-driven short-form video platforms like TikTok remains an open question, given that user interactions are often brief, dispersed, and difficult to trace. Building on theories of tie strength and online community formation, we examine whether eating disorder (ED) discourse on TikTok exhibits behavioral and emotional signatures of strong ties, including more frequent, reciprocal, and affectively intense interactions. In this paper, we analyze 43,040 ED-related TikTok videos and over 560,000 comments, alongside a Non-ED comparison dataset. We find that at the user-pair level, greater interaction frequency is associated with increasingly positive emotional expression, a pattern that is amplified in ED-related conversations. This trend is also reflected linguistically, with pairs that interact more frequently exhibiting more of a positive tone. At the same time, how a relationship starts matters: pairs that begin with positive exchanges usually stay mostly positive as they continue interacting, while pairs that begin negatively may add some positive exchanges over time but rarely become mostly positive. To contextualize these dynamics, we classify ED videos into three content types (Pro-Recovery, Pro-ED, and ED Experiences) and find that each exhibits distinct emotional interaction patterns. These findings suggest that dense, emotionally structured relationships can emerge within ED discourse on TikTok. More broadly, our work provides one of the first empirical demonstrations of how community-like relational dynamics form and persist on algorithmically driven short-form video platforms.
DLOct 25, 2025
Can Small and Reasoning Large Language Models Score Journal Articles for Research Quality and Do Averaging and Few-shot Help?Mike Thelwall, Ehsan Mohammadi
Assessing published academic journal articles is a common task for evaluations of departments and individuals. Whilst it is sometimes supported by citation data, Large Language Models (LLMs) may give more useful indications of article quality. Evidence of this capability exists for two of the largest LLM families, ChatGPT and Gemini, and the medium sized LLM Gemma3 27b, but it is unclear whether smaller LLMs and reasoning models have similar abilities. This is important because larger models may be slow and impractical in some situations, and reasoning models may perform differently. Four relevant questions are addressed with Gemma3 variants, Llama4 Scout, Qwen3, Magistral Small and DeepSeek R1, on a dataset of 2,780 medical, health and life science papers in 6 fields, with two different gold standards, one novel. The results suggest that smaller (open weights) and reasoning LLMs have similar performance to ChatGPT 4o-mini and Gemini 2.0 Flash, but that 1b parameters may often, and 4b sometimes, be too few. Moreover, averaging scores from multiple identical queries seems to be a universally successful strategy, and few-shot prompts (four examples) tended to help but the evidence was equivocal. Reasoning models did not have a clear advantage. Overall, the results show, for the first time, that smaller LLMs >4b, including reasoning models, have a substantial capability to score journal articles for research quality, especially if score averaging is used.
CYJun 15, 2024
Cutting through the noise to motivate people: A comprehensive analysis of COVID-19 social media posts de/motivating vaccinationAshiqur Rahman, Ehsan Mohammadi, Hamed Alhoori
The COVID-19 pandemic exposed significant weaknesses in the healthcare information system. The overwhelming volume of misinformation on social media and other socioeconomic factors created extraordinary challenges to motivate people to take proper precautions and get vaccinated. In this context, our work explored a novel direction by analyzing an extensive dataset collected over two years, identifying the topics de/motivating the public about COVID-19 vaccination. We analyzed these topics based on time, geographic location, and political orientation. We noticed that while the motivating topics remain the same over time and geographic location, the demotivating topics change rapidly. We also identified that intrinsic motivation, rather than external mandate, is more advantageous to inspire the public. This study addresses scientific communication and public motivation in social media. It can help public health officials, policymakers, and social media platforms develop more effective messaging strategies to cut through the noise of misinformation and educate the public about scientific findings.
DLSep 9, 2021
Mapping the Structure and Evolution of Software Testing Research Over the Past Three DecadesAlireza Salahirad, Gregory Gay, Ehsan Mohammadi
Background: The field of software testing is growing and rapidly-evolving. Aims: Based on keywords assigned to publications, we seek to identify predominant research topics and understand how they are connected and have evolved. Method: We apply co-word analysis to map the topology of testing research as a network where author-assigned keywords are connected by edges indicating co-occurrence in publications. Keywords are clustered based on edge density and frequency of connection. We examine the most popular keywords, summarize clusters into high-level research topics, examine how topics connect, and examine how the field is changing. Results: Testing research can be divided into 16 high-level topics and 18 subtopics. Creation guidance, automated test generation, evolution and maintenance, and test oracles have particularly strong connections to other topics, highlighting their multidisciplinary nature. Emerging keywords relate to web and mobile apps, machine learning, energy consumption, automated program repair and test generation, while emerging connections have formed between web apps, test oracles, and machine learning with many topics. Random and requirements-based testing show potential decline. Conclusions: Our observations, advice, and map data offer a deeper understanding of the field and inspiration regarding challenges and connections to explore.
DLJan 24, 2019
Readership Data and Research ImpactEhsan Mohammadi, Mike Thelwall
Reading academic publications is a key scholarly activity. Scholars accessing and recording academic publications online are producing new types of readership data. These include publisher, repository, and academic social network download statistics as well as online reference manager records. This chapter discusses the use of download and reference manager data for research evaluation and library collection development. The focus is on the validity and application of readership data as an impact indicator for academic publications across different disciplines. Mendeley is particularly promising in this regard, although all data sources are not subjected to rigorous quality control and can be manipulated.