CV · Aug 10, 2023 · Code
Recognizing Handwritten Mathematical Expressions of Vertical Addition and Subtraction
Daniel Rosa, Filipe R. Cordeiro, Ruan Carvalho et al.
Handwritten Mathematical Expression Recognition (HMER) is a challenging task with many educational applications. Recent methods for HMER have been developed for complex mathematical expressions in standard horizontal format. However, solutions for elementary mathematical expressions, such as vertical addition and subtraction, have not been explored in the literature. This work proposes a new handwritten elementary mathematical expression dataset composed of addition and subtraction expressions in a vertical format. We also extended the MNIST dataset to generate artificial images with this structure. Furthermore, we proposed a solution for offline HMER that is able to recognize vertical addition and subtraction expressions. Our analysis evaluated the object detection algorithms YOLO v7, YOLO v8, YOLO-NAS, NanoDet and FCOS for identifying the mathematical symbols. We also proposed a transcription method that maps the bounding boxes from the object detection stage to a mathematical expression in LaTeX markup. Results show that our approach is efficient, achieving a high expression recognition rate. The code and dataset are available at https://github.com/Danielgol/HME-VAS
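To make the transcription idea concrete, the following is a minimal sketch of how detected bounding boxes might be turned into LaTeX. It is not the authors' method: the box tuple layout, the `line` label for the horizontal rule, the `tol` threshold, and the function names `group_rows` and `to_latex` are all assumptions made for illustration. Boxes are clustered into rows by vertical centre, each row is sorted left to right, and the rows are emitted as a right-aligned `array`.

```python
from typing import List, Tuple

# (label, x_min, y_min, x_max, y_max); this layout is an assumption.
Box = Tuple[str, float, float, float, float]


def group_rows(boxes: List[Box], tol: float = 0.5) -> List[List[Box]]:
    """Cluster boxes into rows by vertical centre, then sort each row by x."""
    rows: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: (b[2] + b[4]) / 2):
        cy = (box[2] + box[4]) / 2
        height = box[4] - box[2]
        if rows:
            last = rows[-1]
            last_cy = sum((b[2] + b[4]) / 2 for b in last) / len(last)
            # Same row if the centres differ by at most tol * box height.
            if abs(cy - last_cy) <= tol * height:
                last.append(box)
                continue
        rows.append([box])
    return [sorted(row, key=lambda b: b[1]) for row in rows]


def to_latex(boxes: List[Box]) -> str:
    """Emit a right-aligned array; a lone 'line' box becomes \\hline."""
    parts = []
    for row in group_rows(boxes):
        if len(row) == 1 and row[0][0] == "line":  # hypothetical label
            parts.append(r"\hline")
        else:
            parts.append("".join(b[0] for b in row) + r" \\")
    return r"\begin{array}{r} " + " ".join(parts) + r" \end{array}"


# Hypothetical detections for 12 + 34 = 46 written vertically.
example = [
    ("1", 20, 0, 30, 20), ("2", 32, 0, 42, 20),
    ("+", 0, 25, 10, 45), ("3", 20, 25, 30, 45), ("4", 32, 26, 42, 46),
    ("line", 0, 50, 45, 52),
    ("4", 20, 56, 30, 76), ("6", 32, 56, 42, 76),
]
print(to_latex(example))
```

A real pipeline would also need to handle carry and borrow marks, overlapping detections, and confidence thresholds, none of which this sketch attempts.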
HC · Jun 19, 2025
Can GPT-4o Evaluate Usability Like Human Experts? A Comparative Study on Issue Identification in Heuristic Evaluation
Guilherme Guerino, Luiz Rodrigues, Bruna Capeleti et al.
Heuristic evaluation is a widely used method in Human-Computer Interaction (HCI) to inspect interfaces and identify issues based on heuristics. Recently, Large Language Models (LLMs), such as GPT-4o, have been applied in HCI to assist in persona creation, the ideation process, and the analysis of semi-structured interviews. However, given the need to understand heuristics and the high degree of abstraction required to evaluate them, LLMs may have difficulty conducting heuristic evaluations. Moreover, prior research has not investigated how GPT-4o's performance in heuristic evaluation of web-based systems compares to that of HCI experts. In this context, this study aims to compare the results of a heuristic evaluation performed by GPT-4o and human experts. To this end, we selected a set of screenshots from a web system and asked GPT-4o to perform a heuristic evaluation based on Nielsen's Heuristics from a literature-grounded prompt. Our results indicate that only 21.2% of the issues identified by human experts were also identified by GPT-4o, although it found 27 new issues. We also found that GPT-4o performed better for heuristics related to aesthetic and minimalist design and match between system and real world, whereas it had difficulty identifying issues in heuristics related to flexibility, control, and user efficiency. Additionally, we noticed that GPT-4o generated several false positives due to hallucinations and attempts to predict issues. Finally, we highlight five takeaways for the conscious use of GPT-4o in heuristic evaluations.
CL · Mar 29
Understanding Teacher Revisions of Large Language Model-Generated Feedback
Conrad Borchers, Luiz Rodrigues, Newarney Torrezão da Costa et al.
Large language models (LLMs) increasingly generate formative feedback for students, yet little is known about how teachers revise this feedback before it reaches learners. Teachers' revisions shape what students receive, making revision practices central to evaluating AI classroom tools. We analyze a dataset of 1,349 instances of AI-generated feedback and corresponding teacher-edited explanations from 117 teachers. We examine (i) textual characteristics associated with teacher revisions, (ii) whether revision decisions can be predicted from the AI feedback text, and (iii) how revisions change the pedagogical type of feedback delivered. First, we find that teachers accept AI feedback without modification in about 80% of cases, while the AI feedback that teachers do edit tends to be significantly longer and is subsequently shortened. Editing behavior varies substantially across teachers: about 50% never edit AI feedback, and only about 10% edit more than two-thirds of feedback instances. Second, machine learning models trained only on the AI feedback text as input features, using sentence embeddings, achieve fair performance in identifying which feedback will be revised (AUC=0.75). Third, qualitative coding shows that when revisions occur, teachers often simplify AI-generated feedback, shifting it away from high-information explanations toward more concise, corrective forms. Together, these findings characterize how teachers engage with AI-generated feedback in practice and highlight opportunities to design feedback systems that better align with teacher priorities while reducing unnecessary editing effort.
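As a reading aid for the reported AUC, the sketch below computes the area under the ROC curve from its pairwise definition: the probability that a randomly chosen revised instance receives a higher predicted score than a randomly chosen unrevised one, with ties counted as half. The function name `roc_auc` and the toy labels and scores are hypothetical and are not drawn from the study's data.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise (Mann-Whitney) AUC: P(score of positive > score of negative)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    # Count a win as 1, a tie as 0.5, a loss as 0 over all (pos, neg) pairs.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data: 1 = feedback was revised, 0 = accepted unchanged (made up).
labels = [1, 0, 1, 0, 0]
scores = [0.9, 0.4, 0.35, 0.2, 0.8]
print(roc_auc(labels, scores))  # 4 of the 6 pairs are ranked correctly
```

Under this definition, an AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, which is why 0.75 is described as fair rather than strong.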
HC · Jul 31, 2025
A Mixed User-Centered Approach to Enable Augmented Intelligence in Intelligent Tutoring Systems: The Case of MathAIde app
Guilherme Guerino, Luiz Rodrigues, Luana Bianchini et al.
This study explores the integration of Augmented Intelligence (AuI) in Intelligent Tutoring Systems (ITS) to address challenges in Artificial Intelligence in Education (AIED), including teacher involvement, AI reliability, and resource accessibility. We present MathAIde, an ITS that uses computer vision and AI to correct mathematics exercises from student work photos and provide feedback. The system was designed through a collaborative process involving brainstorming with teachers, high-fidelity prototyping, A/B testing, and a real-world case study. Findings emphasize the importance of a teacher-centered, user-driven approach, where AI suggests remediation alternatives while teachers retain decision-making. Results highlight efficiency, usability, and adoption potential in classroom contexts, particularly in resource-limited environments. The study contributes practical insights into designing ITSs that balance user needs and technological feasibility, while advancing AIED research by demonstrating the effectiveness of a mixed-methods, user-centered approach to implementing AuI in educational technologies.
CL · Jul 11, 2025
Enhancing Essay Cohesion Assessment: A Novel Item Response Theory Approach
Bruno Alexandre Rosa, Hilário Oliveira, Luiz Rodrigues et al.
Essays are considered a valuable mechanism for evaluating learning outcomes in writing. Textual cohesion is an essential characteristic of a text, as it facilitates the establishment of meaning between its parts. Automatically scoring cohesion in essays presents a challenge in the field of educational artificial intelligence. The machine learning algorithms used to evaluate texts generally do not consider the individual characteristics of the instances that comprise the analysed corpus. In this sense, item response theory can be adapted to the context of machine learning, characterising the ability, difficulty and discrimination of the models used. This work proposes and analyses the performance of a cohesion score prediction approach based on item response theory to adjust the scores generated by machine learning models. In this study, the corpus selected for the experiments consisted of the extended Essay-BR, which includes 6,563 essays in the style of the National High School Exam (ENEM), and the Brazilian Portuguese Narrative Essays, comprising 1,235 essays written by 5th to 9th grade students from public schools. We extracted 325 linguistic features and treated the problem as a machine learning regression task. The experimental results indicate that the proposed approach outperforms conventional machine learning models and ensemble methods in several evaluation metrics. This research explores a potential approach for improving the automatic evaluation of cohesion in educational essays.
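For readers unfamiliar with the terms ability, difficulty and discrimination, the sketch below shows the standard two-parameter logistic (2PL) item response function. This abstract does not state which IRT model the authors use or how they map models and essays onto respondents and items, so treating a model as the respondent and an essay as the item is an assumption here, as are the function and parameter names.

```python
import math


def p_correct(theta: float, a: float, b: float) -> float:
    """Standard 2PL item response function.

    theta: ability of the respondent (assumed here to be an ML model)
    a:     discrimination of the item (assumed here to be an essay)
    b:     difficulty of the item
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# When ability equals difficulty, the success probability is exactly 0.5.
print(p_correct(theta=0.0, a=1.0, b=0.0))
# A higher-ability respondent is more likely to succeed on the same item.
print(p_correct(theta=1.0, a=1.5, b=0.0) > p_correct(theta=-1.0, a=1.5, b=0.0))
```

Larger values of `a` make the curve steeper around `b`, so such an item separates respondents just below and just above its difficulty more sharply.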
HC · Jun 18, 2021
Does gamification affect flow experience? A systematic literature review
Wilk Oliveira, Olena Pastushenko, Luiz Rodrigues et al.
In recent years, studies in different areas have used gamification to improve users' flow experience. However, due to the wide variety of studies conducted and the lack of secondary studies (e.g., systematic literature reviews) in this field, it is difficult to grasp the state of the art of this research domain. To address this problem, we conducted a systematic literature review to identify i) which gamification design methods have been used in the studies about gamification and Flow Theory, ii) which gamification elements have been used in these studies, iii) which methods have been used to evaluate the users' flow experience in gamified settings, and iv) how gamification affects users' flow experience. The main results show that there is growing interest in this field, as the number of publications is increasing. The most significant interest is in the area of gamification in education. However, there is no unanimity regarding the preferred study method or the effects of gamification on users' experience. Our results highlight the importance of conducting new experimental studies investigating how gamification affects the users' flow experience in different gamified settings, applications and domains.
HC · Jan 14, 2021
Automating Gamification Personalization: To the User and Beyond
Luiz Rodrigues, Armando M. Toda, Wilk Oliveira et al.
Personalized gamification explores knowledge about the users to tailor gamification designs and thereby improve on one-size-fits-all gamification. The tailoring process should simultaneously consider user and contextual characteristics (e.g., activity to be done and geographic location), which leads to several occasions to tailor. Consequently, tools for automating gamification personalization are needed. Two problems emerge: which of those characteristics are relevant and how to perform such tailoring remain open questions, and the required automation tools are lacking. We tackled these problems in two steps. First, we conducted an exploratory study, collecting participants' opinions on the game elements they consider the most useful for different learning activity types (LAT) via survey. Then, we modeled opinions through conditional decision trees to address the aforementioned tailoring process. Second, as a product of the first step, we implemented a recommender system that suggests personalized gamification designs (which game elements to use), addressing the problem of automating gamification personalization. Our findings i) present empirical evidence that LAT, geographic locations, and other user characteristics affect users' preferences, ii) enable defining gamification designs tailored to user and contextual features simultaneously, and iii) provide technological aid for those interested in designing personalized gamification. The main implications are that demographics, game-related characteristics, geographic location, and the LAT to be done, as well as the interaction between different kinds of information (user and contextual characteristics), should be considered in defining gamification designs and that personalizing gamification designs can be improved with aid from our recommender system.
HC · Aug 12, 2020
Validating the Effectiveness of Data-Driven Gamification Recommendations: An Exploratory Study
Armando Toda, Paula Palomino, Luiz Rodrigues et al.
Gamification design has benefited from data-driven approaches to creating strategies based on students' characteristics. However, these strategies need further validation to verify their effectiveness in e-learning environments. The exploratory study presented in this paper thus aims at verifying how data-driven gamified strategies are perceived by the students, i.e., the users of e-learning environments. In this study, we conducted a survey presenting 25 predefined strategies, based on a previous study, to students and analysed each strategy's perceived relevance, instantiated in an e-learning environment. Our results show that students perceive Acknowledgement, Objective and Progression as important elements in a gamified e-learning environment. We also provide new insights about existing elements and design recommendations for domain specialists.
HC · Aug 12, 2020
Analysing gamification elements in educational environments using an existing Gamification taxonomy
Armando M. Toda, Ana C. T. Klock, Wilk Oliveira et al.
Gamification has been widely employed in the educational domain over the past eight years, since the term became a trend. However, the literature states that gamification still lacks formal definitions to support the design and analysis of gamified strategies. This paper analysed the game elements employed in gamified learning environments through a previously proposed and evaluated taxonomy, while describing that taxonomy in depth and expanding it. The extension of the proposed taxonomy that results from this process is divided into five dimensions related to the learner and the learning environment. Our main contribution is the detailed taxonomy, which can be used to design and evaluate gamification in learning environments.