CLFeb 10Code
Conceptual Cultural Index: A Metric for Cultural Specificity via Relative GeneralityTakumi Ohashi, Hitoshi Iyatomi
Large language models (LLMs) are increasingly deployed in multicultural settings; however, systematic evaluation of cultural specificity at the sentence level remains underexplored. We propose the Conceptual Cultural Index (CCI), which estimates cultural specificity at the sentence level. CCI is defined as the difference between the generality estimate within the target culture and the average generality estimate across other cultures. This formulation enables users to operationally control the scope of culture via comparison settings and provides interpretability, since the score derives from the underlying generality estimates. We validate CCI on 400 sentences (200 culture-specific and 200 general), and the resulting score distribution exhibits the anticipated pattern: higher for culture-specific sentences and lower for general ones. For binary separability, CCI outperforms direct LLM scoring, yielding more than a 10-point improvement in AUC for models specialized to the target culture. Our code is available at https://github.com/IyatomiLab/CCI .
CLMay 1
A11y-Compressor: A Framework for Enhancing the Efficiency of GUI Agent Observations through Visual Context Reconstruction and Redundancy ReductionMichito Takeshita, Takuro Kawada, Takumi Ohashi et al.
AI agents that interact with graphical user interfaces (GUIs) require effective observation representations for reliable grounding. The accessibility tree is a commonly used text-based format that encodes UI element attributes, but it suffers from redundancy and lacks structural information such as spatial relationships among elements. We propose A11y-Compressor, a framework that transforms linearized accessibility trees into compact and structured representations. Our implementation, Compressed-a11y, applies a lightweight and structured transformation pipeline with modal detection, redundancy reduction, and semantic structuring. Experiments on the OSWorld benchmark show that Compressed-a11y reduces input tokens to 22% of the original while improving task success rates by 5.1 percentage points on average.
CLOct 12, 2024
Extended Japanese Commonsense Morality Dataset with Masked Token and Label EnhancementTakumi Ohashi, Tsubasa Nakagawa, Hitoshi Iyatomi
Rapid advancements in artificial intelligence (AI) have made it crucial to integrate moral reasoning into AI systems. However, existing models and datasets often overlook regional and cultural differences. To address this shortcoming, we have expanded the JCommonsenseMorality (JCM) dataset, the only publicly available dataset focused on Japanese morality. The Extended JCM (eJCM) has grown from the original 13,975 sentences to 31,184 sentences using our proposed sentence expansion method called Masked Token and Label Enhancement (MTLE). MTLE selectively masks important parts of sentences related to moral judgment and replaces them with alternative expressions generated by a large language model (LLM), while re-assigning appropriate labels. The model trained using our eJCM achieved an F1 score of 0.857, higher than the scores for the original JCM (0.837), ChatGPT one-shot classification (0.841), and data augmented using AugGPT, a state-of-the-art augmentation method (0.850). Specifically, in complex moral reasoning tasks unique to Japanese culture, the model trained with eJCM showed a significant improvement in performance (increasing from 0.681 to 0.756) and achieved a performance close to that of GPT-4 Turbo (0.787). These results demonstrate the validity of the eJCM dataset and the importance of developing models and datasets that consider the cultural context.
HCJan 18, 2022
The Evolution of Assistive Technology: A Literature Review of Technology Developments and ApplicationsMatteo Zallio, Takumi Ohashi
The term Assistive Technology has evolved over the years and identifies equipment or product systems, whether acquired, modified, or customized, that are used to increase, maintain, or improve functional capabilities of individuals with disabilities. Considering the advances that have been made, what trends can be identified to provide evidence of the evolution of AT as devices that foster accessibility and empower users with different abilities? Through a systematic literature review we identify research items that offer evidence of the evolution of the meaning, purpose, and applications of AT throughout the history. This paper provides evidence that AT evolved from products to improve functional capabilities of individuals with disabilities toward enabling technologies that facilitate tasks for people with different needs, abilities, gender, age, and culture. This evolution will lead to a positive demystification of the meaning and applications of AT toward broad usage acceptance among mainstream users.