CLJul 15, 2024Code
MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language ModelsChengguang Gan, Sunbowen Lee, Qingyu Yin et al.
The Mutual Reinforcement Effect (MRE) represents a promising avenue in information extraction and multitasking research. Nevertheless, its applicability has been constrained due to the exclusive availability of MRE mix datasets in Japanese, thereby limiting comprehensive exploration by the global research community. To address this limitation, we introduce a Multilingual MRE mix dataset (MMM) that encompasses 21 sub-datasets in English, Japanese, and Chinese. In this paper, we also propose a method for dataset translation assisted by Large Language Models (LLMs), which significantly reduces the manual annotation time required for dataset construction by leveraging LLMs to translate the original Japanese datasets. Additionally, we have enriched the dataset by incorporating open-domain Named Entity Recognition (NER) and sentence classification tasks. Utilizing this expanded dataset, we developed a unified input-output framework to train an Open-domain Information Extraction Large Language Model (OIELLM). The OIELLM model demonstrates the capability to effectively process novel MMM datasets, exhibiting significant improvements in performance. The OIELLM model and datasets is open-source in HuggingFace: https://ganchengguang.github.io/MRE/
COMay 19
Divisibility of Trace CodesHexiang Huang, Haihua Deng, Sihuang Hu
A linear code is said to be $Δ$-divisible if the Hamming weights of all its codewords are divisible by $Δ$. The $p$-adic valuation of a code is defined as the greatest integer $t$ such that the code is $p^t$-divisible. In this paper, we establish a divisibility criterion for trace codes. Specifically, this criterion provides a systematic method to determine the $p$-adic valuation of the associated trace code, thereby extending Ward's classical divisibility criterion from standard generating sets (or matrices) to generalized generator matrices over an extension field. Furthermore, we present two applications of our framework. The first application provides a concise proof of the celebrated divisibility results on abelian codes established by Delsarte and McEliece. The second application establishes several explicit lower bounds on the $p$-adic valuation of the number of solutions over $\mathbb{F}_{q^m}$ (where $q = p^e$) to the Artin-Schreier type equation $ f(x_1,\ldots,x_k)=y^q-y $. In particular, under the condition $\left(d,\frac{q^m-1}{q-1}\right)=1$, we determine the exact minimum $p$-adic valuation of the number of solutions when $f$ is restricted to homogeneous polynomials of degree $d$.
CVJul 20, 2025
Hybrid-supervised Hypergraph-enhanced Transformer for Micro-gesture Based Emotion RecognitionZhaoqiang Xia, Hexiang Huang, Haoyu Chen et al.
Micro-gestures are unconsciously performed body gestures that can convey the emotion states of humans and start to attract more research attention in the fields of human behavior understanding and affective computing as an emerging topic. However, the modeling of human emotion based on micro-gestures has not been explored sufficiently. In this work, we propose to recognize the emotion states based on the micro-gestures by reconstructing the behavior patterns with a hypergraph-enhanced Transformer in a hybrid-supervised framework. In the framework, hypergraph Transformer based encoder and decoder are separately designed by stacking the hypergraph-enhanced self-attention and multiscale temporal convolution modules. Especially, to better capture the subtle motion of micro-gestures, we construct a decoder with additional upsampling operations for a reconstruction task in a self-supervised learning manner. We further propose a hypergraph-enhanced self-attention module where the hyperedges between skeleton joints are gradually updated to present the relationships of body joints for modeling the subtle local motion. Lastly, for exploiting the relationship between the emotion states and local motion of micro-gestures, an emotion recognition head from the output of encoder is designed with a shallow architecture and learned in a supervised way. The end-to-end framework is jointly trained in a one-stage way by comprehensively utilizing self-reconstruction and supervision information. The proposed method is evaluated on two publicly available datasets, namely iMiGUE and SMG, and achieves the best performance under multiple metrics, which is superior to the existing methods.