Zhijian Wang

h-index4

5papers

26citations

Novelty51%

AI Score42

Ranked #86,524 of 201,326 authors (top 43%)#29,226 in CV (top 50%)

5 Papers

CVJan 23, 2025Code

mmEgoHand: Egocentric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMU

Yizhe Lv, Tingting Zhang, Zhijian Wang et al.

Recent advancements in millimeter-wave (mmWave) radar have demonstrated its potential for human action recognition and pose estimation, offering privacy-preserving advantages over conventional cameras while maintaining occlusion robustness, with promising applications in human-computer interaction and wellness care. However, existing mmWave systems typically employ fixed-position configurations, restricting user mobility to predefined zones and limiting practical deployment scenarios. We introduce mmEgoHand, a head-mounted egocentric system for hand pose estimation to support applications such as gesture recognition, VR interaction, skill digitization and assessment, and robotic teleoperation. mmEgoHand synergistically integrates mmWave radar with inertial measurement units (IMUs) to enable dynamic perception. The IMUs actively compensate for radar interference induced by head movements, while our novel end-to-end Transformer architecture simultaneously estimates 3D hand keypoint coordinates through multi-modal sensor fusion. This dual-modality framework achieves spatial-temporal alignment of mmWave heatmaps with IMUs, overcoming viewpoint instability inherent in egocentric sensing scenarios. We further demonstrate that intermediate hand pose representations substantially improve performance in downstream task, e.g., VR gesture recognition. Extensive evaluations with 10 subjects performing 8 gestures across 3 distinct postures -- standing, sitting, lying -- achieve 90.8% recognition accuracy, outperforming state-of-the-art solutions by a large margin. Dataset and code are available at https://github.com/WhisperYi/mmVR.

NAMay 11

Data-driven moving-window Bayesian inference for transient CO2-temperature network models of buildings

Zhijian Wang, Stein K. F. Stoter, Clemens V. Verhoosel et al.

In this work, we proposes a CO2-temperature network model that links multi-zone mass transport and thermal dynamics through shared latent drivers, airflow and occupancy. The thermal component is formulated as a resistance-capacitance (RC) network augmented with airflow-driven convective exchange, while the CO2 component is governed by inter-zonal convective transport. To calibrate the model and track time-varying operating conditions based on sparse sensing, we introduce a moving-window Bayesian inference procedure that jointly estimates thermal parameters, airflow and occupancy trajectories. The estimation also provides room-specific sensor noise levels, yielding posterior predictive forecasts with credible intervals. The framework is assessed using a controlled synthetic benchmark, and a scaled physical validation experiment using CO2 and temperature sensing. In both settings, the posterior accurately reconstructs trajectories within windows and delivers low forecast errors. When inference windows overlap abrupt regime transitions, the widened uncertainty bands and increased inferred noise levels provide an interpretable diagnostic of model-data mismatch, followed by rapid recovery once the new regime is observed. Overall, coupling CO2-informed airflow with thermal dynamics yields a robust approach for conductive and advective temperature prediction, supporting practical monitoring and energy-performance assessment under limited sensing.

CVNov 16, 2021

Real-time 3D human action recognition based on Hyperpoint sequence

Xing Li, Qian Huang, Zhijian Wang et al.

Real-time 3D human action recognition has broad industrial applications, such as surveillance, human-computer interaction, and healthcare monitoring. By relying on complex spatio-temporal local encoding, most existing point cloud sequence networks capture spatio-temporal local structures to recognize 3D human actions. To simplify the point cloud sequence modeling task, we propose a lightweight and effective point cloud sequence network referred to as SequentialPointNet for real-time 3D action recognition. Instead of capturing spatio-temporal local structures, SequentialPointNet encodes the temporal evolution of static appearances to recognize human actions. Firstly, we define a novel type of point data, Hyperpoint, to better describe the temporally changing human appearances. A theoretical foundation is provided to clarify the information equivalence property for converting point cloud sequences into Hyperpoint sequences. Secondly, the point cloud sequence modeling task is decomposed into a Hyperpoint embedding task and a Hyperpoint sequence modeling task. Specifically, for Hyperpoint embedding, the static point cloud technology is employed to convert point cloud sequences into Hyperpoint sequences, which introduces inherent frame-level parallelism; for Hyperpoint sequence modeling, a Hyperpoint-Mixer module is designed as the basic building block to learning the spatio-temporal features of human actions. Extensive experiments on three widely-used 3D action recognition datasets demonstrate that the proposed SequentialPointNet achieves competitive classification performance with up to 10X faster than existing approaches.

EMFeb 21, 2018

Algorithmic Collusion in Cournot Duopoly Market: Evidence from Experimental Economics

Nan Zhou, Li Zhang, Shijian Li et al.

Algorithmic collusion is an emerging concept in current artificial intelligence age. Whether algorithmic collusion is a creditable threat remains as an argument. In this paper, we propose an algorithm which can extort its human rival to collude in a Cournot duopoly competing market. In experiments, we show that, the algorithm can successfully extorted its human rival and gets higher profit in long run, meanwhile the human rival will fully collude with the algorithm. As a result, the social welfare declines rapidly and stably. Both in theory and in experiment, our work confirms that, algorithmic collusion can be a creditable threat. In application, we hope, the frameworks, the algorithm design as well as the experiment environment illustrated in this work, can be an incubator or a test bed for researchers and policymakers to handle the emerging algorithmic collusion.

AINov 16, 2017

Using experimental game theory to transit human values to ethical AI

Yijia Wang, Yan Wan, Zhijian Wang

Knowing the reflection of game theory and ethics, we develop a mathematical representation to bridge the gap between the concepts in moral philosophy (e.g., Kantian and Utilitarian) and AI ethics industry technology standard (e.g., IEEE P7000 standard series for Ethical AI). As an application, we demonstrate how human value can be obtained from the experimental game theory (e.g., trust game experiment) so as to build an ethical AI. Moreover, an approach to test the ethics (rightness or wrongness) of a given AI algorithm by using an iterated Prisoner's Dilemma Game experiment is discussed as an example. Compared with existing mathematical frameworks and testing method on AI ethics technology, the advantages of the proposed approach are analyzed.