Qi Hong

h-index3

6papers

483citations

Novelty39%

AI Score49

Ranked #44,921 of 205,806 authors (top 22%)#17,123 in CV (top 29%)

6 Papers

NAJun 23, 2018

Second-order linear structure-preserving modified finite volume schemes for the regularized long-wave equation

Qi Hong, Jialing Wang, Yuezheng Gong

In this paper, based on the weak form of the Hamiltonian formulation of the regularized long-wave equation and a novel approach of transforming the original Hamiltonian energy into a quadratic functional, a fully implicit and three linear-implicit energy conservation numerical schemes are respectively proposed. The resulting numerical schemes are proved theoretically to satisfy the energy conservation law in the discrete level. Moreover, these linear-implicit schemes are efficient in practical computation because only a linear system need to be solved at each time step. The proposed schemes are both second order accurate in time and space. Numerical experiments are presented to show all the proposed schemes have satisfactory performance in providing accurate solution and the remarkable energy-preserving property.

70.9DCMay 10Code

Split CNN Inference on Networked Microcontrollers

Junyu Lu, Shashwath Suresh, Hao Liu et al.

Running deep neural networks on microcontroller units (MCUs) is severely constrained by limited memory resources. While TinyML techniques reduce model size and computation, they often fail in practice due to excessive peak Random Access Memory (RAM) usage during inference, dominated by intermediate activations. As a result, many models remain infeasible on standalone MCUs. In this work, we present a fine-grained split inference system for networked MCUs that enables collaborative inference of Convolutional Neural Networks (CNN) models across multiple devices. Our key insight is that breaking the memory bottleneck requires splitting inference at sub-layer granularity rather than at layer boundaries. We reinterpret pre-trained models to enable kernel-wise and neuron-wise partitioning, and distribute both model parameters and intermediate activations across multiple MCUs. A lightweight, resource-aware coordinator orchestrates the inference across MCU devices with heterogeneous resources. We implement the proposed system on a real testbed and evaluate it on up to 8 MCUs using MobileNetV2, a representative CNN model. Our experimental results show that CNN models infeasible on a single MCU can be executed across networked MCUs, reducing the per-MCU peak RAM usage while maintaining the practical end-to-end inference latency. All the source code of this work can be found here: https://github.com/shashsuresh/split-inference-on-MCUs.

73.0CVApr 29

TRIP-Evaluate: An Open Multimodal Benchmark for Evaluating Large Models in Transportation

Han Gong, Zhen Zhou, Yunyang Shi et al.

Large language models (LLMs) and multimodal large models (MLLMs) are increasingly used for transportation tasks such as regulation question answering, traffic management support, engineering review, and autonomous-driving scene reasoning. Yet transportation workflows are rule-intensive, computation-intensive, safety-critical, and inherently multimodal. Existing general benchmarks provide limited evidence of whether a model can apply regulations correctly, perform verifiable engineering calculations, or interpret traffic scenes reliably, while the small number of public transportation benchmarks remain narrow in scope and rarely support fine-grained diagnosis across text, images, and point-cloud data. To address this gap, we present TRIP-Evaluate, an open multimodal benchmark for large models in transportation. The benchmark organizes 837 items using a role-task-knowledge taxonomy that covers vehicle, traffic-management, traveler, and planning-and-design functions. Each item is annotated with capability, modality, and difficulty labels, enabling diagnosis from overall accuracy down to specific failure modes. The current release includes 596 text items, 198 image items, and 43 point-cloud items. TRIP-Evaluate also standardizes item construction, quality control, prompting, decoding, and scoring to improve cross-model comparability. Results on a diverse panel of models show that text-based performance is improving, but substantial weaknesses remain in multi-step engineering calculation, rule-constrained reasoning, multimodal scene understanding, and point-cloud understanding. Overall, TRIP-Evaluate provides a reproducible, diagnosable, and engineering-aligned evaluation baseline for model selection, regression testing, and safer deployment in transportation applications.

CVMar 20, 2020Code

Masked Face Recognition Dataset and Application

Zhongyuan Wang, Guangcheng Wang, Baojin Huang et al.

In order to effectively prevent the spread of COVID-19 virus, almost everyone wears a mask during coronavirus epidemic. This almost makes conventional facial recognition technology ineffective in many cases, such as community access control, face access control, facial attendance, facial security checks at train stations, etc. Therefore, it is very urgent to improve the recognition performance of the existing face recognition technology on the masked faces. Most current advanced face recognition approaches are designed based on deep learning, which depend on a large number of face samples. However, at present, there are no publicly available masked face recognition datasets. To this end, this work proposes three types of masked face datasets, including Masked Face Detection Dataset (MFDD), Real-world Masked Face Recognition Dataset (RMFRD) and Simulated Masked Face Recognition Dataset (SMFRD). Among them, to the best of our knowledge, RMFRD is currently theworld's largest real-world masked face dataset. These datasets are freely available to industry and academia, based on which various applications on masked faces can be developed. The multi-granularity masked face recognition model we developed achieves 95% accuracy, exceeding the results reported by the industry. Our datasets are available at: https://github.com/X-zhangyang/Real-World-Masked-Face-Dataset.

LGFeb 4

Let Experts Feel Uncertainty: A Multi-Expert Label Distribution Approach to Probabilistic Time Series Forecasting

Zhen Zhou, Zhirui Wang, Qi Hong et al.

Time series forecasting in real-world applications requires both high predictive accuracy and interpretable uncertainty quantification. Traditional point prediction methods often fail to capture the inherent uncertainty in time series data, while existing probabilistic approaches struggle to balance computational efficiency with interpretability. We propose a novel Multi-Expert Learning Distributional Labels (LDL) framework that addresses these challenges through mixture-of-experts architectures with distributional learning capabilities. Our approach introduces two complementary methods: (1) Multi-Expert LDL, which employs multiple experts with different learned parameters to capture diverse temporal patterns, and (2) Pattern-Aware LDL-MoE, which explicitly decomposes time series into interpretable components (trend, seasonality, changepoints, volatility) through specialized sub-experts. Both frameworks extend traditional point prediction to distributional learning, enabling rich uncertainty quantification through Maximum Mean Discrepancy (MMD). We evaluate our methods on aggregated sales data derived from the M5 dataset, demonstrating superior performance compared to baseline approaches. The continuous Multi-Expert LDL achieves the best overall performance, while the Pattern-Aware LDL-MoE provides enhanced interpretability through component-wise analysis. Our frameworks successfully balance predictive accuracy with interpretability, making them suitable for real-world forecasting applications where both performance and actionable insights are crucial.

NASep 12, 2018

Optimal error estimate of two linear and momentum-preserving Fourier pseudo-spectral schemes for the RLW equation

Qi Hong, Yushun Wang, Yuezheng Gong

In this paper, two novel linear-implicit and momentum-preserving Fourier pseudo-spectral schemes are proposed and analyzed for the regularized long-wave equation. The numerical methods are based on the blend of the Fourier pseudo-spectral method in space and the linear-implicit Crank-Nicolson method or the leap-frog scheme in time. The two fully discrete linear schemes are shown to possess the discrete momentum conservation law, and the linear systems resulting from the schemes are proved uniquely solvable. Due to the momentum conservative property of the proposed schemes, the Fourier pseudo-spectral solution is proved to be bounded in the discrete $L^{\infty}$ norm. Then by using the standard energy method, both the linear-implicit Crank-Nicolson momentum-preserving scheme and the linear-implicit leap-frog momentum-preserving scheme are shown to have the accuracy of $\mathcal{O}(τ^2+N^{-r})$ in the discrete $L^{\infty}$ norm without any restrictions on the grid ratio, where $N$ is the number of nodes and $τ$ is the time step size. Numerical examples are carried out to verify the correction of the theory analysis and the efficiency of the proposed schemes.