Bo Gong

RO
h-index18
6papers
37citations
Novelty47%
AI Score52

6 Papers

50.7NAApr 16
Spurious-mode-free finite element method for scattering resonances in transmission problems

Bo Gong, Jiguang Sun

Scattering resonances arise in wave phenomena and play an important role in many applications. While extensive theoretical studies have been conducted, effective numerical computation remains limited, and most existing methods suffer from spurious modes. In this paper, we propose a spurious-mode-free method for computing scattering resonances in transmission problems. The unbounded domain is truncated using a Dirichlet-to-Neumann (DtN) map. The resonances are formulated as eigenvalues of a holomorphic Fredholm operator function, which is discretized by the finite element method. The spectrum indicator method is then used to compute the eigenvalues of the nonlinear matrix eigenvalue problems. We establish optimal order convergence and present extensive examples that demonstrate the effectiveness of the proposed method. The results are consistent with existing theoretical findings in the literature and offer new insights that may inform further theoretical developments.

90.4NAMar 16
Edge element DtN method for electromagnetic scattering poles of perfectly conducting obstacles

Bo Gong, Takumi Sato, Jiguang Sun et al.

Meromorphic continuation of the scattering operator leads to scattering poles (resonances) in the complex plane. Despite their significance, numerical investigation of scattering poles remains limited. In this paper, we propose and analyze a numerical method to compute electromagnetic poles of perfectly conducting obstacles. The unbounded domain for the scattering problem is truncated using the DtN mapping and the poles are shown to be the eigenvalues of a holomorphic Fredholm operator function related to Maxwell's equations. Edge elements are used for discretization. The convergence is proved using the abstract approximation theory for eigenvalue problems of holomorphic Fredholm operator functions. The proposed finite element DtN approach is free of non-physical poles. A spectral indicator method is then employed to compute the resulting nonlinear matrix eigenvalue problem. Numerical examples are presented to demonstrate the effectiveness of the method.

CLSep 15, 2025Code
Fun-ASR Technical Report

Keyu An, Yanni Chen, Chong Deng et al.

In recent years, automatic speech recognition (ASR) has witnessed transformative advancements driven by three complementary paradigms: data scaling, model size scaling, and deep integration with large language models (LLMs). However, LLMs are prone to hallucination, which can significantly degrade user experience in real-world ASR applications. In this paper, we present Fun-ASR, a large-scale, LLM-based ASR system that synergistically combines massive data, large model capacity, LLM integration, and reinforcement learning to achieve state-of-the-art performance across diverse and complex speech recognition scenarios. Moreover, Fun-ASR is specifically optimized for practical deployment, with enhancements in streaming capability, noise robustness, code-switching, hotword customization, and satisfying other real-world application requirements. Experimental results show that while most LLM-based ASR systems achieve strong performance on open-source benchmarks, they often underperform on real industry evaluation sets. Thanks to production-oriented optimizations, Fun-ASR achieves state-of-the-art performance on real application datasets, demonstrating its effectiveness and robustness in practical settings.

35.5ROApr 16
Momentum-constrained Hybrid Heuristic Trajectory Optimization Framework with Residual-enhanced DRL for Visually Impaired Scenarios

Yuting Zeng, Zhiwen Zheng, Jingya Wang et al.

Safe and efficient assistive planning for visually impaired scenarios remains challenging, since existing methods struggle with multi-objective optimization, generalization, and interpretability. In response, this paper proposes a Momentum-Constrained Hybrid Heuristic Trajectory Optimization Framework (MHHTOF). To balance multiple objectives of comfort and safety, the framework designs a Heuristic Trajectory Sampling Cluster (HTSC) with a Momentum-Constrained Trajectory Optimization (MTO), which suppresses abrupt velocity and acceleration changes. In addition, a novel residual-enhanced deep reinforcement learning (DRL) module refines candidate trajectories, advancing temporal modeling and policy generalization. Finally, a dual-stage cost modeling mechanism (DCMM) is introduced to regulate optimization, where costs in the Frenet space ensure consistency, and reward-driven adaptive weights in the Cartesian space integrate user preferences for interpretability and user-centric decision-making. Experimental results show that the proposed framework converges in nearly half the iterations of baselines and achieves lower and more stable costs. In complex dynamic scenarios, MHHTOF further demonstrates stable velocity and acceleration curves with reduced risk, confirming its advantages in robustness, safety, and efficiency.

ROSep 19, 2025
Momentum-constrained Hybrid Heuristic Trajectory Optimization Framework with Residual-enhanced DRL for Visually Impaired Scenarios

Yuting Zeng, Zhiwen Zheng, You Zhou et al.

This paper proposes a momentum-constrained hybrid heuristic trajectory optimization framework (MHHTOF) tailored for assistive navigation in visually impaired scenarios, integrating trajectory sampling generation, optimization and evaluation with residual-enhanced deep reinforcement learning (DRL). In the first stage, heuristic trajectory sampling cluster (HTSC) is generated in the Frenet coordinate system using third-order interpolation with fifth-order polynomials and momentum-constrained trajectory optimization (MTO) constraints to ensure smoothness and feasibility. After first stage cost evaluation, the second stage leverages a residual-enhanced actor-critic network with LSTM-based temporal feature modeling to adaptively refine trajectory selection in the Cartesian coordinate system. A dual-stage cost modeling mechanism (DCMM) with weight transfer aligns semantic priorities across stages, supporting human-centered optimization. Experimental results demonstrate that the proposed LSTM-ResB-PPO achieves significantly faster convergence, attaining stable policy performance in approximately half the training iterations required by the PPO baseline, while simultaneously enhancing both reward outcomes and training stability. Compared to baseline method, the selected model reduces average cost and cost variance by 30.3% and 53.3%, and lowers ego and obstacle risks by over 77%. These findings validate the framework's effectiveness in enhancing robustness, safety, and real-time feasibility in complex assistive planning tasks.

CVMay 25, 2020
SegAttnGAN: Text to Image Generation with Segmentation Attention

Yuchuan Gou, Qiancheng Wu, Minghao Li et al.

In this paper, we propose a novel generative network (SegAttnGAN) that utilizes additional segmentation information for the text-to-image synthesis task. As the segmentation data introduced to the model provides useful guidance on the generator training, the proposed model can generate images with better realism quality and higher quantitative measures compared with the previous state-of-art methods. We achieved Inception Score of 4.84 on the CUB dataset and 3.52 on the Oxford-102 dataset. Besides, we tested the self-attention SegAttnGAN which uses generated segmentation data instead of masks from datasets for attention and achieved similar high-quality results, suggesting that our model can be adapted for the text-to-image synthesis task.