Xiaowu Zhang

CL
h-index: 2
5 papers
14 citations
Novelty: 33%
AI Score: 29

5 Papers

CL, Aug 3, 2023
Does Correction Remain A Problem For Large Language Models?

Xiaowu Zhang, Xiaotian Zhang, Cheng Yang et al.

As large language models, such as GPT, continue to advance the capabilities of natural language processing (NLP), the question arises: does the problem of correction still persist? This paper investigates the role of correction in the context of large language models by conducting two experiments. The first experiment focuses on correction as a standalone task, employing few-shot learning techniques with GPT-like models for error correction. The second experiment explores the notion of correction as a preparatory task for other NLP tasks, examining whether large language models can tolerate and perform adequately on texts containing certain levels of noise or errors. By addressing these experiments, we aim to shed light on the significance of correction in the era of large language models and its implications for various NLP applications.

CL, Apr 10, 2025, Code
Unveiling the Impact of Multimodal Features on Chinese Spelling Correction: From Analysis to Design

Xiaowu Zhang, Hongfei Zhao, Jingyi Hou et al.

The Chinese Spelling Correction (CSC) task focuses on detecting and correcting spelling errors in sentences. Current research primarily explores two approaches: traditional multimodal pre-trained models and large language models (LLMs). However, LLMs face limitations in CSC, particularly over-correction, making them suboptimal for this task. While existing studies have investigated the use of phonetic and graphemic information in multimodal CSC models, effectively leveraging these features to enhance correction performance remains a challenge. To address this, we propose the Multimodal Analysis for Character Usage (MACU) experiment, identifying potential improvements for multimodal correction. Based on empirical findings, we introduce NamBert, a novel multimodal model for Chinese spelling correction. Experiments on benchmark datasets demonstrate NamBert's superiority over SOTA methods. We also conduct a comprehensive comparison between NamBert and LLMs, systematically evaluating their strengths and limitations in CSC. Our code and model are available at https://github.com/iioSnail/NamBert.

AI, May 23, 2023, Code
XRoute Environment: A Novel Reinforcement Learning Environment for Routing

Zhanwen Zhou, Hankz Hankui Zhuo, Xiaowu Zhang et al.

Routing is a crucial and time-consuming stage in the modern design automation flow for advanced technology nodes. Great progress in the field of reinforcement learning makes it possible to use those approaches to improve routing quality and efficiency. However, the scale of the routing problems solved by reinforcement learning-based methods in recent studies is too small for these methods to be used in commercial EDA tools. We introduce the XRoute Environment, a new reinforcement learning environment where agents are trained to select and route nets in an advanced, end-to-end routing framework. Novel algorithms and ideas can be quickly tested in it in a safe and reproducible manner. The resulting environment is challenging, easy to use, easy to customize and extend with additional scenarios, and available under a permissive open-source license. In addition, it provides support for distributed deployment and multi-instance experiments. We propose two tasks for learning and build a full-chip test bed with routing benchmarks of various region sizes. We also pre-define several static routing regions with different pin densities and numbers of nets for easier learning and testing. For the net ordering task, we report baseline results for two widely used reinforcement learning algorithms (PPO and DQN) and one search-based algorithm (TritonRoute). The XRoute Environment will be available at https://github.com/xplanlab/xroute_env.

CL, Nov 15, 2024
Research on Domain-Specific Chinese Spelling Correction Method Based on Plugin Extension Modules

Xiaowu Zhang, Hongfei Zhao, Xuan Chang

This paper proposes a Chinese spelling correction method based on plugin extension modules, aimed at addressing the limitations of existing models in handling domain-specific texts. Traditional Chinese spelling correction models are typically trained on general-domain datasets, resulting in poor performance when encountering specialized terminology in domain-specific texts. To address this issue, we design an extension module that learns the features of domain-specific terminology, thereby enhancing the model's correction capabilities within specific domains. This extension module can provide domain knowledge to the model without compromising its general spelling correction performance, thus improving its accuracy in specialized fields. Experimental results demonstrate that after integrating extension modules for medical, legal, and official document domains, the model's correction performance is significantly improved compared to the baseline model without any extension modules.

SY, Aug 1, 2017
Optimal design of three-planetary-gear power-split hybrid powertrains

Weichao Zhuang, Xiaowu Zhang, Ding Zhao et al.

Many of today's power-split hybrid electric vehicles (HEVs) utilize planetary gears (PGs) to connect the powertrain elements together. Recent power-split HEVs tend to use two PGs, and some of them have multiple modes to achieve better fuel economy and driving performance. Looking to the future, hybrid powertrain technologies must be enhanced to design hybrid light trucks. For light trucks, the need for multiple modes and more PGs is stronger, to achieve the required performance. To systematically explore all the possible designs of multi-mode HEVs with three PGs, an efficient searching and optimization methodology is proposed. All possible clutch topologies and modes for one existing configuration that uses three PGs were exhaustively searched. Launching performance is first used to screen out designs that fail to satisfy the launching requirement. A near-optimal and computationally efficient energy management strategy was then employed to identify designs that achieve good fuel economy. The proposed design process successfully identifies 8 designs that achieve better launching performance and better fuel economy, while using fewer clutches than the benchmark and a patented design.