50.7LGApr 13Code
Low-rank Optimization Trajectories Modeling for LLM RLVR AccelerationZhipeng Chen, Tao Qian, Wayne Xin Zhao et al.
Recently, scaling reinforcement learning with verifiable rewards (RLVR) for large language models (LLMs) has emerged as an effective training paradigm for significantly improving model capabilities, which requires guiding the model to perform extensive exploration and learning, leading to substantial computational overhead and becoming a key challenge. To reduce the number of training steps, Prior work performs linear extrapolation of model parameters. However, the dynamics of model parameter updates during RLVR training remain insufficiently understood. To further investigate the evolution of LLMs during RLVR training, we conduct empirical experiments and find that the rank-1 subspace of the model does not evolve linearly, and its dominance over the original parameters is further amplified during LoRA training. Based on the above insights, we propose the \textbf{N}onlinear \textbf{Ext}rapolation of low-rank trajectories (\textbf{NExt}), a novel framework that models and extrapolates low-rank parameter trajectories in a nonlinear manner. Concretely, we first train the model using LoRA and extract the rank-1 subspace of parameter differences at multiple training steps, which is then used for the subsequent nonlinear extrapolation. Afterward, we utilized the extracted rank-1 subspace to train a predictor, which can model the trajectory of parameter updates during RLVR, and then perform the predict-extend process to extrapolate model parameters, achieving the acceleration of RLVR. To further study and understand NExt, we conduct comprehensive experiments that demonstrate the effectiveness and robustness of the method. Our method reduces computational overhead by approximately 37.5\% while remaining compatible with a wide range of RLVR algorithms and tasks. We release our code in https://github.com/RUCAIBox/NExt.
SYDec 13, 2024Code
V2Sim: An Open-Source Microscopic V2G Simulation Platform in Urban Power and Transportation NetworkTao Qian, Mingyu Fang, Qinran Hu et al.
This paper proposes V2Sim, an open source Pythonbased simulation platform designed for advanced vehicle-to-grid (V2G) analysis in coupled urban power and transportation networks. By integrating a microscopic urban transportation network (MUTN) with a power distribution network (PDN), V2Sim enables precise modeling of electric vehicle charging loads (EVCL) and dynamic V2G operations. The platform uniquely combines SUMO for MUTN simulations and an optimized DistFlow model for PDN analysis, with dedicated models for fast charging stations (FCS) and slow charging stations (SCS), capturing detailed charging dynamics often overlooked in existing simulation tools. V2Sim supports a range of customizable V2G strategies, advanced fault-sensing in charging stations, and parallel simulation through multi-processing to accelerate large-scale case studies. Case studies using a real-world MUTN from Nanjing, China, demonstrate V2Sim's capability to analyze the spatial-temporal distribution of EVCL and evaluate V2G impacts, such as fault dissemination and pricing variations, in unprecedented detail. Unlike traditional equilibrium models, V2Sim captures single-vehicle behavior and charging interactions at the microscopic level, offering unparalleled accuracy in assessing the operational and planning needs of V2G-compatible systems. This platform serves as a comprehensive tool for researchers and urban planners aiming to optimize integrated power and transportation networks.
NAOct 24, 2017
Aspects of 2D-Adaptive Fourier DecompositionsYou Gao, Tao Qian, Vladimir Temlyakov et al.
As a new type of series expansion, the so-called one-dimensional adaptive Fourier decomposition (AFD) and its variations (1D-AFDs) have effective applications in signal analysis and system identification. The 1D-AFDs have considerable influence to the rational approximation of one complex variable and phase retrieving problems, etc. In a recent paper, Qian developed 2D-AFDs for treating square images as the essential boundary of the 2-torus embedded into the space of two complex variables. This paper studies the numerical aspects of multi-dimensional AFDs, and in particular 2D-AFDs, which mainly include (i) Numerical algorithms of several types of 2D-AFDs in relation to image representation; (ii) Perform experiments for the algorithms with comparisons between 5 types of image reconstruction methods in the Fourier category; and (iii) New and sharper estimations for convergence rates of orthogonal greedy algorithm and pre-orthogonal greedy algorithm. The comparison shows that the 2D-AFD methods achieve optimal results among the others.
NANov 23, 2017
Fast algorithm of adaptive Fourier seriesYou Gao, Min Ku, Tao Qian
Adaptive Fourier decomposition (AFD, precisely 1-D AFD or Core-AFD) was originated for the goal of positive frequency representations of signals. It achieved the goal and at the same time offered fast decompositions of signals. There then arose several types of AFDs. AFD merged with the greedy algorithm idea, and in particular, motivated the so-called pre-orthogonal greedy algorithm (Pre-OGA) that was proven to be the most efficient greedy algorithm. The cost of the advantages of the AFD type decompositions is, however, the high computational complexity due to the involvement of maximal selections of the dictionary parameters. The present paper offers one formulation of the 1-D AFD algorithm by building the FFT algorithm into it. Accordingly, the algorithm complexity is reduced, from the original $\mathcal{O}(M N^2)$ to $\mathcal{O}(M N\log_2 N)$, where $N$ denotes the number of the discretization points on the unit circle and $M$ denotes the number of points in $[0,1)$. This greatly enhances the applicability of AFD. Experiments are carried out to show the high efficiency of the proposed algorithm.
ASMar 31, 2022
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training StrategyShuai Guo, Jiatong Shi, Tao Qian et al.
Deep learning based singing voice synthesis (SVS) systems have been demonstrated to flexibly generate singing with better qualities, compared to conventional statistical parametric based methods. However, neural systems are generally data-hungry and have difficulty to reach reasonable singing quality with limited public available training data. In this work, we explore different data augmentation methods to boost the training of SVS systems, including several strategies customized to SVS based on pitch augmentation and mix-up augmentation. To further stabilize the training, we introduce the cycle-consistent training strategy. Extensive experiments on two public singing databases demonstrate that our proposed augmentation methods and the stabilizing training strategy can significantly improve the performance on both objective and subjective evaluations.
OCJul 14, 2021
A Granular Sieving Algorithm for Deterministic Global OptimizationTao Qian, Lei Dai, Liming Zhang et al.
A gradient-free deterministic method is developed to solve global optimization problems for Lipschitz continuous functions defined in arbitrary path-wise connected compact sets in Euclidean spaces. The method can be regarded as granular sieving with synchronous analysis in both the domain and range of the objective function. With straightforward mathematical formulation applicable to both univariate and multivariate objective functions, the global minimum value and all the global minimizers are located through two decreasing sequences of compact sets in, respectively, the domain and range spaces. The algorithm is easy to implement with moderate computational cost. The method is tested against extensive benchmark functions in the literature. The experimental results show remarkable effectiveness and applicability of the algorithm.
CLAug 27, 2016
A Bi-LSTM-RNN Model for Relation Classification Using Low-Cost Sequence FeaturesFei Li, Meishan Zhang, Guohong Fu et al.
Relation classification is associated with many potential applications in the artificial intelligence area. Recent approaches usually leverage neural networks based on structure features such as syntactic or dependency features to solve this problem. However, high-cost structure features make such approaches inconvenient to be directly used. In addition, structure features are probably domain-dependent. Therefore, this paper proposes a bi-directional long-short-term-memory recurrent-neural-network (Bi-LSTM-RNN) model based on low-cost sequence features to address relation classification. This model divides a sentence or text segment into five parts, namely two target entities and their three contexts. It learns the representations of entities and their contexts, and uses them to classify relations. We evaluate our model on two standard benchmark datasets in different domains, namely SemEval-2010 Task 8 and BioNLP-ST 2016 Task BB3. In the former dataset, our model achieves comparable performance compared with other models using sequence features. In the latter dataset, our model obtains the third best results compared with other models in the official evaluation. Moreover, we find that the context between two target entities plays the most important role in relation classification. Furthermore, statistic experiments show that the context between two target entities can be used as an approximate replacement of the shortest dependency path when dependency parsing is not used.