Yuanxin Wu

RO
h-index39
19papers
2,008citations
Novelty43%
AI Score39

19 Papers

58.1AIDec 14, 2023Code
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Peiyi Wang, Lei Li, Zhihong Shao et al. · pku

In this paper, we present an innovative process-oriented math process reward model called \textbf{Math-Shepherd}, which assigns a reward score to each step of math problem solutions. The training of Math-Shepherd is achieved using automatically constructed process-wise supervision data, breaking the bottleneck of heavy reliance on manual annotation in existing work. We explore the effectiveness of Math-Shepherd in two scenarios: 1) \textit{Verification}: Math-Shepherd is utilized for reranking multiple outputs generated by Large Language Models (LLMs); 2) \textit{Reinforcement Learning}: Math-Shepherd is employed to reinforce LLMs with step-by-step Proximal Policy Optimization (PPO). With Math-Shepherd, a series of open-source LLMs demonstrates exceptional performance. For instance, the step-by-step PPO with Math-Shepherd significantly improves the accuracy of Mistral-7B (77.9\%$\to$84.1\% on GSM8K and 28.6\%$\to$33.0\% on MATH). The accuracy can be further enhanced to 89.1\% and 43.5\% on GSM8K and MATH with the verification of Math-Shepherd, respectively. We believe that automatic process supervision holds significant potential for the future evolution of LLMs.

3.6CVAug 10, 2025
Generic Calibration: Pose Ambiguity/Linear Solution and Parametric-hybrid Pipeline

Yuqi Han, Qi Cai, Yuanxin Wu

Offline camera calibration techniques typically employ parametric or generic camera models. Selecting parametric models relies heavily on user experience, and an inappropriate camera model can significantly affect calibration accuracy. Meanwhile, generic calibration methods involve complex procedures and cannot provide traditional intrinsic parameters. This paper reveals a pose ambiguity in the pose solutions of generic calibration methods that irreversibly impacts subsequent pose estimation. A linear solver and a nonlinear optimization are proposed to address this ambiguity issue. Then a global optimization hybrid calibration method is introduced to integrate generic and parametric models together, which improves extrinsic parameter accuracy of generic calibration and mitigates overfitting and numerical instability in parametric calibration. Simulation and real-world experimental results demonstrate that the generic-parametric hybrid calibration method consistently excels across various lens types and noise contamination, hopefully serving as a reliable and accurate solution for camera calibration in complex scenarios.

11.1CVMar 2, 2021
A Pose-only Solution to Visual Reconstruction and Navigation

Qi Cai, Lilian Zhang, Yuanxin Wu et al.

Visual navigation and three-dimensional (3D) scene reconstruction are essential for robotics to interact with the surrounding environment. Large-scale scenes and critical camera motions are great challenges facing the research community to achieve this goal. We raised a pose-only imaging geometry framework and algorithms that can help solve these challenges. The representation is a linear function of camera global translations, which allows for efficient and robust camera motion estimation. As a result, the spatial feature coordinates can be analytically reconstructed and do not require nonlinear optimization. Experiments demonstrate that the computational efficiency of recovering the scene and associated camera poses is significantly improved by 2-4 orders of magnitude. This solution might be promising to unlock real-time 3D visual computing in many forefront applications.

3.0ROFeb 24, 2021
A Trident Quaternion Framework for Inertial-based Navigation Part II: Error Models and Application to Initial Alignment

Wei Ouyang, Yuanxin Wu

This work deals with error models for trident quaternion framework proposed in the companion paper (Part I) and further uses them to investigate the odometer-aided static/in-motion inertial navigation attitude alignment for land vehicles. By linearizing the trident quaternion kinematic equation, the left and right trident quaternion error models are obtained, which are found to be equivalent to those derived from profound group affine. The two error models are used to design their corresponding extended Kalman filters (EKF), namely, the left-quaternion EKF (LQEKF) and the right-quaternion EKF (RQEKF). Simulations and field tests are conducted to evaluate their actual performances. Owing to the high estimation consistency, the L/RQEKF converge much faster in the static alignment than the traditional error model-based EKF, even under arbitrary large heading initialization. For the in-motion alignment, the L/RQEKF possess much larger convergence region than the traditional EKF does, although they still require the aid of attitude initialization so as to avoid large initial attitude errors.

7.3ROFeb 24, 2021
A Trident Quaternion Framework for Inertial-based Navigation Part I: Rigid Motion Representation and Computation

Wei Ouyang, Yuanxin Wu

Strapdown inertial navigation research involves the parameterization and computation of the attitude, velocity and position of a rigid body in a chosen reference frame. The community has long devoted to finding the most concise and efficient representation for the strapdown inertial navigation system (INS). The current work is motivated by simplifying the existing dual quaternion representation of the kinematic model. This paper proposes a compact and elegant representation of the body's attitude, velocity and position, with the aid of a devised trident quaternion tool in which the position is accounted for by adding a second imaginary part to the dual quaternion. Eventually, the kinematics of strapdown INS are cohesively unified in one concise differential equation, which bears the same form as the classical attitude quaternion equation. In addition, the computation of this trident quaternion-based kinematic equation is implemented with the recently proposed functional iterative integration approach. Numerical results verify the analysis and show that incorporating the new representation into the functional iterative integration scheme achieves high inertial navigation computation accuracy as well.

1.2CVOct 11, 2020
Segmenting Epipolar Line

Shengjie Li, Qi Cai, Yuanxin Wu

Identifying feature correspondence between two images is a fundamental procedure in three-dimensional computer vision. Usually the feature search space is confined by the epipolar line. Using the cheirality constraint, this paper finds that the feature search space can be restrained to one of two or three segments of the epipolar line that are defined by the epipole and a so-called virtual infinity point.

2.3NASep 22, 2019
Strapdown Attitude Computation: Functional Iterative Integration versus Taylor Series Expansion

Yuanxin Wu, Yury A. Litmanovich

This paper compares two basic approaches to solving ordinary differential equations, which form the basis for attitude computation in strapdown inertial navigation systems, namely, the Taylor series expansion approach that was used in its low-order form for deriving all mainstream algorithms and the functional iterative integration approach developed recently. They are respectively applied to solve the kinematic equations of major attitude parameters, including the quaternion, the Rodrigues vector and the rotation vector. Specifically, the mainstream algorithms, which have relied on the simplified rotation vector without exception, are considerably extended by the Taylor series expansion approach using the exact rotation vector and recursive calculation of high-order derivatives. The functional iterative integration approach is respectively implemented on both the normal polynomial and the Chebyshev polynomial. Numerical results under the classical coning motion are reported to assess all derived attitude algorithms. It is revealed that in the relative frequency range when the coning to sampling frequency ratio is below 0.05-0.1 (depending on the chosen polynomial truncation order), all algorithms have the same order of accuracy if the same number of samples are used to fit the angular velocity over the iteration interval; in the range of higher relative frequency, the group of Quat/Rod/RotFIter algorithms (by the functional iterative integration approach combined with the Chebyshev polynomial) perform the best in both accuracy and robustness, thanks to the excellent numerical stability and powerful functional representation capability of the Chebyshev polynomial.

2.0AIMay 9, 2019
General Method for Prime-point Cyclic Convolution over the Real Field

Qi Cai, Tsung-Ching Lin, Yuanxin Wu et al.

A general and fast method is conceived for computing the cyclic convolution of n points, where n is a prime number. This method fully exploits the internal structure of the cyclic matrix, and hence leads to significant reduction of the multiplication complexity in terms of CPU time by 50%, as compared with Winograd's algorithm. In this paper, we only consider the real and complex fields due to their most important applications, but in general, the idea behind this method can be extended to any finite field of interest. Clearly, it is well-known that the discrete Fourier transform (DFT) can be expressed in terms of cyclic convolution, so it can be utilized to compute the DFT when the block length is a prime.

4.9ROMar 28, 2019
On Inertial Navigation and Attitude Initialization in Polar Areas

Yuanxin Wu, Chao He, Gang Liu

Inertial navigation and attitude initialization in polar areas become a hot topic in recent years in the navigation community, as the widely-used navigation mechanization of the local level frame encounters the inherent singularity when the latitude approaches 90 degrees. Great endeavors have been devoted to devising novel navigation mechanizations such as the grid or transversal frames. This paper highlights the fact that the common Earth-frame mechanization is sufficiently good to well handle the singularity problem in polar areas. Simulation results are reported to demonstrate the singularity problem and the effectiveness of the Earth-frame mechanization.

15.5ROOct 16, 2018Code
StructVIO : Visual-inertial Odometry with Structural Regularity of Man-made Environments

Danping Zou, Yuanxin Wu, Ling Pei et al.

We propose a novel visual-inertial odometry approach that adopts structural regularity in man-made environments. Instead of using Manhattan world assumption, we use Atlanta world model to describe such regularity. An Atlanta world is a world that contains multiple local Manhattan worlds with different heading directions. Each local Manhattan world is detected on-the-fly, and their headings are gradually refined by the state estimator when new observations are coming. With fully exploration of structural lines that aligned with each local Manhattan worlds, our visual-inertial odometry method become more accurate and robust, as well as much more flexible to different kinds of complex man-made environments. Through extensive benchmark tests and real-world tests, the results show that the proposed approach outperforms existing visual-inertial systems in large-scale man-made environments

7.3SYAug 16, 2017
RodFIter: Attitude Reconstruction from Inertial Measurement by Functional Iteration

Yuanxin Wu

Rigid motion computation or estimation is a cornerstone in numerous fields. Attitude computation can be achieved by integrating the angular velocity measured by gyroscopes, the accuracy of which is crucially important for the dead-reckoning inertial navigation. The state-of-the-art attitude algorithms have unexceptionally relied on the simplified differential equation of the rotation vector to obtain the attitude. This paper proposes a Functional Iteration technique with the Rodrigues vector (named the RodFIter method) to analytically reconstruct the attitude from gyroscope measurements. The RodFIter method is provably exact in reconstructing the incremental attitude as long as the angular velocity is exact. Notably, the Rodrigues vector is analytically obtained and can be used to update the attitude over the considered time interval. The proposed method gives birth to an ultimate attitude algorithm scheme that can be naturally extended to the general rigid motion computation. It is extensively evaluated under the attitude coning motion and compares favorably in accuracy with the mainstream attitude algorithms. This work is believed having eliminated the long-standing theoretical barrier in exact motion integration from inertial measurements.

3.2ROJul 22, 2017
Gyroscope Calibration via Magnetometer

Yuanxin Wu, Ling Pei

Magnetometers, gyroscopes and accelerometers are commonly used sensors in a variety of applications. The paper proposes a novel gyroscope calibration method in the homogeneous magnetic field by the help of magnetometer. It is shown that, with sufficient rotation excitation, the homogeneous magnetic field vector can be exploited to serve as a good reference for calibrating low-cost gyroscopes. The calibration parameters include the gyroscope scale factor, non-orthogonal coefficient and bias for three axes, as well as its misalignment to the magnetometer frame. Simulation and field test results demonstrate the method's effectiveness.

2.1ROSep 7, 2015
Underwater Doppler Navigation with Self-calibration

Xianfei Pan, Yuanxin Wu

Precise autonomous navigation remains a substantial challenge to all underwater platforms. Inertial Measurement Units (IMU) and Doppler Velocity Logs (DVL) have complementary characteristics and are promising sensors that could enable fully autonomous underwater navigation in unexplored areas without relying on additional external Global Positioning System (GPS) or acoustic beacons. This paper addresses the combined IMU/DVL navigation system from the viewpoint of observability. We show by analysis that under moderate conditions the combined system is observable. Specifically, the DVL parameters, including the scale factor and misalignment angles, can be calibrated in-situ without using external GPS or acoustic beacon sensors. Simulation results using a practical estimator validate the analytic conclusions.

10.1ROSep 7, 2015
On Calibration of Three-axis Magnetometer

Yuanxin Wu, Wei Shi

Magnetometer has received wide applications in attitude determination and scientific measurements. Calibration is an important step for any practical magnetometer use. The most popular three-axis magnetometer calibration methods are attitude-independent and have been founded on an approximate maximum likelihood (ML) estimation with a quartic subjective function, derived from the fact that the magnitude of the calibrated measurements should be constant in a homogeneous magnetic field. This paper highlights the shortcomings of those popular methods and proposes to use the quadratic optimal ML estimation instead for magnetometer calibration. Simulation and test results show that the optimal ML calibration is superior to the approximate ML methods for magnetometer calibration in both accuracy and stability, especially for those situations without sufficient attitude excitation. The significant benefits deserve the moderately increased computation burden. The main conclusion obtained in the context of magnetometer in this paper is potentially applicable to various kinds of three-axis sensors.

9.1ROSep 2, 2014
Versatile Land Navigation Using Inertial Sensors and Odometry: Self-calibration, In-motion Alignment and Positioning

Yuanxin Wu

Inertial measurement unit (IMU) and odometer have been commonly-used sensors for autonomous land navigation in the global positioning system (GPS)-denied scenarios. This paper systematically proposes a versatile strategy for self-contained land vehicle navigation using the IMU and an odometer. Specifically, the paper proposes a self-calibration and refinement method for IMU/odometer integration that is able to overcome significant variation of the misalignment parameters, which are induced by many inevitable and adverse factors such as load changing, refueling and ambient temperature. An odometer-aided IMU in-motion alignment algorithm is also devised that enables the first-responsive functionality even when the vehicle is running freely. The versatile strategy is successfully demonstrated and verified via long-distance real tests.

15.6RONov 18, 2013
On 'A Kalman Filter-Based Algorithm for IMU-Camera Calibration: Observability Analysis and Performance Evaluation'

Yuanxin Wu

The above-mentioned work [1] in IEEE-TR'08 presented an extended Kalman filter for calibrating the misalignment between a camera and an IMU. As one of the main contributions, the locally weakly observable analysis was carried out using Lie derivatives. The seminal paper [1] is undoubtedly the cornerstone of current observability work in SLAM and a number of real SLAM systems have been developed on the observability result of this paper, such as [2, 3]. However, the main observability result of this paper [1] is founded on an incorrect proof and actually cannot be acquired using the local observability technique therein, a fact that is apparently not noticed by the SLAM community over a number of years.

12.1ROJul 6, 2012
Velocity/Position Integration Formula (II): Application to Inertial Navigation Computation

Yuanxin Wu, Xianfei Pan

Inertial navigation applications are usually referenced to a rotating frame. Consideration of the navigation reference frame rotation in the inertial navigation algorithm design is an important but so far less seriously treated issue, especially for ultra-high-speed flying aircraft or the future ultra-precision navigation system of several meters per hour. This paper proposes a rigorous approach to tackle the issue of navigation frame rotation in velocity/position computation by use of the newly-devised velocity/position integration formulae in the Part I companion paper. The two integration formulae set a well-founded cornerstone for the velocity/position algorithms design that makes the comprehension of the inertial navigation computation principle more accessible to practitioners, and different approximations to the integrals involved will give birth to various velocity/position update algorithms. Two-sample velocity and position algorithms are derived to exemplify the design process. In the context of level-flight airplane examples, the derived algorithm is analytically and numerically compared to the typical algorithms existing in the literature. The results throw light on the problems in existing algorithms and the potential benefits of the derived algorithm.

16.7ROJul 6, 2012
Velocity/Position Integration Formula (I): Application to In-flight Coarse Alignment

Yuanxin Wu, Xianfei Pan

The in-flight alignment is a critical stage for airborne INS/GPS applications. The alignment task is usually carried out by the Kalman filtering technique that necessitates a good initial attitude to obtain satisfying performance. Due to the airborne dynamics, the in-flight alignment is much difficult than alignment on the ground. This paper proposes an optimization-based coarse alignment approach using GPS position/velocity as input, founded on the newly-derived velocity/position integration formulae. Simulation and flight test results show that, with the GPS lever arm well handled, it is potentially able to yield the initial heading up to one degree accuracy in ten seconds. It can serve as a nice coarse in-flight alignment without any prior attitude information for the subsequent fine Kalman alignment. The approach can also be applied to other applications that require aligning the INS on the run.