Tomonori Kouya

2papers

2 Papers

25.4MSMar 16
Acceleration of multi-component multiple-precision arithmetic with branch-free algorithms and SIMD vectorization

Tomonori Kouya

Multiple-precision floating-point branch-free algorithms can significantly accelerate multi-component arithmetic implemented by combining hardware-based binary64 and binary32, particularly for triple- and quadruple-precision computations. In this study, we achieved benchmark results on x86 and ARM CPU platforms to quantify the accelerations achieved in linear computations and polynomial evaluation by integrating these algorithms.

NASep 18, 2018
Performance Evaluation of an Extrapolation Method for Ordinary Differential Equations with Error-free Transformation

Tomonori Kouya

The application of error-free transformation (EFT) is recently being developed to solve ill-conditioned problems. It can reduce the number of arithmetic operations required, compared with multiple precision arithmetic, and also be applied by using functions supported by a well-tuned BLAS library. In this paper, we propose the application of EFT to explicit extrapolation methods to solve initial value problems of ordinary differential equations. Consequently, our implemented routines can be effective for large-sized linear ODE and small-sized nonlinear ODE, especially in the case when harmonic sequence is used.