NAApr 13, 2016
A characterization of energy-preserving methods and the construction of parallel integrators for Hamiltonian systemsYuto Miyatake, John C. Butcher
High order energy-preserving methods for Hamiltonian systems are presented. For this aim, an energy-preserving condition of continuous stage Runge--Kutta methods is proved. Order conditions are simplified and parallelizable conditions are also given. The computational cost of our high order methods is comparable to that of the average vector field method of order two.
NAApr 23, 2017
Geometric numerical integrators for Hunter-Saxton-like equationsYuto Miyatake, David Cohen, Daisuke Furihata et al.
We present novel geometric numerical integrators for Hunter--Saxton-like equations by means of new multi-symplectic formulations and known Hamiltonian structures of the problems. We consider the Hunter--Saxton equation, the modified Hunter--Saxton equation, and the two-component Hunter--Saxton equation. Multi-symplectic discretisations based on these new formulations of the problems are exemplified by means of the explicit Euler box scheme, and Hamiltonian-preserving discretisations are exemplified by means of the discrete variational derivative method. We explain and justify the correct treatment of boundary conditions in a unified manner. This is necessary for a proper numerical implementation of these equations and was never explicitly clarified in the literature before, to the best of our knowledge. Finally, numerical experiments demonstrate the favourable behaviour of the proposed numerical integrators.
NANov 7, 2017
On the equivalence between SOR-type methods for linear systems and discrete gradient methods for gradient systemsYuto Miyatake, Tomohiro Sogabe, Shao-Liang Zhang
The iterative nature of many discretisation methods for continuous dynamical systems has led to the study of the connections between iterative numerical methods in numerical linear algebra and continuous dynamical systems. Certain researchers have used the explicit Euler method to understand this connection, but this method has its limitation. In this study, we present a new connection between successive over-relaxation (SOR)-type methods and gradient systems; this connection is based on discrete gradient methods. The focus of the discussion is the equivalence between SOR-type methods and discrete gradient methods applied to gradient systems. The discussion leads to new interpretations for SOR-type methods. For example, we found a new way to derive these methods; these methods monotonically decrease a certain quadratic function and obtain a new interpretation of the relaxation parameter. We also obtained a new discrete gradient while studying the new connection.
NAJun 5, 2018
Solution of the $k$-th eigenvalue problem in large-scale electronic structure calculationsDongjin Lee, Takeo Hoshi, Tomohiro Sogabe et al.
We consider computing the $k$-th eigenvalue and its corresponding eigenvector of a generalized Hermitian eigenvalue problem of $n\times n$ large sparse matrices. In electronic structure calculations, several properties of materials, such as those of optoelectronic device materials, are governed by the eigenpair with a material-specific index $k.$ We present a three-stage algorithm for computing the $k$-th eigenpair with validation of its index. In the first stage of the algorithm, we propose an efficient way of finding an interval containing the $k$-th eigenvalue $(1 \ll k \ll n)$ with a non-standard application of the Lanczos method. In the second stage, spectral bisection for large-scale problems is realized using a sparse direct linear solver to narrow down the interval of the $k$-th eigenvalue. In the third stage, we switch to a modified shift-and-invert Lanczos method to reduce bisection iterations and compute the $k$-th eigenpair with validation. Numerical results with problem sizes up to 1.5 million are reported, and the results demonstrate the accuracy and efficiency of the three-stage algorithm.
LGApr 25, 2023
Lyapunov-Stable Deep Equilibrium ModelsHaoyu Chu, Shikui Wei, Ting Liu et al.
Deep equilibrium (DEQ) models have emerged as a promising class of implicit layer models, which abandon traditional depth by solving for the fixed points of a single nonlinear layer. Despite their success, the stability of the fixed points for these models remains poorly understood. By considering DEQ models as nonlinear dynamic systems, we propose a robust DEQ model named LyaDEQ with guaranteed provable stability via Lyapunov theory. The crux of our method is ensuring the Lyapunov stability of the DEQ model's fixed points, which enables the proposed model to resist minor initial perturbations. To avoid poor adversarial defense due to Lyapunov-stable fixed points being located near each other, we orthogonalize the layers after the Lyapunov stability module to separate different fixed points. We evaluate LyaDEQ models under well-known adversarial attacks, and experimental results demonstrate significant improvement in robustness. Furthermore, we show that the LyaDEQ model can be combined with other defense methods, such as adversarial training, to achieve even better adversarial robustness.
NAOct 24, 2018
Structure-preserving model reduction for dynamical systems with a first integralYuto Miyatake
Since the expense of the numerical integration of large scale dynamical systems is often computationally prohibitive, model reduction methods, which approximate such systems by simpler and much lower order ones, are often employed to reduce the computational effort. In this paper, for dynamical systems with a first integral, new structure-preserving model reduction approaches are presented that yield reduced-order systems while preserving the first integral. We apply energy-preserving integrators to the reduced-order systems and show some numerical experiments that demonstrate the favourable behaviour of the proposed approaches.
NAJun 1, 2019
Adaptive SOR methods based on the Wolfe conditionsYuto Miyatake, Tomohiro Sogabe, Shao-Liang Zhang
Because the expense of estimating the optimal value of the relaxation parameter in the successive over-relaxation (SOR) method is usually prohibitive, the parameter is often adaptively controlled. In this paper, new adaptive SOR methods are presented that are applicable to a variety of symmetric positive definite linear systems and do not require additional matrix-vector products when updating the parameter. To this end, we regard the SOR method as an algorithm for minimising a certain objective function, which yields an interpretation of the relaxation parameter as the step size following a certain change of variables. This interpretation enables us to adaptively control the step size based on some line search techniques, such as the Wolfe conditions. Numerical examples demonstrate the favourable behaviour of the proposed methods.
NANov 5, 2015
On a relationship between the T-congruence Sylvester equation and the Lyapunov equationMasaya Oozawa, Tomohiro Sogabe, Yuto Miyatake et al.
We consider the T-congruence Sylvester equation $AX+X^{\rm T}B=C$, where $A\in \mathbb R^{m\times n}$, $B\in \mathbb R^{n\times m}$ and $C\in \mathbb R^{m\times m}$ are given, and matrix $X \in \mathbb R^{n\times m}$ is to be determined. The T-congruence Sylvester equation has recently attracted attention because of a relationship with palindromic eigenvalue problems. For example, necessary and sufficient conditions for the existence and uniqueness of solutions, and numerical solvers have been intensively studied. In this note, we will show that, under a certain condition, the T-congruence Sylvester equation can be transformed into the Lyapunov equation. This may lead to further properties and efficient numerical solvers by utilizing a great deal of studies on the Lyapunov equation.
NAMar 13, 2019
Relation between the T-congruence Sylvester equation and the generalized Sylvester equationYuki Satake, Masaya Oozawa, Tomohiro Sogabe et al.
The T-congruence Sylvester equation is the matrix equation $AX+X^{\mathrm{T}}B=C$, where $A\in\mathbb{R}^{m\times n}$, $B\in\mathbb{R}^{n\times m}$, and $C\in\mathbb{R}^{m\times m}$ are given, and $X\in\mathbb{R}^{n\times m}$ is to be determined. Recently, Oozawa et al. discovered a transformation that the matrix equation is equivalent to one of the well-studied matrix equations (the Lyapunov equation); however, the condition of the transformation seems to be too limited because matrices $A$ and $B$ are assumed to be square matrices ($m=n$). In this paper, two transformations are provided for rectangular matrices $A$ and $B$. One of them is an extension of the result of Oozawa et al. for the case $m\ge n$, and the other is a novel transformation for the case $m\le n$.
NADec 28, 2016
A cost-efficient variant of the incremental Newton iteration for the matrix $p$th rootFuminori Tatsuoka, Tomohiro Sogabe, Yuto Miyatake et al.
Incremental Newton (IN) iteration, proposed by Iannazzo, is stable for computing the matrix $p$th root, and its computational cost is $\mathcal{O}(n^3p)$ flops per iteration. In this paper, a cost-efficient variant of IN iteration is presented. The computational cost of the variant well agrees with $\mathcal{O} (n^3 \log p)$ flops per iteration, if $p$ is up to at least 100.
NAOct 31, 2016
Energy-preserving $H^1$-Galerkin schemes for the Hunter--Saxton equationYuto Miyatake, Geonsik Eom, Tomohiro Sogabe et al.
We consider the numerical integration of the Hunter--Saxton equation, which models the propagation of weakly nonlinear orientation waves. For the equation, we present two weak forms and their Galerkin discretizations. The Galerkin schemes preserve the Hamiltonian of the equation and can be implemented with cheap $H^1$ elements. Numerical experiments confirm the effectiveness of the schemes.
48.7NAMar 12
An error control framework for computing the exponential of matrices arising from the finite element discretizationFuminori Tatsuoka, Yuto Miyatake, Tomohiro Sogabe
Several methods for computing the action of the matrix exponential $\mathrm{e}^{\boldsymbol{A}} \boldsymbol{b}$ are expressed by substituting $\boldsymbol{A}$ into a rational approximation of the scalar exponential function. The error of such methods can be estimated using the numerical range of $\boldsymbol{A}$, which enables the computation of $\mathrm{e}^{\boldsymbol{A}}\boldsymbol{b}$ with a prescribed accuracy. However, when the input matrix has the structure $\boldsymbol{A} = Ï\boldsymbol{M}^{-1} \boldsymbol{K}$, this approach is challenging because computing the bounding box of numerical range is difficult and the numerical range may be too large to construct rational approximations on it. In this paper, focusing on the case where $\boldsymbol{M}$ is a well-conditioned symmetric positive definite matrix, we propose considering the numerical range of a similarity transformed matrix of $\boldsymbol{A}$. The numerical range of transformed matrix is not only numerically computable but can also be theoretically bounded depending on properties of $\boldsymbol{K}$. Numerical experiments confirm that the computations can be performed within the prescribed error tolerance.
CLJun 8, 2023
A modified model for topic detection from a corpus and a new metric evaluating the understandability of topicsTomoya Kitano, Yuto Miyatake, Daisuke Furihata
This paper presents a modified neural model for topic detection from a corpus and proposes a new metric to evaluate the detected topics. The new model builds upon the embedded topic model incorporating some modifications such as document clustering. Numerical experiments suggest that the new model performs favourably regardless of the document's length. The new metric, which can be computed more efficiently than widely-used metrics such as topic coherence, provides variable information regarding the understandability of the detected topics.
LGJan 10, 2024
Structure-Preserving Physics-Informed Neural Networks With Energy or Lyapunov StructureHaoyu Chu, Yuto Miyatake, Wenjun Cui et al.
Recently, there has been growing interest in using physics-informed neural networks (PINNs) to solve differential equations. However, the preservation of structure, such as energy and stability, in a suitable manner has yet to be established. This limitation could be a potential reason why the learning process for PINNs is not always efficient and the numerical results may suggest nonphysical behavior. Besides, there is little research on their applications on downstream tasks. To address these issues, we propose structure-preserving PINNs to improve their performance and broaden their applications for downstream tasks. Firstly, by leveraging prior knowledge about the physical system, a structure-preserving loss function is designed to assist the PINN in learning the underlying structure. Secondly, a framework that utilizes structure-preserving PINN for robust image recognition is proposed. Here, preserving the Lyapunov structure of the underlying system ensures the stability of the system. Experimental results demonstrate that the proposed method improves the numerical accuracy of PINNs for partial differential equations. Furthermore, the robustness of the model against adversarial perturbations in image data is enhanced.
LGJan 29, 2022
Composing a surrogate observation operator for sequential data assimilationKosuke Akita, Yuto Miyatake, Daisuke Furihata
In data assimilation, state estimation is not straightforward when the observation operator is unknown. This study proposes a method for composing a surrogate operator when the true operator is unknown. A neural network is used to improve the surrogate model iteratively to decrease the difference between the observations and the results of the surrogate model. A twin experiment suggests that the proposed method outperforms approaches that tentatively use a specific operator throughout the data assimilation process.
LGFeb 19, 2021
Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal MemoryTakashi Matsubara, Yuto Miyatake, Takaharu Yaguchi
A neural network model of a differential equation, namely neural ODE, has enabled the learning of continuous-time dynamical systems and probabilistic distributions with high accuracy. The neural ODE uses the same network repeatedly during a numerical integration. The memory consumption of the backpropagation algorithm is proportional to the number of uses times the network size. This is true even if a checkpointing scheme divides the computation graph into sub-graphs. Otherwise, the adjoint method obtains a gradient by a numerical integration backward in time. Although this method consumes memory only for a single network use, it requires high computational cost to suppress numerical errors. This study proposes the symplectic adjoint method, which is an adjoint method solved by a symplectic integrator. The symplectic adjoint method obtains the exact gradient (up to rounding error) with memory proportional to the number of uses plus the network size. The experimental results demonstrate that the symplectic adjoint method consumes much less memory than the naive backpropagation algorithm and checkpointing schemes, performs faster than the adjoint method, and is more robust to rounding errors.
NAJul 30, 2015
Structure-preserving integrators for the Benjamin-type equationsKimiaki Kinugasa, Yuto Miyatake, Takayasu Matsuo
The numerical integration of the Benjamin and Benjamin--Ono equations are considered. They are non-local partial differential equations involving the Hilbert transform, and due to this, so far quite few structure-preserving integrators have been proposed. In this paper, a new reformulation of the equations is stated, and new structure-preserving discretizations are proposed based on it. Numerical experiments confirm the effectiveness of the proposed integrators.