Yoshihiro Sawano

3papers

33citations

Novelty52%

AI Score23

Ranked #181,577 of 201,326 authors (top 90%)#39,191 in LG (top 92%)

3 Papers

LGNov 20, 2020

A global universality of two-layer neural networks with ReLU activations

Naoya Hatano, Masahiro Ikeda, Isao Ishikawa et al.

In the present study, we investigate a universality of neural networks, which concerns a density of the set of two-layer neural networks in a function spaces. There are many works that handle the convergence over compact sets. In the present paper, we consider a global convergence by introducing a norm suitably, so that our results will be uniform over any compact set.

FANov 27, 2019

Composition operators on reproducing kernel Hilbert spaces with analytic positive definite functions

Masahiro Ikeda, Isao Ishikawa, Yoshihiro Sawano

In this paper, we specify what functions induce the bounded composition operators on a reproducing kernel Hilbert space (RKHS) associated with an analytic positive definite function defined on $\mathbf{R}^d$. We prove that only affine transforms can do so in a pretty large class of RKHS. Our result covers not only the Paley-Wiener space on the real line, studied in previous works, but also much more general RKHSs corresponding to analytic positive definite functions where existing methods do not work. Our method only relies on an intrinsic properties of the RKHSs, and we establish a connection between the behavior of composition operators and the asymptotic properties of the greatest zeros of orthogonal polynomials on a weighted $L^2$-spaces on the real line. We also investigate the compactness of the composition operators and show that any bounded composition operators cannot be compact in our situation.

MLMay 19, 2018

The global optimum of shallow neural network is attained by ridgelet transform

Sho Sonoda, Isao Ishikawa, Masahiro Ikeda et al.

We prove that the global minimum of the backpropagation (BP) training problem of neural networks with an arbitrary nonlinear activation is given by the ridgelet transform. A series of computational experiments show that there exists an interesting similarity between the scatter plot of hidden parameters in a shallow neural network after the BP training and the spectrum of the ridgelet transform. By introducing a continuous model of neural networks, we reduce the training problem to a convex optimization in an infinite dimensional Hilbert space, and obtain the explicit expression of the global optimizer via the ridgelet transform.