Dániel Rácz

h-index: 1

Papers: 5

Citations: 2

Novelty: 39%

AI Score: 27

Ranked #157,532 of 194,257 authors (top 81%); #34,542 in LG (top 86%)

5 Papers

2.0 · LG · Oct 26, 2023 · Code

Optimization dependent generalization bound for ReLU networks based on sensitivity in the tangent bundle

Dániel Rácz, Mihály Petreczky, András Csertán et al.

Recent advances in deep learning have produced very promising results on the generalization ability of deep neural networks; however, the literature still lacks a comprehensive theory explaining why heavily over-parametrized models are able to generalize well while fitting the training data. In this paper, we propose a PAC-type bound on the generalization error of feedforward ReLU networks by estimating the Rademacher complexity of the set of networks reachable from an initial parameter vector via gradient descent. The key idea is to bound the sensitivity of the network's gradient to perturbations of the input data along the optimization trajectory. The obtained bound does not explicitly depend on the depth of the network. Our results are experimentally verified on the MNIST and CIFAR-10 datasets.
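To make the central quantity in this abstract concrete, the following is a minimal sketch of a Monte Carlo estimate of the empirical Rademacher complexity. It assumes a finite function class given by a table of outputs, which is a toy setting and not the paper's method (the paper bounds this quantity analytically for networks reachable by gradient descent). The names `empirical_rademacher`, `single` and `rich` are hypothetical.

```python
import itertools
import random


def empirical_rademacher(outputs, num_draws=2000, seed=0):
    """Monte Carlo estimate of E_sigma[ max_k (1/n) * sum_i sigma_i * f_k(x_i) ].

    outputs[k][i] holds f_k(x_i), the value of the k-th function on the
    i-th sample point. The class is assumed finite (toy setting).
    """
    rng = random.Random(seed)
    n = len(outputs[0])
    total = 0.0
    for _ in range(num_draws):
        # Draw independent Rademacher signs, one per sample point.
        sigma = [rng.choice((-1.0, 1.0)) for _ in range(n)]
        # Supremum over the class of the sign-weighted average output.
        total += max(
            sum(s * v for s, v in zip(sigma, row)) / n for row in outputs
        )
    return total / num_draws


# A class with a single function cannot correlate with random signs.
single = [[1.0, -1.0, 1.0, -1.0]]
# A class realizing every sign pattern on 4 points matches any sign draw.
rich = [list(p) for p in itertools.product((-1.0, 1.0), repeat=4)]

print(empirical_rademacher(single))  # close to 0
print(empirical_rademacher(rich))    # the maximal value for +/-1 outputs
```

The richer class attains a larger value because some member always matches the random signs, which is the sense in which Rademacher complexity measures capacity.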

2.0 · LG · Jul 7, 2023

PAC bounds of continuous Linear Parameter-Varying systems related to neural ODEs

Dániel Rácz, Mihály Petreczky, Bálint Daróczy

We consider the problem of learning Neural Ordinary Differential Equations (neural ODEs) within the context of Linear Parameter-Varying (LPV) systems in continuous time. LPV systems include bilinear systems, which are known to be universal approximators for non-linear systems. Moreover, a large class of neural ODEs can be embedded into LPV systems. As our main contribution, we provide Probably Approximately Correct (PAC) bounds under stability for LPV systems related to neural ODEs. The resulting bounds have the advantage that they do not depend on the integration interval.

4.1 · LG · Mar 17, 2025

A finite-sample bound for identifying partially observed linear switched systems from a single trajectory

Dániel Rácz, Mihály Petreczky, Bálint Daróczy

We derive a finite-sample probabilistic bound on the parameter estimation error of a system identification algorithm for Linear Switched Systems. The algorithm estimates Markov parameters from a single trajectory and applies a variant of the Ho-Kalman algorithm to recover the system matrices. Our bound guarantees statistical consistency under the assumption that the true system exhibits quadratic stability. The proof leverages the theory of weakly dependent processes. To the best of our knowledge, this is the first finite-sample bound for this algorithm in the single-trajectory setting.

2.6 · LG · May 16, 2024

A finite-sample generalization bound for stable LPV systems

Dániel Rácz, Martin Gonzalez, Mihály Petreczky et al.

One of the main theoretical challenges in learning dynamical systems from data is providing upper bounds on the generalization error, that is, the difference between the expected prediction error and the empirical prediction error measured on some finite sample. In machine learning, a popular class of such bounds is the so-called Probably Approximately Correct (PAC) bounds. In this paper, we derive a PAC bound for stable continuous-time linear parameter-varying (LPV) systems. Our bound depends on the H2 norm of the chosen class of LPV systems, but does not depend on the time interval for which the signals are considered.
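The generalization error described in this abstract can be written out in generic form. This is a sketch in assumed notation, not taken from the paper: a loss $\ell$, a model class $\mathcal{F}$, a data distribution $\mathcal{D}$, a sample of size $N$, and a bound $\varepsilon(N,\delta)$ holding with confidence $1-\delta$. The paper's exact statement and constants may differ.

```latex
\[
  \mathcal{L}(f) = \mathbb{E}_{(u,y)\sim\mathcal{D}}\bigl[\ell(f(u), y)\bigr],
  \qquad
  \widehat{\mathcal{L}}_N(f) = \frac{1}{N}\sum_{i=1}^{N} \ell\bigl(f(u_i), y_i\bigr),
\]
\[
  \mathbb{P}\Bigl(
    \sup_{f\in\mathcal{F}}
      \bigl(\mathcal{L}(f) - \widehat{\mathcal{L}}_N(f)\bigr)
    \le \varepsilon(N,\delta)
  \Bigr) \ge 1-\delta .
\]
```

In this generic form, the abstract's claim is that $\varepsilon(N,\delta)$ depends on the H2 norm of the model class but not on the length of the time interval.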

3.1 · LG · Oct 26, 2021 · Code

Gradient representations in ReLU networks as similarity functions

Dániel Rácz, Bálint Daróczy

Feed-forward networks can be interpreted as mappings with linear decision surfaces at the level of the last layer. We investigate how the tangent space of the network can be exploited to refine the decision in the case of ReLU (Rectified Linear Unit) activations. We show that a simple Riemannian metric parametrized on the parameters of the network forms a similarity function at least as good as the original network, and we suggest a sparse metric to increase the similarity gap.
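The idea of using gradient representations as a similarity function can be illustrated with a minimal sketch. It assumes a one-hidden-layer ReLU network f(x) = sum_j v[j] * relu(W[j] . x) and the plain Euclidean inner product between parameter gradients, which is the identity-metric special case; the Riemannian and sparse metrics from the paper are not reproduced here. The names `relu_net_gradient` and `gradient_similarity` are hypothetical.

```python
def relu_net_gradient(x, W, v):
    """Flattened gradient of f(x) = sum_j v[j] * relu(W[j] . x)
    with respect to all parameters (each row W[j], then v[j])."""
    grad = []
    for w_j, v_j in zip(W, v):
        pre = sum(a * b for a, b in zip(w_j, x))  # pre-activation W[j] . x
        active = 1.0 if pre > 0 else 0.0          # ReLU derivative
        grad.extend(v_j * active * x_i for x_i in x)  # d f / d W[j]
        grad.append(max(pre, 0.0))                    # d f / d v[j]
    return grad


def gradient_similarity(x1, x2, W, v):
    """Inner product of the two gradient representations
    (identity metric, assumed here for illustration)."""
    g1 = relu_net_gradient(x1, W, v)
    g2 = relu_net_gradient(x2, W, v)
    return sum(a * b for a, b in zip(g1, g2))


# Toy network: each hidden unit responds to one input coordinate.
W = [[1.0, 0.0], [0.0, 1.0]]
v = [1.0, -1.0]

print(gradient_similarity([1.0, 0.0], [1.0, 0.0], W, v))  # same unit active
print(gradient_similarity([1.0, 0.0], [0.0, 1.0], W, v))  # disjoint units
```

Inputs that activate the same hidden units have overlapping gradients and hence a positive similarity, while inputs that activate disjoint units score zero in this toy setting.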