ML LGFeb 11, 2021

Variational Bayesian Sequence-to-Sequence Networks for Memory-Efficient Sign Language Translation

Harris Partaourides, Andreas Voskou, Dimitrios Kosmopoulos, Sotirios Chatzis, Dimitris N. Metaxas

arXiv:2102.06143v16.35 citations

Originality Incremental advance

AI Analysis

This addresses real-time assisted technology for the deaf, though it appears incremental as it builds on existing recurrent networks.

The paper tackles memory-efficient continuous sign language translation by introducing a variational Bayesian sequence-to-sequence network with a Gaussian posterior and nonparametric prior, achieving substantial weight compression without performance loss.

Memory-efficient continuous Sign Language Translation is a significant challenge for the development of assisted technologies with real-time applicability for the deaf. In this work, we introduce a paradigm of designing recurrent deep networks whereby the output of the recurrent layer is derived from appropriate arguments from nonparametric statistics. A novel variational Bayesian sequence-to-sequence network architecture is proposed that consists of a) a full Gaussian posterior distribution for data-driven memory compression and b) a nonparametric Indian Buffet Process prior for regularization applied on the Gated Recurrent Unit non-gate weights. We dub our approach Stick-Breaking Recurrent network and show that it can achieve a substantial weight compression without diminishing modeling performance.

View on arXiv PDF

Similar