Publicatie

Designing Interpretable Recurrent Neural Networks for Video Reconstruction Via Deep Unfolding

Tijdschriftbijdrage - Tijdschriftartikel

Deep unfolding methods design deep neural networks as learned variations of optimization algorithms through the unrolling of their iterations. These networks have been shown to achieve faster convergence and higher accuracy than the original optimization methods. In this line of research, this paper presents novel interpretable deep recurrent neural networks (RNNs), designed by the unfolding of iterative algorithms that solve the task of sequential signal reconstruction (in particular, video reconstruction). The proposed networks are designed by accounting that video frames’ patches have a sparse representation and the temporal difference between consecutive representations is also sparse. Specifically, we design an interpretable deep RNN (coined reweighted-RNN) by unrolling the iterations of a proximal method that solves a reweighted version of the ℓ1-ℓ1 minimization problem. Due to the underlying minimization model, our reweighted-RNN has a different thresholding function (alias, different activation function) for each hidden unit in each layer. In this way, it has higher network expressivity than existing deep unfolding RNN models. We also present the derivative ℓ1-ℓ1-RNN model, which is obtained by unfolding a proximal method for the ℓ1-ℓ1 minimization problem. We apply the proposed interpretable RNNs to the task of video frame reconstruction from low-dimensional measurements, that is, sequential video frame reconstruction. The experimental results on various datasets demonstrate that the proposed deep RNNs outperform various RNN models.

Tijdschrift: IEEE Trans Image Process

ISSN: 1057-7149

Volume: 30

Pagina's: 4099 - 4113

Jaar van publicatie:2021

Institutional Repository URL: https://cris.vub.be/ws/files/67600763/Designing_Interpretable_Recurrent_Neural_Networks_for_Video_Reconstruction_Via_Deep_Unfolding_REVISION_Open_access_.pdf
WoS Id: 000639653800002
Scopus Id: 85103763371
ORCID: /0000-0002-2881-2727/work/91819009
ORCID: /0000-0001-9300-5860/work/91817445
DOI: https://doi.org/10.1109/tip.2021.3069296

BOF-keylabel:ja

IOF-keylabel:ja

BOF-publication weight:10

Auteurs:Regional

Authors from:Government, Higher Education

Toegankelijkheid:Open

Publicatie

Designing Interpretable Recurrent Neural Networks for Video Reconstruction Via Deep Unfolding

Tijdschriftbijdrage - Tijdschriftartikel

Auteurs/uitgever

Onderzoekseenheden

Projecten