all AI news
Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN. (arXiv:2012.09542v3 [cs.CV] UPDATED)
cs.CV updates on arXiv.org arxiv.org
3D Convolutional Neural Network (3D CNN) captures spatial and temporal
information on 3D data such as video sequences. However, due to the convolution
and pooling mechanism, the information loss seems unavoidable. To improve the
visual explanations and classification in 3D CNN, we propose two approaches; i)
aggregate layer-wise global to local (global-local) discrete gradients using
trained 3DResNext network, and ii) implement attention gating network to
improve the accuracy of the action recognition. The proposed approach intends
to show the usefulness …
3d arxiv attention cnn cv global local attention localization weakly-supervised