ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network. (arXiv:2207.09927v1 [cs.CV]) | allainews.com

July 21, 2022, 1:12 a.m. | Nikolaos Gkalelis, Dimitrios Daskalakis, Vasileios Mezaris

cs.CV updates on arXiv.org arxiv.org

In this paper a pure-attention bottom-up approach, called ViGAT, that
utilizes an object detector together with a Vision Transformer (ViT) backbone
network to derive object and frame features, and a head network to process
these features for the task of event recognition and explanation in video, is
proposed. The ViGAT head consists of graph attention network (GAT) blocks
factorized along the spatial and temporal dimensions in order to capture
effectively both local and long-term dependencies between objects or frames.
Moreover, …

arxiv attention cv event graph network video

More from arxiv.org / cs.CV updates on arXiv.org

DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness 21 hours ago | arxiv.org

abstract arxiv augment automated +18

Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation 21 hours ago | arxiv.org

arxiv cs.cv eess.iv embedding +5

KI-PMF: Knowledge Integrated Plausible Motion Forecasting 21 hours ago | arxiv.org

abstract actors arxiv autonomous +18

Adaptive Landmark Color for AUV Docking in Visually Dynamic Environments 21 hours ago | arxiv.org

abstract arxiv autonomous batteries +15

OccupancyDETR: Using DETR for Mixed Dense-sparse 3D Occupancy Prediction 21 hours ago | arxiv.org

abstract arxiv autonomous autonomous vehicles +21

Multimodal Chain-of-Thought Reasoning in Language Models 21 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +7

Robust Self-Tuning Data Association for Geo-Referencing Using Lane Markings 21 hours ago | arxiv.org

abstract advantages aerial arxiv +16

Surrogate-based cross-correlation for particle image velocimetry 21 hours ago | arxiv.org

arxiv correlation cs.cv eess.iv +5

Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos 21 hours ago | arxiv.org

abstract action action recognition advance +20

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

.NET Software Engineer (AI Focus)

@ Boskalis | Papendrecht, Netherlands

View on ai-jobs.net