March 19, 2024, 4:49 a.m. | Chaolei Tan, Jianhuang Lai, Wei-Shi Zheng, Jian-Fang Hu

cs.CV updates on arXiv.org

arXiv:2403.11463v1 Announce Type: new
Abstract: Video Paragraph Grounding (VPG) is an emerging task in video-language understanding, which aims at localizing multiple sentences with semantic relations and temporal order in an untrimmed video. However, existing VPG approaches rely heavily on a considerable number of temporal labels that are laborious and time-consuming to acquire. In this work, we introduce and explore Weakly-Supervised Video Paragraph Grounding (WSVPG) to eliminate the need for temporal annotations. Different from previous weakly-supervised grounding frameworks based on …
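To make the task setup concrete, here is a minimal sketch of the data involved in VPG versus its weakly-supervised variant, as described in the abstract. This is only an illustration of the supervision difference, not the paper's method; all names (`VPGExample`, `SentenceSpan`, `to_weakly_supervised`) and the example values are hypothetical.

```python
# Illustrative sketch of VPG data and the weak-supervision setting (WSVPG).
# Hypothetical names and values; not taken from the paper.
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SentenceSpan:
    """One sentence of the paragraph query and its grounded temporal segment."""
    sentence: str
    start_sec: float  # segment start in the untrimmed video
    end_sec: float    # segment end


@dataclass
class VPGExample:
    """A fully supervised VPG sample: video + ordered sentences + temporal labels."""
    video_id: str
    sentences: List[str]                  # semantically related, temporally ordered
    segments: List[Tuple[float, float]]   # one (start_sec, end_sec) per sentence


def to_weakly_supervised(example: VPGExample) -> Tuple[str, List[str]]:
    """WSVPG drops the temporal labels: only the video-paragraph pairing
    remains available for training."""
    return example.video_id, example.sentences


if __name__ == "__main__":
    ex = VPGExample(
        video_id="v_0001",
        sentences=["A man enters the kitchen.", "He starts chopping vegetables."],
        segments=[(0.0, 4.2), (4.2, 12.8)],
    )
    # In the weakly-supervised setting, the (start, end) labels are unavailable.
    print(to_weakly_supervised(ex))
```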

