Jan. 21, 2022, 2:10 a.m. | Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou

cs.CL updates on arXiv.org arxiv.org

Temporal sentence grounding in videos (TSGV), a.k.a., natural language video
localization (NLVL) or video moment retrieval (VMR), aims to retrieve a
temporal moment that semantically corresponds to a language query from an
untrimmed video. Connecting computer vision and natural language, TSGV has
drawn significant attention from researchers in both communities. This survey
attempts to provide a summary of fundamental concepts in TSGV and current
research status, as well as future research directions. As the background, we
present a common structure …

arxiv cv future survey videos

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Integration Specialist

@ Accenture Federal Services | San Antonio, TX

Geospatial Data Engineer - Location Intelligence

@ Allegro | Warsaw, Poland

Site Autonomy Engineer (Onsite)

@ May Mobility | Tokyo, Japan

Summer Intern, AI (Artificial Intelligence)

@ Nextech Systems | Tampa, FL

Permitting Specialist/Wetland Scientist

@ AECOM | Chelmsford, MA, United States