all AI news
The Elements of Temporal Sentence Grounding in Videos: A Survey and Future Directions. (arXiv:2201.08071v1 [cs.CV])
Jan. 21, 2022, 2:10 a.m. | Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou
cs.CL updates on arXiv.org arxiv.org
Temporal sentence grounding in videos (TSGV), a.k.a., natural language video
localization (NLVL) or video moment retrieval (VMR), aims to retrieve a
temporal moment that semantically corresponds to a language query from an
untrimmed video. Connecting computer vision and natural language, TSGV has
drawn significant attention from researchers in both communities. This survey
attempts to provide a summary of fundamental concepts in TSGV and current
research status, as well as future research directions. As the background, we
present a common structure …
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Data Integration Specialist
@ Accenture Federal Services | San Antonio, TX
Geospatial Data Engineer - Location Intelligence
@ Allegro | Warsaw, Poland
Site Autonomy Engineer (Onsite)
@ May Mobility | Tokyo, Japan
Summer Intern, AI (Artificial Intelligence)
@ Nextech Systems | Tampa, FL
Permitting Specialist/Wetland Scientist
@ AECOM | Chelmsford, MA, United States