Sept. 19, 2022, 1:14 a.m. | Wanrong Zhu, Bo Pang, Ashish V. Thapliyal, William Yang Wang, Radu Soricut

cs.CV updates on arXiv.org arxiv.org

Dense video captioning aims to identify the events of interest in an input
video and to generate a descriptive caption for each event. Previous approaches
usually follow a two-stage generative process, which first proposes a segment
for each event, then renders a caption for each identified segment. Recent
advances in large-scale sequence generation pretraining have been highly successful
in unifying the task formulation for a wide variety of tasks, but so far, more
complex tasks such as dense video captioning are not able …

