Recipe Generation from Unsegmented Cooking Videos

Feb. 20, 2024, 5:48 a.m. | Taichi Nishimura, Atsushi Hashimoto, Yoshitaka Ushiku, Hirotaka Kameko, Shinsuke Mori

cs.CV updates on arXiv.org

arXiv:2209.10134v2 Announce Type: replace-cross
Abstract: This paper tackles recipe generation from unsegmented cooking videos, a task that requires agents to (1) extract key events in completing the dish and (2) generate sentences for the extracted events. Our task is similar to dense video captioning (DVC), which aims at detecting events thoroughly and generating sentences for them. However, unlike DVC, in recipe generation, recipe story awareness is crucial, and a model should extract an appropriate number of events in the correct …
