April 1, 2024, 4:45 a.m. | Isabel Funke, Dominik Rivoir, Stefanie Krell, Stefanie Speidel

cs.CV updates on arXiv.org arxiv.org

arXiv:2307.09997v3 Announce Type: replace
Abstract: To enable context-aware computer assistance in the operating room of the future, cognitive systems need to understand automatically which surgical phase is being performed by the medical team. The primary source of information for surgical phase recognition is typically video, which presents two challenges: extracting meaningful features from the video stream and effectively modeling temporal information in the sequence of visual features. For temporal modeling, attention mechanisms have gained popularity due to their ability to …

abstract arxiv attention challenges cognitive computer context cs.cv future information medical recognition room self-attention systems team temporal type video

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. Software Development Manager, AWS Neuron Machine Learning Distributed Training

@ Amazon.com | Cupertino, California, USA