April 23, 2024, 4:47 a.m. | Zhangjing Yang, Dun Liu, Wensheng Cheng, Jinqiao Wang, Yi Wu

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.13863v1 Announce Type: new
Abstract: Labeling pixel-wise object masks in videos is a resource-intensive and laborious process. Box-supervised Video Instance Segmentation (VIS) methods have emerged as a viable solution to mitigate the labor-intensive annotation process. . In practical applications, the two-step approach is not only more flexible but also exhibits a higher recognition accuracy. Inspired by the recent success of Segment Anything Model (SAM), we introduce a novel approach that aims at harnessing instance box annotations from multiple perspectives to …

abstract accuracy annotation applications arxiv box cs.cv instance labeling labor masks object performance pixel practical process recognition segmentation solution type video video instance segmentation videos wise

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne