March 14, 2024, 4:45 a.m. | Sota Moriyama, Koji Watanabe, Katsumi Inoue, Akihiro Takemura

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.07885v1 Announce Type: new
Abstract: We introduce MOD-CL, a multi-label object detection framework that utilizes constrained loss in the training process to produce outputs that better satisfy the given requirements. In this paper, we use $\mathrm{MOD_{YOLO}}$, a multi-label object detection model built upon the state-of-the-art object detection model YOLOv8, which has been published in recent years. In Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate …

abstract art arxiv cs.ai cs.cv detection framework loss object paper process requirements state training type yolo yolov8

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne