March 19, 2024, 4:50 a.m. | Haoxiang Ma, Ran Qin, Modi shi, Boyang Gao, Di Huang

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.11511v1 Announce Type: cross
Abstract: This paper focuses on the sim-to-real issue of RGB-D grasp detection and formulates it as a domain adaptation problem. In this case, we present a global-to-local method to address hybrid domain gaps in RGB and depth data and insufficient multi-modal feature alignment. First, a self-supervised rotation pre-training strategy is adopted to deliver robust initialization for RGB and depth networks. We then propose a global-to-local alignment pipeline with individual global domain classifiers for scene features of …

abstract alignment arxiv case cs.cv cs.ro data detection domain domain adaptation feature global hybrid issue modal multi-modal paper pre-training rgb-d rotation sim strategy training type

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Applied Scientist

@ Microsoft | Redmond, Washington, United States

Data Analyst / Action Officer

@ OASYS, INC. | OASYS, INC., Pratt Avenue Northwest, Huntsville, AL, United States