Nov. 24, 2022, 7:17 a.m. | Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

cs.CV updates on arXiv.org arxiv.org

Current perception models in autonomous driving greatly rely on large-scale
labeled 3D data. However, it is expensive and time-consuming to annotate 3D
data. In this work, we aim at facilitating research on self-supervised learning
from the vast unlabeled 3D data in autonomous driving. We introduce a masked
autoencoding framework for pre-training large-scale point clouds, dubbed
Voxel-MAE. We take advantage of the geometric characteristics of large-scale
point clouds, and propose the range-aware random masking strategy and binary
voxel classification task. Specifically, …

arxiv pre-training scale training voxel

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States