May 15, 2023, 12:47 a.m. | Xuan He, Fan Yang, Jiacheng Lin, Haolong Fu, Jin Yuan, Kailun Yang, Zhiyong Li

cs.CV updates on arXiv.org arxiv.org

Transformer-based methods have demonstrated superior performance for
monocular 3D object detection recently, which predicts 3D attributes from a
single 2D image. Most existing transformer-based methods leverage visual and
depth representations to explore valuable query points on objects, and the
quality of the learned queries has a great impact on detection accuracy.
Unfortunately, existing unsupervised attention mechanisms in transformer are
prone to generate low-quality query features due to inaccurate receptive
fields, especially on hard objects. To tackle this problem, this paper …

2d image arxiv detection image impact objects performance quality query scale ssd transformer

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Research Scientist

@ Meta | Menlo Park, CA

Principal Data Scientist

@ Mastercard | O'Fallon, Missouri (Main Campus)