April 10, 2024, 4:45 a.m. | Chuang-Wei Liu, Qijun Chen, Rui Fan

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.06261v1 Announce Type: new
Abstract: Stereo matching has become a key technique for 3D environment perception in intelligent vehicles. For a considerable time, convolutional neural networks (CNNs) have remained the mainstream choice for feature extraction in this domain. Nonetheless, there is a growing consensus that the existing paradigm should evolve towards vision foundation models (VFM), particularly those developed based on vision Transformers (ViTs) and pre-trained through self-supervision on extensive, unlabeled datasets. While VFMs are adept at extracting informative, general-purpose visual …

abstract arxiv become cnns consensus convolutional neural networks cs.ai cs.cv cs.ro domain environment extraction feature feature extraction foundation foundation model intelligent key networks neural networks paradigm perception playing type vehicles vision

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Associate Data Engineer

@ Nominet | Oxford/ Hybrid, GB

Data Science Senior Associate

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India