Playing to Vision Foundation Model's Strengths in Stereo Matching
April 10, 2024, 4:45 a.m. | Chuang-Wei Liu, Qijun Chen, Rui Fan
cs.CV updates on arXiv.org | arxiv.org
Abstract: Stereo matching has become a key technique for 3D environment perception in intelligent vehicles. For a considerable time, convolutional neural networks (CNNs) have remained the mainstream choice for feature extraction in this domain. Nonetheless, there is a growing consensus that the existing paradigm should evolve towards vision foundation models (VFMs), particularly those developed based on vision Transformers (ViTs) and pre-trained through self-supervision on extensive, unlabeled datasets. While VFMs are adept at extracting informative, general-purpose visual …