Playing to Vision Foundation Model's Strengths in Stereo Matching | allainews.com

April 10, 2024, 4:45 a.m. | Chuang-Wei Liu, Qijun Chen, Rui Fan

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.06261v1 Announce Type: new
Abstract: Stereo matching has become a key technique for 3D environment perception in intelligent vehicles. For a considerable time, convolutional neural networks (CNNs) have remained the mainstream choice for feature extraction in this domain. Nonetheless, there is a growing consensus that the existing paradigm should evolve towards vision foundation models (VFM), particularly those developed based on vision Transformers (ViTs) and pre-trained through self-supervision on extensive, unlabeled datasets. While VFMs are adept at extracting informative, general-purpose visual …

abstract arxiv become cnns consensus convolutional neural networks cs.ai cs.cv cs.ro domain environment extraction feature feature extraction foundation foundation model intelligent key networks neural networks paradigm perception playing type vehicles vision

More from arxiv.org / cs.CV updates on arXiv.org

NOLA: Compressing LoRA using Linear Combination of Random Basis 13 hours ago | arxiv.org

arxiv combination cs.cl cs.cv +4

ReWiTe: Realistic Wide-angle and Telephoto Dual Camera Fusion Dataset via Beam Splitter Camera Rig 13 hours ago | arxiv.org

abstract arxiv become cs.cv +7

An Effective Image Copy-Move Forgery Detection Using Entropy Information 13 hours ago | arxiv.org

abstract academic algorithms arxiv +20

SimAC: A Simple Anti-Customization Method for Protecting Face Privacy against Text-to-Image Synthesis of Diffusion Models 13 hours ago | arxiv.org

arxiv cs.cv customization diffusion +9

SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification 13 hours ago | arxiv.org

arxiv cs.cv dataset identification +1

Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading 13 hours ago | arxiv.org

abstract arxiv cs.ai cs.cv +17

Conditioning Generative Latent Optimization for Sparse-View CT Image Reconstruction 13 hours ago | arxiv.org

abstract arxiv benefit cs.cv +17

Fast and Accurate Unknown Object Instance Segmentation through Error-Informed Refinement 13 hours ago | arxiv.org

abstract arxiv autonomous autonomous robots +17

Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation 13 hours ago | arxiv.org

abstract arxiv challenge cs.cv +10

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Associate Data Engineer

@ Nominet | Oxford/ Hybrid, GB

View on ai-jobs.net

Data Science Senior Associate

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India

View on ai-jobs.net