Bootstrapping SparseFormers from Vision Foundation Models

April 5, 2024, 4:45 a.m. | Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou

cs.CV updates on arXiv.org arxiv.org

arXiv:2312.01987v2
Abstract: The recently proposed SparseFormer architecture offers an alternative approach to visual understanding: it uses significantly fewer visual tokens by adjusting RoIs, which greatly reduces computational cost while still achieving promising performance. However, training SparseFormers from scratch remains expensive, and scaling up the number of parameters can be challenging. In this paper, we propose to bootstrap SparseFormers from ViT-based vision foundation models in a simple and efficient way. Since the majority of SparseFormer …
