April 1, 2024, 4:42 a.m. | Haikuo Shao, Huihong Shi, Wendong Mao, Zhongfeng Wang

cs.LG updates on arXiv.org

arXiv:2403.20230v1 Announce Type: cross
Abstract: Vision Transformers (ViTs) have achieved significant success in computer vision. However, their intensive computation and massive memory footprint challenge ViTs' deployment on embedded devices, calling for efficient ViTs. Among them, the state-of-the-art EfficientViT features a Convolution-Transformer hybrid architecture, enhancing both accuracy and hardware efficiency. Unfortunately, existing accelerators cannot fully exploit the hardware benefits of EfficientViT due to its unique architecture. In this paper, we propose an FPGA-based accelerator for EfficientViT to advance the hardware …
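Since the full abstract is truncated, the sketch below only illustrates the kind of Convolution-Transformer hybrid block the abstract refers to: a local convolutional token mixer followed by lightweight global self-attention. It is a minimal, generic sketch, not the paper's EfficientViT implementation or its FPGA mapping; the class name, layer choices, and sizes are illustrative assumptions.

```python
# Minimal sketch of a Convolution-Transformer hybrid block (illustrative only;
# not the EfficientViT architecture from the paper).
import torch
import torch.nn as nn


class HybridBlock(nn.Module):
    def __init__(self, dim: int = 64, heads: int = 4):
        super().__init__()
        # Convolutional stage: local feature mixing with a depthwise + pointwise conv.
        self.dwconv = nn.Conv2d(dim, dim, kernel_size=3, padding=1, groups=dim)
        self.pwconv = nn.Conv2d(dim, dim, kernel_size=1)
        # Transformer stage: global token mixing with multi-head self-attention.
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, height, width)
        x = x + self.pwconv(self.dwconv(x))        # local (convolution) path
        b, c, h, w = x.shape
        tokens = x.flatten(2).transpose(1, 2)      # (batch, h*w, channels)
        tokens = self.norm(tokens)
        attn_out, _ = self.attn(tokens, tokens, tokens)
        tokens = tokens + attn_out                 # global (attention) path
        return tokens.transpose(1, 2).reshape(b, c, h, w)


if __name__ == "__main__":
    block = HybridBlock()
    out = block(torch.randn(1, 64, 14, 14))
    print(out.shape)  # torch.Size([1, 64, 14, 14])
```

The point relevant to the abstract is that such a block interleaves convolutional and attention-style computation, which is why an accelerator designed for only one of the two workload types cannot fully exploit the hybrid architecture.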
