April 1, 2024, 4:42 a.m. | Haikuo Shao, Huihong Shi, Wendong Mao, Zhongfeng Wang

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.20230v1 Announce Type: cross
Abstract: Vision Transformers (ViTs) have achieved significant success in computer vision. However, their intensive computations and massive memory footprint challenge ViTs' deployment on embedded devices, calling for efficient ViTs. Among them, EfficientViT, the state-of-the-art one, features a Convolution-Transformer hybrid architecture, enhancing both accuracy and hardware efficiency. Unfortunately, existing accelerators cannot fully exploit the hardware benefits of EfficientViT due to its unique architecture. In this paper, we propose an FPGA-based accelerator for EfficientViT to advance the hardware …

abstract accelerator accuracy architecture art arxiv challenge computer computer vision convolution cs.ar cs.lg deployment devices efficiency efficientvit embedded embedded devices features fpga hardware however hybrid massive memory reconfigurable state success them transformer transformers type vision vision transformers

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US