NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator
May 8, 2024, 4:42 a.m. | Mohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh
cs.LG updates on arXiv.org
Abstract: Attention mechanisms are becoming increasingly popular and are used in neural network models across multiple domains, such as natural language processing (NLP) and vision applications, especially at the edge. However, attention layers are difficult to map onto existing neuro accelerators because they have a much higher density of non-linear operations, which leads to inefficient utilization of today's vector units. This work introduces NOVA, a NoC-based Vector Unit that can perform non-linear operations within the NoC of …