Spectraformer: A Unified Random Feature Framework for Transformer

May 27, 2024, 4:42 a.m. | Duke Nguyen, Aditya Joshi, Flora Salim

cs.LG updates on arXiv.org (arxiv.org)

arXiv:2405.15310v1 Announce Type: new
Abstract: Linearization of attention using various kernel approximation and kernel learning techniques has shown promise. Past methods use only a subset of the possible combinations of component functions and weight matrices within the random features paradigm. We identify the need for a systematic comparison of different combinations of weight matrices and component functions for attention learning in the Transformer. In this work, we introduce Spectraformer, a unified framework for approximating and learning the kernel function in linearized attention of the …
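
To make the random features paradigm concrete, here is a minimal NumPy sketch of linearized attention built from one particular pairing of weight matrix and component function. It is an illustration only and not Spectraformer's own method: it assumes a Gaussian weight matrix and the positive exponential component function popularized by Performer-style attention. The names `positive_random_features`, `linear_attention`, and `softmax_attention` are hypothetical helpers chosen for this example.

```python
import numpy as np

def positive_random_features(x, w):
    """Map rows of x (n, d) to m positive random features.

    Assumed component function: exp(w.x - |x|^2 / 2) / sqrt(m),
    with w (m, d) the weight matrix (Gaussian in this sketch).
    """
    m = w.shape[0]
    proj = x @ w.T                                         # (n, m)
    half_sq_norm = 0.5 * np.sum(x ** 2, axis=-1, keepdims=True)
    return np.exp(proj - half_sq_norm) / np.sqrt(m)

def linear_attention(q, k, v, w):
    """Approximate softmax attention without forming the (n, n) score matrix."""
    scale = q.shape[-1] ** -0.25                           # splits the 1/sqrt(d) factor between q and k
    q_f = positive_random_features(q * scale, w)           # (n, m)
    k_f = positive_random_features(k * scale, w)           # (n, m)
    kv = k_f.T @ v                                         # (m, d_v), cost linear in n
    normalizer = q_f @ k_f.sum(axis=0)                     # (n,)
    return (q_f @ kv) / normalizer[:, None]

def softmax_attention(q, k, v):
    """Exact softmax attention, used here only as a reference."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
n, d, m = 16, 8, 4096                                      # sequence length, head dim, feature count
q = 0.5 * rng.normal(size=(n, d))
k = 0.5 * rng.normal(size=(n, d))
v = rng.normal(size=(n, d))
w = rng.normal(size=(m, d))                                # assumed Gaussian weight matrix

approx = linear_attention(q, k, v, w)
exact = softmax_attention(q, k, v)
max_err = np.abs(approx - exact).max()
print(f"max abs difference from exact attention: {max_err:.4f}")
```

Swapping `w` for a different weight matrix, or the exponential for a different component function, is the kind of combination the abstract says needs systematic comparison. The sketch fixes a single combination and makes no claim about which one performs best.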


