May 27, 2024, 4:42 a.m. | Duke Nguyen, Aditya Joshi, Flora Salim

cs.LG updates on arXiv.org

arXiv:2405.15310v1 Announce Type: new
Abstract: Linearizing attention with various kernel approximation and kernel learning techniques has shown promise. Previous methods explore only a subset of the possible combinations of component functions and weight matrices within the random features paradigm. We identify the need for a systematic comparison of the different combinations of weight matrix and component function for attention learning in the Transformer. In this work, we introduce Spectraformer, a unified framework for approximating and learning the kernel function in linearized attention of the …
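For context, here is a minimal sketch of the general random-features recipe the abstract refers to, not the paper's Spectraformer implementation: a feature map phi(x) = f(x W^T), built from a weight matrix W and an elementwise component function f, stands in for the softmax kernel so that attention can be computed in time linear rather than quadratic in sequence length. All function names, shapes, and the choice of f below are illustrative assumptions.

```python
import numpy as np

def feature_map(x, W, f):
    """phi(x) = f(x W^T) / sqrt(m): a random-feature map built from a
    weight matrix W of shape (m, d) and an elementwise component function f."""
    return f(x @ W.T) / np.sqrt(W.shape[0])

def linearized_attention(Q, K, V, W, f):
    """Approximate attention output phi(Q) (phi(K)^T V), row-normalized by
    phi(Q) phi(K)^T 1. Computing phi(K)^T V first keeps the cost linear in n."""
    phi_q = feature_map(Q, W, f)                    # (n, m)
    phi_k = feature_map(K, W, f)                    # (n, m)
    out = phi_q @ (phi_k.T @ V)                     # (n, d), never forms n x n
    norm = phi_q @ phi_k.sum(axis=0)[:, None]       # (n, 1) normalizer
    return out / (norm + 1e-6)

# Illustrative usage: a random Gaussian weight matrix with an exponential
# component function, which keeps the features (and normalizer) positive.
rng = np.random.default_rng(0)
n, d, m = 8, 16, 64
Q, K, V = rng.normal(size=(3, n, d))
W = rng.normal(size=(m, d))
attn = linearized_attention(Q, K, V, W, np.exp)
print(attn.shape)  # (8, 16)
```

Different choices of W (Gaussian, orthogonal, learned) and f (exp, trigonometric, learned) yield different kernel approximations; systematically comparing such combinations is the gap the abstract identifies.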
