April 17, 2024, 2:49 p.m. | /u/shuvamg007

Machine Learning www.reddit.com

I looked up in so many places but couldn't find an answer. What happens if we switch Q and K to be from the encoder and decoder respectively? Would it make any difference?

attention decoder difference encoder machinelearning

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Intelligence Manager

@ Sanofi | Budapest

Principal Engineer, Data (Hybrid)

@ Homebase | Toronto, Ontario, Canada