March 3, 2024, 11:15 p.m. | /u/Crazy_Suspect_9512

Machine Learning www.reddit.com

In industrial large scale search/recommendation context, DCN seems to be still a popular kid on the street, compared to more straightforward MLP. The idea feels more or less like ResNet, where the original raw input keeps showing up at every layer. But is the element-wise product of layer output of raw input really adding any value here? If so why is it not adopted by the transformer architecture?

context element every good industrial kid layer machinelearning mlp popular product raw recommendation resnet scale search street wise

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Machine Learning Engineer

@ Samsara | Canada - Remote