Oct. 5, 2022, 4:37 p.m. | /u/mishtimoi

Machine Learning www.reddit.com

Hi,

So I have an empirical observation: when I train a large model vs. the same model in a staggered fashion, i.e. some layers are frozen and others receive gradient updates, the latter takes more training time even though the number of trainable parameters is smaller. This leads me to suspect that the detach() operation is the culprit. I cannot find many resources online to help me understand the time complexity of the detach() operation in torch. Did anyone …
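To make the setup concrete, here is a minimal sketch of what I mean by staggered training (the toy model, layer split, and sizes are illustrative, not my actual setup). It shows the two usual ways of freezing a prefix in PyTorch: turning off requires_grad on the frozen parameters, versus detaching the intermediate activation before the trainable tail. detach() itself only returns a view of the same storage with no autograd history, so no data is copied.

```python
import time
import torch
import torch.nn as nn

# Toy model standing in for the real one (illustrative sizes).
model = nn.Sequential(
    nn.Linear(1024, 1024), nn.ReLU(),
    nn.Linear(1024, 1024), nn.ReLU(),
    nn.Linear(1024, 10),
)

x = torch.randn(256, 1024)
target = torch.randint(0, 10, (256,))
loss_fn = nn.CrossEntropyLoss()

def step_freeze_requires_grad():
    # Freeze the first two Linear layers by turning off requires_grad;
    # autograd then skips recording grad nodes for those weights.
    for layer in (model[0], model[2]):
        for p in layer.parameters():
            p.requires_grad_(False)
    out = model(x)
    loss_fn(out, target).backward()

def step_freeze_detach():
    # Alternative: run the frozen prefix without building a graph, detach
    # its output, and only backprop through the trainable tail.
    with torch.no_grad():
        h = model[:4](x)
    out = model[4](h.detach())
    loss_fn(out, target).backward()

# Crude wall-clock comparison on CPU; on GPU you would need
# torch.cuda.synchronize() around the timing calls.
for step in (step_freeze_requires_grad, step_freeze_detach):
    model.zero_grad(set_to_none=True)
    t0 = time.perf_counter()
    step()
    print(step.__name__, f"{time.perf_counter() - t0:.4f}s")
```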

complexity machinelearning
